Defining Chunks and Chunking using Its Corpus and Bi-LSTM/CRFs in Korean 


Vol. 47, No. 6, pp. 587-595, Jun. 2020
10.5626/JOK.2020.47.6.587



  Abstract

Korean dependency parsing suffers from notorious problems such as the head position problem and the constituent unit problem. Such problems can be partly resolved by chunking. Chunking locates constituents, referred to as chunks, and classifies them into predefined categories. So far, several studies on Korean chunking have been conducted without a fully clear definition of chunks. We therefore define chunks in Korean thoroughly, build a chunk-tagged corpus based on that definition, and propose a Bi-LSTM/CRF chunking model using the corpus. Experiments show that the proposed model achieves an F1-score of 98.54% and can be used in practical applications. We also analyzed how performance varies with the word embedding, and fastText showed the best performance. We performed an error analysis so that its results can be used to improve the proposed model in the near future.
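As an illustration of what "locating and classifying chunks" and a chunk-level F1-score can mean in practice, the following is a minimal sketch. It assumes the tagger emits BIO-style tags (B- opens a chunk, I- continues it, O is outside any chunk); the abstract does not state the tagging scheme or the tag set, so the scheme, the labels NX and PX, and the function names extract_chunks and chunk_f1 are all hypothetical.

```python
from typing import List, Tuple

# (label, start index, end index exclusive); hypothetical representation
Chunk = Tuple[str, int, int]


def extract_chunks(tags: List[str]) -> List[Chunk]:
    """Recover labeled spans from an assumed BIO tag sequence."""
    chunks: List[Chunk] = []
    label, start = None, 0
    for i, tag in enumerate(tags + ["O"]):  # trailing "O" flushes the last chunk
        prefix, _, name = tag.partition("-")
        # An open chunk ends on "O", on a new "B-", or when the label changes.
        if label is not None and (prefix != "I" or name != label):
            chunks.append((label, start, i))
            label = None
        # Open a chunk on "B-"; a stray "I-" is treated leniently as a start.
        if prefix == "B" or (prefix == "I" and label is None):
            label, start = name, i
    return chunks


def chunk_f1(gold: List[str], pred: List[str]) -> float:
    """Chunk-level F1 for one sentence: a chunk counts only on an exact span and label match."""
    g, p = set(extract_chunks(gold)), set(extract_chunks(pred))
    tp = len(g & p)
    if tp == 0:
        return 0.0
    precision, recall = tp / len(p), tp / len(g)
    return 2 * precision * recall / (precision + recall)


gold = ["B-NX", "I-NX", "O", "B-PX"]  # hypothetical labels
pred = ["B-NX", "I-NX", "O", "O"]
print(extract_chunks(gold))
print(chunk_f1(gold, pred))
```

In this toy sentence the prediction recovers one of the two gold chunks, so precision is 1, recall is 0.5, and F1 is 2/3. A corpus-level score such as the one reported in the abstract would typically pool chunk counts over all sentences rather than average per-sentence values, which is again an assumption about the evaluation protocol.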




  Cite this article

[IEEE Style]

Y. Namgoong, C. Kim, M. Cheon, H. Park, H. Yoon, M. Choi, J. Kim, J. Kim, "Defining Chunks and Chunking using Its Corpus and Bi-LSTM/CRFs in Korean," Journal of KIISE, JOK, vol. 47, no. 6, pp. 587-595, 2020. DOI: 10.5626/JOK.2020.47.6.587.


[ACM Style]

Young Namgoong, Chang-Hyun Kim, Min-ah Cheon, Ho-min Park, Ho Yoon, Min-seok Choi, Jae-kyun Kim, and Jae-Hoon Kim. 2020. Defining Chunks and Chunking using Its Corpus and Bi-LSTM/CRFs in Korean. Journal of KIISE, JOK, 47, 6, (2020), 587-595. DOI: 10.5626/JOK.2020.47.6.587.


[KCI Style]

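Young Namgoong, Chang-Hyun Kim, Min-ah Cheon, Ho-min Park, Ho Yoon, Min-seok Choi, Jae-kyun Kim, Jae-Hoon Kim, "Definition of Korean Chunks and Chunking: Using a Korean Chunk-Tagged Corpus and a Bi-LSTM/CRFs Model," Journal of KIISE, Vol. 47, No. 6, pp. 587~595, 2020. DOI: 10.5626/JOK.2020.47.6.587.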









Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal
