Performance Analysis of Korean Morphological Analyzer based on Transformer and BERT 


Vol. 47,  No. 8, pp. 730-741, Aug.  2020
10.5626/JOK.2020.47.8.730


PDF

  Abstract

This paper introduces a Korean morphological analyzer using the Transformer, which is one of the most popular sequence-to-sequence deep neural models. The Transformer comprises an encoder and a decoder. The encoder compresses a raw input sentence into a fixed-size vector, while the decoder generates a morphological analysis result for the vector. We also replace the encoder with BERT, a pre-trained language representation model. An attention mechanism and a copying mechanism are integrated in the decoder. The processing units of the encoder and the decoder are eojeol-based WordPiece and morpheme-based WordPiece, respectively. Experimental results showed that the Transformer with fine-tuned BERT outperforms the randomly initialized Transformer by 2.9% in the F1 score. We also investigated the effects of the WordPiece embedding on morphological analysis when they are not fully updated in the training phases.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

Y. Choi and K. J. Lee, "Performance Analysis of Korean Morphological Analyzer based on Transformer and BERT," Journal of KIISE, JOK, vol. 47, no. 8, pp. 730-741, 2020. DOI: 10.5626/JOK.2020.47.8.730.


[ACM Style]

Yongseok Choi and Kong Joo Lee. 2020. Performance Analysis of Korean Morphological Analyzer based on Transformer and BERT. Journal of KIISE, JOK, 47, 8, (2020), 730-741. DOI: 10.5626/JOK.2020.47.8.730.


[KCI Style]

최용석, 이공주, "트랜스포머와 BERT로 구현한 한국어 형태소 분석기의 성능 분석," 한국정보과학회 논문지, 제47권, 제8호, 730~741쪽, 2020. DOI: 10.5626/JOK.2020.47.8.730.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr