Morpheme-based Korean Word Vector Generation Considering the Subword and Part-Of-Speech Information 


Vol. 47,  No. 4, pp. 395-403, Apr.  2020
10.5626/JOK.2020.47.4.395


PDF

  Abstract

Word vectors enable finding the relationship between words by vector computation. They are also widely used as pre-trained data for high-level neural network programs. Various modified models from English models have been proposed for the generation of Korean word vectors, with various segmentation units such as Eojeol(word phrase), morpheme, syllable and Jaso. In this study, we propose Korean word vector generation methods that segment Eojeol into morphemes and convert them into subwords comprising either syllable or Jaso. We also propose methods using Part-Of-Speech tags provided in the pre-processing to reflect semantic and syntactic information regarding the morphemes. Intrinsic and extrinsic experiments showed that the method using morpheme segments with Jaso subwords and additional Part-Of-Speech tags showed better performance than others under the condition that the target data are normal text and not as grammatically incorrect.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

J. Youn and J. S. Lee, "Morpheme-based Korean Word Vector Generation Considering the Subword and Part-Of-Speech Information," Journal of KIISE, JOK, vol. 47, no. 4, pp. 395-403, 2020. DOI: 10.5626/JOK.2020.47.4.395.


[ACM Style]

Junyoung Youn and Jae Sung Lee. 2020. Morpheme-based Korean Word Vector Generation Considering the Subword and Part-Of-Speech Information. Journal of KIISE, JOK, 47, 4, (2020), 395-403. DOI: 10.5626/JOK.2020.47.4.395.


[KCI Style]

윤준영, 이재성, "부분단어와 품사 태깅 정보를 활용한 형태소 기반의 한국어 단어 벡터 생성," 한국정보과학회 논문지, 제47권, 제4호, 395~403쪽, 2020. DOI: 10.5626/JOK.2020.47.4.395.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr