Analyzing Semantic Relations of Word Vectors trained by The Word2vec Model 


Vol. 46,  No. 10, pp. 1088-1093, Oct.  2019
10.5626/JOK.2019.46.10.1088


PDF

  Abstract

As the usage of artificial intelligence (AI) in natural language processing has increased, the importance of word embedding has grown significantly. This paper qualitatively analyzes the representational capability of word2vec models to structure semantic relation in terms of antonymy and hyponymy based on clustering characteristics and t-SNE distribution. To this end, a K-means clustering algorithm was applied to a set of words drawn from 10 categories. Some words in antonymy are found not to be embedded properly. This is attributed to the fact that they typically have many common attributes with a very few opposite ones. It is also observed that words in hyponymy are not properly embedded at all. This can be attributed to the fact that the hyponymic relations of those words are based on the information gathered through a learning process of a knowledge system, as opposed to a natural process of language acquisition. Thus, it appears that word2vec models based on the distributional hypothesis are limited to representing certain antonymic relations and do not properly represent hyponymic relations at all.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

H. Kang and J. Yang, "Analyzing Semantic Relations of Word Vectors trained by The Word2vec Model," Journal of KIISE, JOK, vol. 46, no. 10, pp. 1088-1093, 2019. DOI: 10.5626/JOK.2019.46.10.1088.


[ACM Style]

Hyungsuc Kang and Janghoon Yang. 2019. Analyzing Semantic Relations of Word Vectors trained by The Word2vec Model. Journal of KIISE, JOK, 46, 10, (2019), 1088-1093. DOI: 10.5626/JOK.2019.46.10.1088.


[KCI Style]

강형석, 양장훈, "Word2vec 모델로 학습된 단어 벡터의 의미 관계 분석," 한국정보과학회 논문지, 제46권, 제10호, 1088~1093쪽, 2019. DOI: 10.5626/JOK.2019.46.10.1088.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr