News Topic Extraction based on Word Similarity 


Vol. 44,  No. 11, pp. 1138-1148, Nov.  2017
10.5626/JOK.2017.44.11.1138


PDF

  Abstract

Topic extraction is a technology that automatically extracts a set of topics from a set of documents, and this has been a major research topic in the area of natural language processing. Representative topic extraction methods include Latent Dirichlet Allocation (LDA) and word clustering-based methods. However, there are problems with these methods, such as repeated topics and mixed topics. The problem of repeated topics is one in which a specific topic is extracted as several topics, while the problem of mixed topic is one in which several topics are mixed in a single extracted topic. To solve these problems, this study proposes a method to extract topics using an LDA that is robust against the problem of repeated topic, going through the steps of separating and merging the topics using the similarity between words to correct the extracted topics. As a result of the experiment, the proposed method showed better performance than the conventional LDA method.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

D. Jin and S. Lee, "News Topic Extraction based on Word Similarity," Journal of KIISE, JOK, vol. 44, no. 11, pp. 1138-1148, 2017. DOI: 10.5626/JOK.2017.44.11.1138.


[ACM Style]

Dongxu Jin and Soowon Lee. 2017. News Topic Extraction based on Word Similarity. Journal of KIISE, JOK, 44, 11, (2017), 1138-1148. DOI: 10.5626/JOK.2017.44.11.1138.


[KCI Style]

김동욱, 이수원, "단어 유사도를 이용한 뉴스 토픽 추출," 한국정보과학회 논문지, 제44권, 제11호, 1138~1148쪽, 2017. DOI: 10.5626/JOK.2017.44.11.1138.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr