Inverse Document Frequency-Based Word Embedding of Unseen Words for Question Answering Systems 


Vol. 43,  No. 8, pp. 902-909, Aug.  2016


PDF

  Abstract

Question answering system (QA system) is a system that finds an actual answer to the question posed by a user, whereas a typical search engine would only find the links to the relevant documents. Recent works related to the open domain QA systems are receiving much attention in the fields of natural language processing, artificial intelligence, and data mining. However, the prior works on QA systems simply replace all words that are not in the training data with a single token, even though such unseen words are likely to play crucial roles in differentiating the candidate answers from the actual answers. In this paper, we propose a method to compute vectors of such unseen words by taking into account the context in which the words have occurred. Next, we also propose a model which utilizes inverse document frequencies (IDF) to efficiently process unseen words by expanding the system’s vocabulary. Finally, we validate that the proposed method and model improve the performance of a QA system through experiments.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

W. Lee, G. Song, K. Shim, "Inverse Document Frequency-Based Word Embedding of Unseen Words for Question Answering Systems," Journal of KIISE, JOK, vol. 43, no. 8, pp. 902-909, 2016. DOI: .


[ACM Style]

Wooin Lee, Gwangho Song, and Kyuseok Shim. 2016. Inverse Document Frequency-Based Word Embedding of Unseen Words for Question Answering Systems. Journal of KIISE, JOK, 43, 8, (2016), 902-909. DOI: .


[KCI Style]

이우인, 송광호, 심규석, "질의응답 시스템에서 처음 보는 단어의 역문헌빈도 기반 단어 임베딩 기법," 한국정보과학회 논문지, 제43권, 제8호, 902~909쪽, 2016. DOI: .


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr