A Semi-automatic Construction method of a Named Entity Dictionary Based on Wikipedia 


Vol. 42,  No. 11, pp. 1397-1403, Nov.  2015


PDF

  Abstract

A named entity(NE) dictionary is an important resource for the performance of NE recognition. However, it is not easy to construct a NE dictionary manually since human annotation is time consuming and labor-intensive. To save construction time and reduce human labor, we propose a semi-automatic system for the construction of a NE dictionary. The proposed system constructs a pseudo-document with Wiki-categories per NE class by using an active learning technique. Then, it calculates similarities between Wiki entries and pseudo-documents using the BM25 model, a well-known information retrieval model. Finally, it classifies each Wiki entry into NE classes based on similarities. In experiments with three different types of NE class sets, the proposed system showed high performance(macro-average F1-score of 0.9028 and micro-average F1-score 0.9554).


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

Y. Song, S. Jeong, H. Kim, "A Semi-automatic Construction method of a Named Entity Dictionary Based on Wikipedia," Journal of KIISE, JOK, vol. 42, no. 11, pp. 1397-1403, 2015. DOI: .


[ACM Style]

Yeongkil Song, Seokwon Jeong, and Harksoo Kim. 2015. A Semi-automatic Construction method of a Named Entity Dictionary Based on Wikipedia. Journal of KIISE, JOK, 42, 11, (2015), 1397-1403. DOI: .


[KCI Style]

송영길, 정석원, 김학수, "위키피디아 기반 개체명 사전 반자동 구축 방법," 한국정보과학회 논문지, 제42권, 제11호, 1397~1403쪽, 2015. DOI: .


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr