TY - JOUR
T1 - A Semi-automatic Construction method of a Named Entity Dictionary Based on Wikipedia
AU - Song, Yeongkil
AU - Jeong, Seokwon
AU - Kim, Harksoo
JO - Journal of KIISE, JOK
PY - 2015
DA - 2015/1/14
DO -
KW - named entity dictionary construction
KW - Wikipedia
KW - information retrieval method
KW - active learning
AB - A named entity (NE) dictionary is an important resource for NE recognition performance. However, constructing an NE dictionary manually is difficult because human annotation is time-consuming and labor-intensive. To save construction time and reduce human labor, we propose a semi-automatic system for constructing an NE dictionary. The proposed system uses an active learning technique to build one pseudo-document of Wiki-categories per NE class. It then calculates the similarity between each Wiki entry and each pseudo-document using the BM25 model, a well-known information retrieval model. Finally, it assigns each Wiki entry to an NE class based on these similarities. In experiments with three different types of NE class sets, the proposed system showed high performance (macro-average F1-score of 0.9028 and micro-average F1-score of 0.9554).
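
The similarity and classification steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `bm25_scores`, `classify`, and `PSEUDO_DOCS`, the toy category terms, the parameters `k1` and `b`, and the non-negative IDF variant are all assumptions made for illustration, and the active learning step that builds the pseudo-documents is left out. Each Wiki entry is treated as a query made of its category terms, and each NE class is represented by one pseudo-document of category terms.

```python
import math
from collections import Counter

# Hypothetical pseudo-documents: one bag of Wiki-category terms per NE class.
PSEUDO_DOCS = {
    "PERSON": ["births", "people", "singers", "actors", "living"],
    "LOCATION": ["cities", "countries", "capitals", "provinces"],
    "ORGANIZATION": ["companies", "universities", "established", "organizations"],
}


def bm25_scores(query_terms, docs, k1=1.2, b=0.75):
    """Return a BM25 score for every document in `docs` (name -> list of terms)."""
    n_docs = len(docs)
    avg_len = sum(len(terms) for terms in docs.values()) / n_docs

    # Document frequency: number of pseudo-documents that contain each term.
    df = Counter()
    for terms in docs.values():
        df.update(set(terms))

    scores = {}
    for name, terms in docs.items():
        tf = Counter(terms)
        score = 0.0
        for term in set(query_terms):
            if term not in tf:
                continue
            # Assumed IDF variant: adding 1 inside the log keeps it non-negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(terms) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores[name] = score
    return scores


def classify(entry_categories, pseudo_docs):
    """Assign a Wiki entry to the best-scoring NE class, or None if nothing matches."""
    scores = bm25_scores(entry_categories, pseudo_docs)
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    print(classify(["living", "people", "singers"], PSEUDO_DOCS))
    print(classify(["capitals", "cities"], PSEUDO_DOCS))
```

In this sketch an entry whose categories match no pseudo-document is left unclassified (`None`) rather than forced into a class. Whether the original system handles such entries this way is not stated in the abstract.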