An Automatic Method of Generating a Large-Scale Train Set for Bi-LSTM based Sentiment Analysis 


Vol. 46,  No. 8, pp. 800-813, Aug.  2019
10.5626/JOK.2019.46.8.800


PDF

  Abstract

Sentiment analysis using deep learning requires a large-scale train set labeled sentiment. However, direct labeling of sentiment by humans is time and cost-constrained, and it is not easy to collect the required data for sentiment analysis from many data. In the present work, to solve the existing problems, the existing sentiment lexicon was used to assign sentiment score, and when there was sentiment transformation element, the sentiment score was reset through dependency parsing and morphological analysis for automatic generation of large-scale train set labeled with the sentiment. The Top-k data with high sentiment score was extracted. Sentiment transformation elements include sentiment reversal, sentiment activation, and sentiment deactivation. Our experimental results reveal the generation of a large-scale train set in a shorter time than manual labeling and improvement in the performance of deep learning with an increase in the amount of train set. The accuracy of the model using only sentiment lexicon was 80.17% and the accuracy of the proposed model, which includes natural language processing technology was 89.17%. Overall, a 9% improvement was observed.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

M. Choi and B. On, "An Automatic Method of Generating a Large-Scale Train Set for Bi-LSTM based Sentiment Analysis," Journal of KIISE, JOK, vol. 46, no. 8, pp. 800-813, 2019. DOI: 10.5626/JOK.2019.46.8.800.


[ACM Style]

Min-Seong Choi and Byung-Won On. 2019. An Automatic Method of Generating a Large-Scale Train Set for Bi-LSTM based Sentiment Analysis. Journal of KIISE, JOK, 46, 8, (2019), 800-813. DOI: 10.5626/JOK.2019.46.8.800.


[KCI Style]

최민성, 온병원, "Bi-LSTM 기반 감성분석을 위한 대용량 학습데이터 자동 생성 방안," 한국정보과학회 논문지, 제46권, 제8호, 800~813쪽, 2019. DOI: 10.5626/JOK.2019.46.8.800.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr