Effective Generative Chatbot Model Trainable with a Small Dialogue Corpus 


Vol. 46,  No. 3, pp. 246-252, Mar.  2019
10.5626/JOK.2019.46.3.246


PDF

  Abstract

Contrary to popular retrieval-based chatbot models, generative chatbot models do not depend on predefined responses, but rather generate new responses based on well-trained neural networks. However, they require a large number of training corpus in the form of query-response pairs. If the training corpus are insufficient, they make grammatical errors emanating from out-of-vocabulary or sparse data problems, mostly in longer sentences. To overcome this challenge, we proposed a chatbot model based on sequence-to-sequence neural network using a mixture of words and syllables as encoding-decoding units. Moreover, we proposed a two-step training procedure involving pre-training using a large non-dialogue corpus and retraining using a smaller dialogue corpus. In the experiment involving small dialogue corpus (47,089 query-response pairs for training and 3,000 query-response pairs for evaluation), the proposed encoding-decoding units resulted to a reduction in out-of-vocabulary problem while the two-step training method led to improved performance measures like BLEU and ROUGE.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

J. Kim, H. Lee, H. Kim, "Effective Generative Chatbot Model Trainable with a Small Dialogue Corpus," Journal of KIISE, JOK, vol. 46, no. 3, pp. 246-252, 2019. DOI: 10.5626/JOK.2019.46.3.246.


[ACM Style]

Jintae Kim, Hyeon-gu Lee, and Harksoo Kim. 2019. Effective Generative Chatbot Model Trainable with a Small Dialogue Corpus. Journal of KIISE, JOK, 46, 3, (2019), 246-252. DOI: 10.5626/JOK.2019.46.3.246.


[KCI Style]

김진태, 이현구, 김학수, "소량의 대화 말뭉치에서 학습 가능한 효과적인 생성 기반 챗봇 모델," 한국정보과학회 논문지, 제46권, 제3호, 246~252쪽, 2019. DOI: 10.5626/JOK.2019.46.3.246.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr