Calibration of Pre-trained Language Model for the Korean Language 


Vol. 48,  No. 4, pp. 434-443, Apr.  2021
10.5626/JOK.2021.48.4.434



  Abstract

Deep learning models have continued to achieve performance exceeding that of humans in various tasks, including computer vision and natural language understanding. In particular, pre-trained Transformer models have recently shown remarkable performance on natural language understanding problems such as question answering (QA) and dialogue tasks. However, despite the rapid development of deep learning models such as Transformer-based models, their underlying mechanisms remain poorly understood. Calibration, one method of analyzing deep learning models, measures how closely a model's predicted confidence matches its actual accuracy. Our study aims to interpret pre-trained Korean language models on the basis of calibration. Specifically, we analyze whether pre-trained Korean language models can capture ambiguities in sentences, and we apply smoothing methods to measure such ambiguities quantitatively in terms of confidence. In addition, from the standpoint of calibration, we evaluate how well pre-trained Korean language models identify grammatical characteristics of the Korean language that change the meaning of Korean sentences.
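As an illustration of the confidence-versus-accuracy comparison described in the abstract, the following minimal sketch computes the expected calibration error (ECE), one widely used calibration metric. The abstract does not say which metric the paper uses, so the choice of ECE, the function name `expected_calibration_error`, and the `n_bins` parameter are assumptions made here for illustration only.

```python
import numpy as np


def expected_calibration_error(confidences, correct, n_bins=10):
    """Hypothetical helper: weighted mean gap between accuracy and confidence.

    confidences: top-label probability for each prediction, in (0, 1].
    correct:     1 if the prediction was right, 0 otherwise.
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)

    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        # Each bin is the half-open interval (lo, hi].
        in_bin = (confidences > lo) & (confidences <= hi)
        if not in_bin.any():
            continue
        accuracy = correct[in_bin].mean()
        avg_confidence = confidences[in_bin].mean()
        # Weight the gap by the fraction of predictions in this bin.
        ece += in_bin.mean() * abs(accuracy - avg_confidence)
    return float(ece)


# Toy data, not results from the paper: four predictions made at 0.9
# confidence, three of which are correct.
overconfident = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [1, 1, 1, 0])
```

A value of zero means that confidence and accuracy agree in every bin; larger values indicate a model that is over- or under-confident.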




  Cite this article

[IEEE Style]

S. Jeong, W. Yang, C. Park, J. C. Park, "Calibration of Pre-trained Language Model for the Korean Language," Journal of KIISE, JOK, vol. 48, no. 4, pp. 434-443, 2021. DOI: 10.5626/JOK.2021.48.4.434.


[ACM Style]

Soyeong Jeong, Wonsuk Yang, ChaeHun Park, and Jong C. Park. 2021. Calibration of Pre-trained Language Model for the Korean Language. Journal of KIISE, JOK, 48, 4, (2021), 434-443. DOI: 10.5626/JOK.2021.48.4.434.


[KCI Style]

정소영, 양원석, 박채훈, 박종철, "사전 학습된 한국어 언어 모델의 보정," 한국정보과학회 논문지, 제48권, 제4호, 434~443쪽, 2021. DOI: 10.5626/JOK.2021.48.4.434.









Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal
