Calibration of Pre-trained Language Model for the Korean Language

Soyeong Jeong; Wonsuk Yang; ChaeHun Park; Jong C. Park

Calibration of Pre-trained Language Model for the Korean Language

Vol. 48, No. 4, pp. 434-443, Apr. 2021

10.5626/JOK.2021.48.4.434

Language Model

Ambiguity

PDF

Abstract

The development of deep learning models has continuously demonstrated performance beyond humans reach in various tasks such as computer vision and natural language understanding tasks. In particular, pre-trained Transformer models have recently shown remarkable performance in natural language understanding problems such as question answering (QA) tasks and dialogue tasks. However, despite the rapid development of deep learning models such as Transformer-based models, the underlying mechanisms of action remain relatively unknown. As a method of analyzing deep learning models, calibration of models measures the extent of matching of the predicted value of the model (confidence) with the actual value (accuracy). Our study aims at interpreting pre-trained Korean language models based on calibration. In particular, we have analyzed whether pre-trained Korean language models can capture ambiguities in sentences and applied the smoothing methods to quantitatively measure such ambiguities with confidence. In addition, in terms of calibration, we have evaluated the capability of pre-trained Korean language models in identifying grammatical characteristics in the Korean language, which affect semantic changes in the Korean sentences.

Statistics

Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.

Cite this article

[IEEE Style]

S. Jeong, W. Yang, C. Park, J. C. Park, "Calibration of Pre-trained Language Model for the Korean Language," Journal of KIISE, JOK, vol. 48, no. 4, pp. 434-443, 2021. DOI: 10.5626/JOK.2021.48.4.434.

[ACM Style]

Soyeong Jeong, Wonsuk Yang, ChaeHun Park, and Jong C. Park. 2021. Calibration of Pre-trained Language Model for the Korean Language. Journal of KIISE, JOK, 48, 4, (2021), 434-443. DOI: 10.5626/JOK.2021.48.4.434.

[KCI Style]

정소영, 양원석, 박채훈, 박종철, "사전 학습된 한국어 언어 모델의 보정," 한국정보과학회 논문지, 제48권, 제4호, 434~443쪽, 2021. DOI: 10.5626/JOK.2021.48.4.434.

[Endnote/Zotero/Mendeley (RIS)] Download

[BibTeX] Download

Search

Journal of KIISE

ISSN : 2383-630X(Print)
ISSN : 2383-6296(Electronic)
KCI Accredited Journal

Editorial Office

Tel. +82-2-588-9240
Fax. +82-2-521-1352
E-mail. chwoo@kiise.or.kr