A Study on Improving the Accuracy of Korean Speech Recognition Texts Using KcBERT 


Vol. 51,  No. 12, pp. 1115-1124, Dec.  2024
10.5626/JOK.2024.51.12.1115


PDF

  Abstract

In the field of speech recognition, models such as Whisper, Wav2Vec2.0, and Google STT are widely utilized. However, Korean speech recognition faces challenges because complex phonological rules and diverse pronunciation variations hinder performance improvements. To address these issues, this study proposed a method that combined the Whisper model with a post-processing approach using KcBERT. By applying KcBERT’s bidirectional contextual learning to text generated by the Whisper model, the proposed method could enhance contextual coherence and refine the text for greater naturalness. Experimental results showed that post-processing reduced the Character Error Rate (CER) from 5.12% to 1.88% in clean environments and from 22.65% to 10.17% in noisy environments. Furthermore, the Word Error Rate (WER) was significantly improved, decreasing from 13.29% to 2.71% in clean settings and from 38.98% to 11.15% in noisy settings. BERTScore also exhibited overall improvement. These results demonstrate that the proposed approach is effective in addressing complex phonological rules and maintaining text coherence within Korean speech recognition.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

D. Min, S. Nam, D. Choi, "A Study on Improving the Accuracy of Korean Speech Recognition Texts Using KcBERT," Journal of KIISE, JOK, vol. 51, no. 12, pp. 1115-1124, 2024. DOI: 10.5626/JOK.2024.51.12.1115.


[ACM Style]

Donguk Min, Seungsoo Nam, and Daeseon Choi. 2024. A Study on Improving the Accuracy of Korean Speech Recognition Texts Using KcBERT. Journal of KIISE, JOK, 51, 12, (2024), 1115-1124. DOI: 10.5626/JOK.2024.51.12.1115.


[KCI Style]

민동욱, 남승수, 최대선, "KcBERT를 활용한 한국어 음성인식 텍스트 정확도 향상 연구," 한국정보과학회 논문지, 제51권, 제12호, 1115~1124쪽, 2024. DOI: 10.5626/JOK.2024.51.12.1115.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr