Prompt Engineering for Korean OCR Error Correction and Text Damage Restoration 


Vol. 52,  No. 11, pp. 940-947, Nov.  2025
10.5626/JOK.2025.52.11.940


PDF

  Abstract

Optical Character Recognition (OCR) is a technology that converts text within images into machine-readable formats, making it essential in industries where document management is critical. However, the Korean language has a complex structure, featuring combined consonants and vowels, which can lead to low recognition accuracy. Improving this situation requires a vast dataset that encompasses all 11,172 complete Korean characters. Additionally, errors such as spacing and spelling mistakes, along with text distortion and damage, complicate post-processing with conventional spell-check models. To tackle these challenges, this paper proposes the use of a Large Language Model combined with Few-shot Learning and Prompt Engineering. Experimental results indicate that error correction accuracy improved by up to 18.18% compared to basic prompts, while text restoration and spacing correction achieved performance improvements of 21.6% and 17.26%, respectively. These findings demonstrate that even with a limited number of examples, Korean OCR errors can be effectively corrected, and damaged text can be restored.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

S. Park, H. Lee, S. Choi, "Prompt Engineering for Korean OCR Error Correction and Text Damage Restoration," Journal of KIISE, JOK, vol. 52, no. 11, pp. 940-947, 2025. DOI: 10.5626/JOK.2025.52.11.940.


[ACM Style]

Suhyun Park, Hyojin Lee, and Sung-Pil Choi. 2025. Prompt Engineering for Korean OCR Error Correction and Text Damage Restoration. Journal of KIISE, JOK, 52, 11, (2025), 940-947. DOI: 10.5626/JOK.2025.52.11.940.


[KCI Style]

박수현, 이효진, 최성필, "프롬프트 엔지니어링 기반 한글 OCR 오류 교정 및 텍스트 손상 복원," 한국정보과학회 논문지, 제52권, 제11호, 940~947쪽, 2025. DOI: 10.5626/JOK.2025.52.11.940.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr