Lowest Level Term Recommendation in MedDRA-Based Medical Coding Using Subword Tokenization 


Vol. 52,  No. 10, pp. 825-832, Oct.  2025
10.5626/JOK.2025.52.10.825


PDF

  Abstract

In clinical trials, the terms reported by investigators for subjects’ medical history and adverse events are called Reported Terms (RT). Since the same symptom can be expressed differently by each investigator, a medical coding process that maps RTs to the MedDRA standard Low-Level Terms (LLTs) is essential. While English-based systems exist, domestic studies face challenges due to the mixture of Korean and English. This study proposes a method to convert RTs, including Korean expressions, into LLTs. We constructed a training dataset of 4,398 RT–LLT pairs collected from domestic clinical trials and applied subword tokenization algorithms—SentencePiece and WordPiece—to handle noise such as multilingual inputs, spacing, typos, and punctuation. Using the token vocabulary generated during training, we segmented new RTs and implemented an algorithm that recommends top-k LLTs based on matching scores. Test results showed that the correct LLT was included within the top five candidate list on average.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

S. Park and I. Oh, "Lowest Level Term Recommendation in MedDRA-Based Medical Coding Using Subword Tokenization," Journal of KIISE, JOK, vol. 52, no. 10, pp. 825-832, 2025. DOI: 10.5626/JOK.2025.52.10.825.


[ACM Style]

Se-Hee Park and Il-Seok Oh. 2025. Lowest Level Term Recommendation in MedDRA-Based Medical Coding Using Subword Tokenization. Journal of KIISE, JOK, 52, 10, (2025), 825-832. DOI: 10.5626/JOK.2025.52.10.825.


[KCI Style]

박세희, 오일석, "MedDRA 기반 의료 코딩에서 서브워드 토큰화를 활용한 최하위 용어 추천," 한국정보과학회 논문지, 제52권, 제10호, 825~832쪽, 2025. DOI: 10.5626/JOK.2025.52.10.825.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr