Pretrained Large Language Model-based Drug-Target Binding Affinity Prediction for Mutated Proteins

Taeung Song; Jin Hyuk Kim; Hyeon Jun Park; Jonghwan Choi

Pretrained Large Language Model-based Drug-Target Binding Affinity Prediction for Mutated Proteins

Vol. 52, No. 6, pp. 539-547, Jun. 2025

10.5626/JOK.2025.52.6.539

Machine Learning

Large Language Model

Drug Discovery

binding affinity prediction

mutant protein

PDF

Abstract

Drug development is a costly and time-consuming process. Accurately predicting the impact of protein mutations on drug-target binding affinity remains a major challenge. Previous studies have utilized long short-term memory (LSTM) and transformer models for amino acid sequence processing. However, LSTMs suffer from long-sequence dependency issues, while transformers face high computational costs. In contrast, pretrained large language models (pLLMs) excel in handling long sequences, yet prompt-based approaches alone are insufficient for accurate binding affinity prediction. This study proposed a method that could leverage pLLMs to analyze protein structural data, transform it into embedding vectors, and use a separate machine learning model for numerical binding affinity prediction. Experimental results demonstrated that the proposed approach outperformed conventional LSTM and prompt-based methods, achieving lower root mean square error (RMSE) and higher Pearson correlation coefficient (PCC), particularly in mutation-specific predictions. Additionally, performance analysis of pLLM quantization confirmed that the method maintained sufficient accuracy with reduced computational cost.

Statistics

Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.

Cite this article

[IEEE Style]

T. Song, J. H. Kim, H. J. Park, J. Choi, "Pretrained Large Language Model-based Drug-Target Binding Affinity Prediction for Mutated Proteins," Journal of KIISE, JOK, vol. 52, no. 6, pp. 539-547, 2025. DOI: 10.5626/JOK.2025.52.6.539.

[ACM Style]

Taeung Song, Jin Hyuk Kim, Hyeon Jun Park, and Jonghwan Choi. 2025. Pretrained Large Language Model-based Drug-Target Binding Affinity Prediction for Mutated Proteins. Journal of KIISE, JOK, 52, 6, (2025), 539-547. DOI: 10.5626/JOK.2025.52.6.539.

[KCI Style]

송태웅, 김진혁, 박현준, 최종환, "돌연변이 단백질에 대한 사전 학습 대규모 언어 모델 기반 약물-표적 결합 친화도 예측," 한국정보과학회 논문지, 제52권, 제6호, 539~547쪽, 2025. DOI: 10.5626/JOK.2025.52.6.539.

[Endnote/Zotero/Mendeley (RIS)] Download

[BibTeX] Download

Search

Journal of KIISE

ISSN : 2383-630X(Print)
ISSN : 2383-6296(Electronic)
KCI Accredited Journal

Editorial Office

Tel. +82-2-588-9240
Fax. +82-2-521-1352
E-mail. chwoo@kiise.or.kr