Search : [ keyword: 머신러닝 ] (11)

Pretrained Large Language Model-based Drug-Target Binding Affinity Prediction for Mutated Proteins

Taeung Song, Jin Hyuk Kim, Hyeon Jun Park, Jonghwan Choi

http://doi.org/10.5626/JOK.2025.52.6.539

Drug development is a costly and time-consuming process. Accurately predicting the impact of protein mutations on drug-target binding affinity remains a major challenge. Previous studies have utilized long short-term memory (LSTM) and transformer models for amino acid sequence processing. However, LSTMs suffer from long-sequence dependency issues, while transformers face high computational costs. In contrast, pretrained large language models (pLLMs) excel in handling long sequences, yet prompt-based approaches alone are insufficient for accurate binding affinity prediction. This study proposed a method that could leverage pLLMs to analyze protein structural data, transform it into embedding vectors, and use a separate machine learning model for numerical binding affinity prediction. Experimental results demonstrated that the proposed approach outperformed conventional LSTM and prompt-based methods, achieving lower root mean square error (RMSE) and higher Pearson correlation coefficient (PCC), particularly in mutation-specific predictions. Additionally, performance analysis of pLLM quantization confirmed that the method maintained sufficient accuracy with reduced computational cost.

UnityPGTA: A Unity Platformer Game Testing Automation Tool Using Reinforcement Learning

Se-chan Park, Deock Yeop Kim, Woo Jin Lee

http://doi.org/10.5626/JOK.2024.51.2.149

The cost of game testing in the video game industry is significant, accounting for nearly half of the expenses. Research efforts are underway to automate testing processes to reduce testing costs. However, existing research on test automation often involves manual tasks such as script writing, which is costly and labor-intensive. Additionally, implementations using virtual environments like VGDL and GVG-AI pose challenges when applied to real game testing. In this paper, we propose a tool for automating game testing with the aim of system fault detection, focusing on a Unity platformer game. The proposed tool is based on a commercial game engine, autonomously analyzing the game without human intervention to establish an automated game testing environment. We compare and analyze the error detection results of the proposed tool with a random baseline model using open-source games, demonstrating the tool"s effectiveness in performing automated game analysis and testing environment setup, ultimately reducing testing costs and improving quality and stability.

Prediction of Toothbrushing Position Based on Gyro Sensor Data and its Validation Using Unsupervised Learning-based Clustering

DoYoon Kim, MinWook Kwon, SeungJu Baek, HyeRin Yoon, DaeYeon Lim, Eunah Jo, Seungjae Ryu, Young Wook Kim, Jin Hyun Kim

http://doi.org/10.5626/JOK.2023.50.12.1143

Oral health is an important health indicator that is directly related to longevity. For this reason, oral health has become a key component of public health, from infants to the elderly. The foundation of good oral health is good brushing habits. However, the recommended correct brushing method is not easy to adopt, and this harms oral health. This paper proposes a method to distinguish brushing zones using low-cost IMU sensors to track the correct brushing method. We evaluated the accuracy of the brushing zone estimation method using clustering algorithms in machine learning. In this paper, we propose a method for determining the brushing area based on toothbrush posture alone using the gyro sensor of an IMU sensor. In this paper, we propose a method for determining the brushing area using only the gyro sensor of an IMU sensor based on toothbrush posture. We showed that relatively inexpensive 6-axis IMU gyro sensor data could be used to estimate the user’s brushing area with an accuracy of 80.6%. In addition, we applied a clustering algorithm to these data and trained a logistic regression model using the clustered data to estimate the brushing area. The result was obtained with an accuracy of 86.7%, showing that clustering was effective and that the toothbrush posture-based brushing area estimation proposed in this paper was effective. In conclusion, it is expected that the brushing zone estimation algorithm can be implemented as a function of a relatively low-cost toothbrush and that it can help to maintain oral health by analyzing and improving personal brushing habits.

Predicting Significant Blood Marker Values for Pressure Ulcer Forecasting Utilizing Feature Minimization and Selection

Yeonhee Kim, Hoyoul Jung, Jang-Hwan Choi

http://doi.org/10.5626/JOK.2023.50.12.1054

Pressure ulcers are difficult to treat once they occur, and huge economic costs are incurred during the treatment process. Therefore, predicting the occurrence of pressure ulcers is important in terms of patient suffering and economics. In this study, the correlation between the lab codes (features) and pressure ulcers obtained from blood tests of patients with spinal cord injury was analyzed to provide meaningful characteristic information for the prediction of pressure ulcers. We compare and analyze the correlation coefficients of Pearson, Spearman, and Kendall"s tau, which are mainly used in feature selection methods. In addition, the importance of features is calculated using XGBoost and LightGBM, which are machine learning methods based on gradient boosting. In order to verify the performance of this model, we use the long short-term memory (LSTM) model to predict other features using the features occupying the top-5 in importance. In this way, unnecessary features can be minimized in diagnosing pressure ulcers and guidelines can be provided to medical personnel.

A Visual Analytics System for Interpretable Machine Learning

Chanhee Park, Kyungwon Lee

http://doi.org/10.5626/JOK.2023.50.1.57

Interpretable machine learning is a technology that assists people understand the behavior and prediction of machine learning systems. This study proposes a visual analytics system that can interpret the relationship between how machine learning models relate output results from input data. It supports users to interpret machine learning models easily and clearly. The visual analytics system proposed in this study takes an approach to effectively interpret the machine learning model through an iterative adjustment procedure that filters and groups model decision results according to input variables, target variables, and predicted/classified values. Through use case analysis and in-depth user interviews, we confirmed that our system could provide insights into the complex behavior of machine learning models, gain scientific understanding of input variables, target variables, and model predictions, and help users understand the stability and reliability of models.

OANet: Ortho-Attention Net Based on Attention Mechanism for Database Performance Prediction

Chanho Yeom, Jieun Lee, Sanghyun Park

http://doi.org/10.5626/JOK.2022.49.11.1026

Various parameters in a database can be modified, which are called knobs. Since the performance of the database varies according to the settings of the knobs, it is important to tune the knobs of the database. And when tuning, a model that can reliably and quickly predict database performance according to the knob setting is needed. However, even when the knob setting is the same, the results may be different if the workload performing the benchmark is different. Therefore, in this paper, we propose an OANet using the attention mechanism so that the relationship between the knob and the workload can also be considered. Through experiments, the performance prediction results of the database were compared to various machine learning techniques, and the superiority of the model was confirmed by showing the highest score.

Recommendation Technique for Bug Fixers by Fine-tuning Language Models

Dae-Sung Wang, Hoon Seong, Chan-Gun Lee

http://doi.org/10.5626/JOK.2022.49.11.987

The scale and complexity of software continue to increase; hence they contribute to the occurrence of diverse bugs. Therefore, the necessity of systematic bug management has been raised. A few studies have proposed automating the assignment of bug fixers using word-based deep learning models. However, their accuracy is not satisfactory due to context of the word is ignored, and there is an excessive number of classes. In this paper, the accuracy was improved by about 27%p over the top-10 accuracies by using a fine-tuned pre-trained language model based on BERT, RoBERTa, DeBERTa, and CodeBERT. Experiments confirmed that the accuracy was about 70%. Through this, we showed that the fine-tuned pretrained language model could be effectively applied to automated bug-fixer assignments.

VNF Anomaly Detection Method based on Unsupervised Machine Learning

Seondong Heo, Seunghoon Jeong, Hosang Yun

http://doi.org/10.5626/JOK.2022.49.9.780

By applying virtualization technology to telecommunication networks, it is possible to reduce hardware dependencies and provide flexible control and management to the operators. In addition, since Capital Expenditure (CAPEX) and Operating Expenditure (OPEX) can be reduced by utilizing the technology, modern telco operators and service providers are using Software-Defined Networking(SDN) and Network Function Virtualization (NFV) technology to provide services more efficiently. As SDN and NFV are widely used, cyber attacks on Vitualized Network Functions (VNF) that degrade the quality of service or cause service denial are increasing. In this study, we propose a VNF anomaly detection method based on unsupervised machine learning techniques that models the steady states of VNFs and detects abnormal states caused by cyber attacks.

Drunk Driving Detection System Using Wearable Devices

Seunghwa Lee, Joon Yoo

http://doi.org/10.5626/JOK.2022.49.5.397

Drunk driving may cause traffic accidents that result in human casualties. Even though most people are well aware of the danger, many traffic accidents still occur due to poor judgement caused by drinking. In this paper, we propose a drunk driving detection system using wearable devices. First, we use a smart watch, a wearable device, to collect data using only general-purpose sensors and sends the data via a smart phone to a server that performs machine learning to determine if the user is drinking. Then, the driver detection algorithm, which uses in-car beacons, sends a warning to the user to prevent drunk driving. We implemented the system on a smart watch, smart phone, and server, and also developed a practical user mobile app. The results showed that the accuracy of drinking detection and driver detection algorithms were around 92% and 99%, respectively.

Design and Implementation of a Concurrency Error Detection Method for Embedded Software Using Machine Learning

Dongeon Lee, Jiwon Kim, Junghun Jin, Kyutae Cho

http://doi.org/10.5626/JOK.2022.49.5.327

Unlike general-purpose software, embedded software is designed by optimizing hardware for a specific purpose, so it is important to satisfy the target performance in a limited environment. Embedded software is increasing significantly in scale and complexity compared to the past. As the scale and complexity increase, the types of errors that occur in the software also diversify. Among them, there are many issues regarding concurrency errors that may occur between complex software modules. To detect concurrency errors in such embedded software, we have previously relied on manual input from the user. However, in this study, we propose a machine learning-based concurrency error detection tool (MCED) using SVM and deep learning.


Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr