TY - JOUR T1 - Anomaly Detection in Mammography Data using Dimensionality Reduction and DBSCAN Clustering for Enhancing Diagnostic Model Performance AU - Kim, Donghee AU - Kim, Jaeil JO - Journal of KIISE, JOK PY - 2025 DA - 2025/1/14 DO - 10.5626/JOK.2025.52.9.787 KW - mammography data KW - anomaly detection KW - autoencoder KW - dimensionality reduction KW - DBSCAN clustering AB - This study introduces a data cleaning technique for identifying and removing anomalous images from mammography data. An autoencoder extracts low-dimensional latent features, which are then refined through dimensionality reduction (methods such as PCA, t-SNE, and Isomap) to highlight irregular patterns. DBSCAN clustering is subsequently employed to detect anomalies. An ablation study confirmed that dimensionality reduction enhances anomaly detection, and the impact of anomaly removal on model training was assessed. Results indicate that the combination of t-SNE and DBSCAN yields superior performance, with the refined model demonstrating significant improvements in accuracy and sensitivity. These findings enhance the reliability of AI-based breast cancer diagnosis and present a promising pre-processing method for medical imaging.