TY - JOUR T1 - Mixed Sound Source Localization via Audio-Visual Information Fusion AU - Lee, YuEun AU - Um, Sung Jin AU - Kim, Jung Uk JO - Journal of KIISE, JOK PY - 2025 DA - 2025/1/14 DO - 10.5626/JOK.2025.52.9.762 KW - sound source localization KW - audio-visual fusion KW - iterative method KW - prior knowledge AB - Multi-source localization is a research topic that uses audio mixed with multiple sources within a visual scene to identify the locations of individual sound sources. Existing studies have limitations in that they primarily use auditory information to assist the spatial domain of visual information, and they require prior knowledge information integration module that fuses audio-visual information, allowing auditory information to be utilized alongside spatial cues and visual information. Additionally, we introduce an object repetition detection module designed to identify objects that produce sounds repeatedly, enabling effective localization and separation of multiple sound sources without needing prior knowledge of the number of objects. The proposed method address the limitations of existing studies and enhances sound source localization capabilities. We also conducted experiments on the VGGSound dataset and achieved better performance than existing approaches.