A Deep Learning based Speech Quality Enhancement Scheme Using Environmental Sound Classification and Location Information 


Vol. 50,  No. 4, pp. 344-350, Apr.  2023
10.5626/JOK.2023.50.4.344


PDF

  Abstract

In the field of speech processing, deep learning has made great advances by improving the precision of speech recognition. One of advances, voice improvement, is a technique that can improve voice recognition by separating voice and noise from input mixed with speaking voice and noise. This is used in AI-speakers and smartphones to facilitate human-to-human communication and enable clean voice data collection for robots and text-to-speech. However, conventional speech enhancement techniques that use only a single model are not effective in eliminating noise that occurs specifically in each environment. To effectively eliminate environmental specific noise, this paper proposes a deep learning model that combines acoustic scene classification techniques with location information utilization techniques to enable optimal environmental-specific speech enhancements. As a result of the experiment, it is confirmed that this technique shows high voice quality improvement with low computational cost in various environments compared to the existing technique.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

B. H. Kang and D. K. Noh, "A Deep Learning based Speech Quality Enhancement Scheme Using Environmental Sound Classification and Location Information," Journal of KIISE, JOK, vol. 50, no. 4, pp. 344-350, 2023. DOI: 10.5626/JOK.2023.50.4.344.


[ACM Style]

Byung Hee Kang and Dong Kun Noh. 2023. A Deep Learning based Speech Quality Enhancement Scheme Using Environmental Sound Classification and Location Information. Journal of KIISE, JOK, 50, 4, (2023), 344-350. DOI: 10.5626/JOK.2023.50.4.344.


[KCI Style]

강병휘, 노동건, "환경음 분류와 위치 정보를 이용한 딥러닝 기반 음성 품질 향상 기법," 한국정보과학회 논문지, 제50권, 제4호, 344~350쪽, 2023. DOI: 10.5626/JOK.2023.50.4.344.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr