CNN-based Speech Emotion Recognition Model Applying Transfer Learning and Attention Mechanism 


Vol. 47,  No. 7, pp. 665-673, Jul.  2020
10.5626/JOK.2020.47.7.665


PDF

  Abstract

Existing speech-based emotion recognition studies can be classified into the case of using a voice feature value and a variety of voice feature values. In the case of using a voice feature value, there is a problem that it is difficult to reflect the complex factors of the voice such as loudness, overtone structure, and range of voices. In the case of using various voice feature values, studies based on machine learning comprise a large number, and there is a disadvantage in that emotion recognition accuracy is relatively lower than that of deep learning-based studies. To resolve this problem, we propose a speech emotion recognition model based on a CNN(Convolutional Neural Network) using Mel-Spectrogram and Mel Frequency Cepstral Coefficient (MFCC) as voice feature values. The proposed model applied transfer learning and attention to improve learning speed and accuracy, and achieved 77.65% emotion recognition accuracy, showing higher performance than the comparison works.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

J. H. Lee, U. N. Yoon, G. Jo, "CNN-based Speech Emotion Recognition Model Applying Transfer Learning and Attention Mechanism," Journal of KIISE, JOK, vol. 47, no. 7, pp. 665-673, 2020. DOI: 10.5626/JOK.2020.47.7.665.


[ACM Style]

Jung Hyun Lee, Ui Nyoung Yoon, and Geun-Sik Jo. 2020. CNN-based Speech Emotion Recognition Model Applying Transfer Learning and Attention Mechanism. Journal of KIISE, JOK, 47, 7, (2020), 665-673. DOI: 10.5626/JOK.2020.47.7.665.


[KCI Style]

이정현, 윤의녕, 조근식, "전이 학습과 어텐션(Attention)을 적용한 합성곱 신경망 기반의 음성 감정 인식 모델," 한국정보과학회 논문지, 제47권, 제7호, 665~673쪽, 2020. DOI: 10.5626/JOK.2020.47.7.665.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr