Deep Neural Networks and End-to-End Learning for Audio Compression 


Vol. 48,  No. 8, pp. 940-946, Aug.  2021
10.5626/JOK.2021.48.8.940


PDF

  Abstract

Recent advances in end-to-end deep learning have encouraged the exploration of tasks dealing with highly structured data using unified deep network models. The fabrication and design of such models for compressing audio signals has been a challenge due to the need for discrete representations that are not easy to train with end-to-end backpropagation. In this paper, we present an end-to-end deep learning approach that combines recurrent neural networks (RNNs) within the training strategy of variational autoencoders (VAEs) with a binary representation of the latent space. We apply a reparametrization trick for the Bernoulli distribution for the discrete representations, which allows smooth backpropagation. In addition, our approach enables the separation of the encoder and decoder, which is necessary for compression tasks. To the best of our knowledge, this is the first end-to-end learning for a single audio compression model with RNNs, and our model achieves a Signal to Distortion Ratio (SDR) of 20.53dB.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

D. N. Rim, I. Jang, H. Choi, "Deep Neural Networks and End-to-End Learning for Audio Compression," Journal of KIISE, JOK, vol. 48, no. 8, pp. 940-946, 2021. DOI: 10.5626/JOK.2021.48.8.940.


[ACM Style]

Daniela N. Rim, Inseon Jang, and Heeyoul Choi. 2021. Deep Neural Networks and End-to-End Learning for Audio Compression. Journal of KIISE, JOK, 48, 8, (2021), 940-946. DOI: 10.5626/JOK.2021.48.8.940.


[KCI Style]

Daniela N. Rim, Inseon Jang, Heeyoul Choi, "Deep Neural Networks and End-to-End Learning for Audio Compression," 한국정보과학회 논문지, 제48권, 제8호, 940~946쪽, 2021. DOI: 10.5626/JOK.2021.48.8.940.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr