Chia Dai
Carnegie Mellon University
5 Papers
145 Citations
Chia Dai is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Dropout (neural networks) & Computer science. The author has an hindex of 2, co-authored 2 publications.
Chat about Author
Papers
Very deep convolutional neural networks for raw waveforms
Wei Dai,Chia Dai,Shuhui Qu,Juncheng Li,Samarjit Das +4 more
- 05 Mar 2017
TL;DR: Very deep convolutional neural networks (CNNs) were proposed in this article to directly use time-domain waveforms as inputs, which can optimize over very long sequences (e.g., vector of size 32000) necessary for processing acoustic waveforms.
•Posted Content
Very Deep Convolutional Neural Networks for Raw Waveforms
TL;DR: This work proposes very deep convolutional neural networks that directly use time-domain waveforms as inputs that are efficient to optimize over very long sequences, necessary for processing acoustic waveforms.
144
Leveraging the perceptual metric loss to improve the DEMUCS system in speech enhancement
Qinglin Hong,Chia Dai,Hui-Chun Hsu,Zong-Tai Wu,Jeih-weih Hung +4 more
- 22 Apr 2022
TL;DR: In this article , a modified loss function was proposed to improve the performance of DEMUCS by considering the perceptual metric scores, including perceptual speech quality (PESQ) and short-time objective intelligibility (STOI).
1
Proceedings Article
Exploiting the compressed spectral loss for the learning of the DEMUCS speech enhancement network
TL;DR: In this article , the authors proposed to improve a highly effective speech enhancement technique, DEMUCS, by revising the respective loss function in learning, which is done by either the power-law operation with a positive exponent less than one, or the logarithmic operation.
Improving the performance of CMGAN in speech enhancement with the phone fortified perceptual loss
Chia Dai,Jia-Xuan Zeng,Wan-Ling Zeng,Eric S. Li,Jeih-weih Hung +4 more
- 17 Jul 2023
TL;DR: The preliminary experiments carried out on the VoiceBank-DEMAND task suggest that the incorporation of the PFPL into the CMGAN learning process leads to a notable enhancement in the objective metrics (PESQ and STOI) scores for the test data.