1. What contributions have the authors mentioned in the paper "Deepmmse: a deep learning approach to mmse-based noise power spectral density estimation author" ?
Motivated by this, the authors propose an MMSE-based noise PSD tracker that employs a temporal convolutional network ( TCN ) a priori SNR estimator.. Furthermore, when employed in a speech enhancement framework, the proposed DeepMMSE method is able to outperform state-of-the-art noise PSD trackers, as well as multiple deep learning approaches to speech enhancement.
read more
2. What are the future works mentioned in the paper "Deepmmse: a deep learning approach to mmse-based noise power spectral density estimation author" ?
This may be investigated in future work to obtain a further improvement in performance.. Further improvements in performance may be obtained by using a deep learning approach to estimate the a posteriori SNR directly.
read more
3. How many gamma priors are used to enhance the noisy speech magnitude spectrum?
3) γ̂ and ξ̂ are then used by the MMSE clean speech spectrum estimator with generalised Gamma priors from [2] to enhance the noisy speech magnitude spectrum.
read more
4. how many epochs are used to train the tnn?
A total of 175 epochs is used to train the TCN, where thenumber of training examples in an epoch is equal to the number of clean speech files in the training set (70 537).
read more





![TABLE IV STOI SCORES (IN %) OF THE ENHANCED SPEECH PRODUCED BY EACH OF THE NOISE PSD TRACKERS, AS WELL AS LSTM-IRM [57] AND XU2017 [58]. THE HIGHEST STOI SCORE FOR EACH TESTED CONDITION IS INDICATED IN BOLDFACE.](/figures/table-iv-stoi-scores-in-of-the-enhanced-speech-produced-by-1nmjufj1.png)