Journal Article10.1109/TASL.2007.911054
Evaluation of Objective Quality Measures for Speech Enhancement
Yi Hu,Philipos C. Loizou +1 more
1.9K
TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
read more
Abstract: In this paper, we evaluate the performance of several objective measures in terms of predicting the quality of noisy speech enhanced by noise suppression algorithms. The objective measures considered a wide range of distortions introduced by four types of real-world noise at two signal-to-noise ratio levels by four classes of speech enhancement algorithms: spectral subtractive, subspace, statistical-model based, and Wiener algorithms. The subjective quality ratings were obtained using the ITU-T P.835 methodology designed to evaluate the quality of enhanced speech along three dimensions: signal distortion, noise distortion, and overall quality. This paper reports on the evaluation of correlations of several objective measures with these three subjective rating scales. Several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Correction to "Maximum Likelihood PSD Estimation for Speech Enhancement in Reverberation and Noise" [Sep 16 1599-1612]
TL;DR: A novel ML PSD estimation scheme is derived that is suitable for sound scenes which besides speech and reverberation consists of an additional noise component whose second-order statistics are known.
44
Sixty Years of Frequency-Domain Monaural Speech Enhancement: From Traditional to Deep Learning Methods.
Chengshi Zheng,Huiyong Zhang,Wenzhe Liu,XiaoXue Luo,Andong Li,Xiaodong Li,Brian C J Moore +6 more
TL;DR: A comprehensive evaluation of some typical monaural speech enhancement methods using the WSJ + Deep Noise Suppression challenge and Voice Bank + DEMAND datasets to give an intuitive and unified comparison and showed that compression of the input features was important for simulated normal-hearing listeners but not for simulated hearing-impaired listeners.
44
Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA
Andrew Hines,Jan Skoglund,Anil Kokaram,Naomi Harte +3 more
- 26 May 2013
TL;DR: The Virtual Speech Quality Objective Listener is described and tests the model using two speech corpora: NOIZEUS and E4 and shows that for both datasets ViSQOL performed comparably with PESQ and POLQA.
New orthogonal polynomials for speech signal and image processing
TL;DR: This study attempts to demonstrate that the proposed polynomials can be applied in the field of signal and image processing because of the promising properties of this polynomial especially in its localisation and energy compaction capabilities.
43
An Evaluation of Objective Quality Measures for Speech Intelligibility Prediction
Cees H. Taal,Richard C. Hendriks,Richard Heusdens,Jesper Jensen,Ulrik Kjems +4 more
- 06 Sep 2009
TL;DR: It is shown that cSII does not necessarily show better performance compared to conventional objective (speech)-quality measures, and the DAU-model is the only method with reasonable results for all processing conditions.
References
Multivariate Adaptive Regression Splines
TL;DR: In this article, a new method is presented for flexible regression modeling of high dimensional data, which takes the form of an expansion in product spline basis functions, where the number of basis functions as well as the parameters associated with each one (product degree and knot locations) are automatically determined by the data.
Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs
Antony William Rix,John G. Beerends,Michael Peter Hollier,Andries Pieter Hekstra +3 more
- 07 May 2001
TL;DR: A new model has been developed for use across a wider range of network conditions, including analogue connections, codecs, packet loss and variable delay, known as perceptual evaluation of speech quality (PESQ).
2.8K
•Book
Speech Enhancement: Theory and Practice
Philipos C. Loizou
- 07 Jun 2007
TL;DR: Clear and concise, this book explores how human listeners compensate for acoustic noise in noisy environments and suggests steps that can be taken to realize the full potential of these algorithms under realistic conditions.
2.5K
Evaluation of Objective Quality Measures for Speech Enhancement
Yi Hu,Philipos C. Loizou +1 more
TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
1.9K