Journal Article10.1109/TASL.2007.911054
Evaluation of Objective Quality Measures for Speech Enhancement
Yi Hu,Philipos C. Loizou +1 more
1.9K
TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
read more
Abstract: In this paper, we evaluate the performance of several objective measures in terms of predicting the quality of noisy speech enhanced by noise suppression algorithms. The objective measures considered a wide range of distortions introduced by four types of real-world noise at two signal-to-noise ratio levels by four classes of speech enhancement algorithms: spectral subtractive, subspace, statistical-model based, and Wiener algorithms. The subjective quality ratings were obtained using the ITU-T P.835 methodology designed to evaluate the quality of enhanced speech along three dimensions: signal distortion, noise distortion, and overall quality. This paper reports on the evaluation of correlations of several objective measures with these three subjective rating scales. Several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
An expectation-maximization algorithm for multichannel adaptive speech dereverberation in the frequency-domain
Dominic Schmid,Sarmad Malik,Gerald Enzner +2 more
- 25 Mar 2012
TL;DR: This paper forms an overlap-save observation model for the multichannel blind problem in the DFT-domain and derives an iterative ML algorithm for blind equalization and channel identification (ML-BENCH) which comprises two distinct and coupled subsystems.
17
A complex-valued multichannel speech enhancement learning algorithm for optimal tradeoff between noise reduction and speech distortion
TL;DR: A new optimal tradeoff method for multichannel speech enhancement by solving a complex-valued optimization problem subject to a residual noise constraint with the masking threshold of the clean speech is proposed.
17
•Posted Content
DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement.
TL;DR: In this paper, a dual-path transformer-based full-band and sub-band fusion network (DPT-FSNet) was proposed for speech enhancement in the frequency domain.
17
•Proceedings Article
Phase-Only Speech Reconstruction Using Very Short Frames
Erfan Loweimi,Seyed Mohammad Ahadi,Hamid Sheikhzadeh +2 more
- 01 Jan 2011
TL;DR: This paper removed the SIE and found the reason for quality improvement of ordinary phase-only reconstructed speech by frame length extension, and shows that phase spectrum, even in very short frame lengths, can be highly informative.
17
Supervised Speech Separation Using Deep Neural Networks
Yuxuan Wang
- 01 Jan 2015
TL;DR: This dissertation presents a systematic effort to develop monaural speech separation systems using DNNs using support vector machine as classifier to predict the ideal binary mask (IBM), which is a primary goal in computational auditory scene analysis.
17
References
Multivariate Adaptive Regression Splines
TL;DR: In this article, a new method is presented for flexible regression modeling of high dimensional data, which takes the form of an expansion in product spline basis functions, where the number of basis functions as well as the parameters associated with each one (product degree and knot locations) are automatically determined by the data.
Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs
Antony William Rix,John G. Beerends,Michael Peter Hollier,Andries Pieter Hekstra +3 more
- 07 May 2001
TL;DR: A new model has been developed for use across a wider range of network conditions, including analogue connections, codecs, packet loss and variable delay, known as perceptual evaluation of speech quality (PESQ).
2.8K
•Book
Speech Enhancement: Theory and Practice
Philipos C. Loizou
- 07 Jun 2007
TL;DR: Clear and concise, this book explores how human listeners compensate for acoustic noise in noisy environments and suggests steps that can be taken to realize the full potential of these algorithms under realistic conditions.
2.5K
Evaluation of Objective Quality Measures for Speech Enhancement
Yi Hu,Philipos C. Loizou +1 more
TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
1.9K