Evaluation of Objective Quality Measures for Speech Enhancement

doi:10.1109/TASL.2007.911054

Journal Article10.1109/TASL.2007.911054

Evaluation of Objective Quality Measures for Speech Enhancement

Yi Hu, +1 more

- 01 Jan 2008

- IEEE Transactions on Audio, Speech, and ...

- Vol. 16, Iss: 1, pp 229-238

1.9K

TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1109/ICASSP.2012.6287806

An expectation-maximization algorithm for multichannel adaptive speech dereverberation in the frequency-domain

Dominic Schmid, +2 more

- 25 Mar 2012

TL;DR: This paper forms an overlap-save observation model for the multichannel blind problem in the DFT-domain and derives an iterative ML algorithm for blind equalization and channel identification (ML-BENCH) which comprises two distinct and coupled subsystems.

...read moreread less

17

Journal Article•10.1016/J.NEUCOM.2017.06.018

A complex-valued multichannel speech enhancement learning algorithm for optimal tradeoff between noise reduction and speech distortion

Jingxian Tu, +2 more

- 06 Dec 2017

- Neurocomputing

TL;DR: A new optimal tradeoff method for multichannel speech enhancement by solving a complex-valued optimization problem subject to a residual noise constraint with the masking threshold of the clean speech is proposed.

...read moreread less

17

•Posted Content

DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement.

Feng Dang, +2 more

- 27 Apr 2021

- arXiv: Sound

TL;DR: In this paper, a dual-path transformer-based full-band and sub-band fusion network (DPT-FSNet) was proposed for speech enhancement in the frequency domain.

...read moreread less

17

•Proceedings Article

Phase-Only Speech Reconstruction Using Very Short Frames

Erfan Loweimi, +2 more

- 01 Jan 2011

TL;DR: This paper removed the SIE and found the reason for quality improvement of ordinary phase-only reconstructed speech by frame length extension, and shows that phase spectrum, even in very short frame lengths, can be highly informative.

...read moreread less

17

Supervised Speech Separation Using Deep Neural Networks

Yuxuan Wang

- 01 Jan 2015

TL;DR: This dissertation presents a systematic effort to develop monaural speech separation systems using DNNs using support vector machine as classifier to predict the ideal binary mask (IBM), which is a primary goal in computational auditory scene analysis.

...read moreread less

17

...

Expand

References

•Journal Article•10.1136/BJO.46.11.704

A and V.

Robert W. Stephenson

- 01 Nov 1962

- British Journal of Ophthalmology

46.7K

•Journal Article•10.1214/AOS/1176347963

Multivariate Adaptive Regression Splines

Jerome H. Friedman

- 01 Mar 1991

- Annals of Statistics

TL;DR: In this article, a new method is presented for flexible regression modeling of high dimensional data, which takes the form of an expansion in product spline basis functions, where the number of basis functions as well as the parameters associated with each one (product degree and knot locations) are automatically determined by the data.

...read moreread less

7.9K

Proceedings Article•10.1109/ICASSP.2001.941023

Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs

Antony William Rix, +3 more

- 07 May 2001

TL;DR: A new model has been developed for use across a wider range of network conditions, including analogue connections, codecs, packet loss and variable delay, known as perceptual evaluation of speech quality (PESQ).

...read moreread less

2.8K

•Book

Speech Enhancement: Theory and Practice

Philipos C. Loizou

- 07 Jun 2007

TL;DR: Clear and concise, this book explores how human listeners compensate for acoustic noise in noisy environments and suggests steps that can be taken to realize the full potential of these algorithms under realistic conditions.

...read moreread less

2.5K

Journal Article•10.1109/TASL.2007.911054