Efficient Noise Robust Feature Extraction Algorithms for Distributed Speech Recognition (DSR) Systems

doi:10.1023/A:1023410018862

Journal Article•10.1023/A:1023410018862

Bojan Kotnik, +2 more

- 01 Jul 2003

- International Journal of Speech Technolo...

- Vol. 6, Iss: 3, pp 205-219

16

TL;DR: Two innovative front-end processing techniques for noise-robust speech recognition are presented and compared. They include different forms of frame-attenuation, an improved spectral subtraction based on minimum statistics, and a mel-cepstrum feature extraction procedure.

Citations

•Journal Article•10.3390/S20010021

Bee Swarm Activity Acoustic Classification for an IoT-Based Farm Service.

Andrej Zgank

- 19 Dec 2019

- Sensors

TL;DR: The evaluation results showed that good acoustic classification performance can be achieved with the proposed IoT-based bee activity acoustic classification system, and the objective was to successfully classify sound between the normal and swarming conditions in a beehive.

81

•Dissertation

Making music through real-time voice timbre analysis: machine learning and timbral control

Dan Stowell

- 01 Jan 2010

TL;DR: This thesis develops approaches that can be used with a wide variety of musical instruments by applying machine learning techniques to automatically derive the mappings between expressive audio input and control output, with a focus on timbral control.

40

Journal Article•10.1016/J.SIGPRO.2006.10.009

A noise robust feature extraction algorithm using joint wavelet packet subband decomposition and AR modeling of speech signals

Bojan Kotnik, +1 more

- 01 Jun 2007

- Signal Processing

TL;DR: This paper presents a noise robust feature extraction algorithm NRFE using joint wavelet packet decomposition (WPD) and autoregressive (AR) modeling of a speech signal to improve noise robustness and performance.

35

•Journal Article•10.1155/2009/628570

Online speech/music segmentation based on the variance mean of filter bank energy

Marko Kos, +2 more

- 01 Jan 2009

- EURASIP Journal on Advances in Signal Pr...

TL;DR: The proposed VMFBE feature as a stand-alone speech/music discriminator in a segmentation system achieves an overall accuracy of over 94% on radio broadcast material and it outperforms other features used for comparison, by more than 8%.

14

Journal Article•10.1016/J.COMPELECENG.2012.09.003

Voice activity detection algorithm using nonlinear spectral weights, hangover and hangbefore criteria

Damjan Vlaj, +2 more

- 01 Nov 2012

- Computers & Electrical Engineering

TL;DR: Introduces a nonlinear function into the frequency spectrum that improves the detection of vowels, diphthongs, and semivowels within the speech signal, and presents a procedure for faster definition of the optimal constants used by the hangover and hangbefore criteria.

13

References

Journal Article•10.1109/TASSP.1979.1163209

Suppression of acoustic noise in speech using spectral subtraction

S. Boll

- 01 Apr 1979

- IEEE Transactions on Acoustics, Speech, ...

TL;DR: A stand-alone noise suppression algorithm that resynthesizes a speech waveform and can be used as a pre-processor to narrow-band voice communications systems, speech recognition systems, or speaker authentication systems.

5.3K

•Book

Discrete-Time Processing of Speech Signals

J. R. Deller, +2 more

- 01 Mar 1993

TL;DR: The preface to the IEEE Edition explains the background to speech production, coding, and quality assessment and introduces the Hidden Markov Model, the Artificial Neural Network, and Speech Enhancement.

3.1K

•Proceedings Article

The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions

David Pearce, +1 more

- 01 Jan 2000

TL;DR: A database designed to evaluate the performance of speech recognition algorithms in noisy conditions and recognition results are presented for the first standard DSR feature extraction scheme that is based on a cepstral analysis.

2K

Spectral Subtraction Based on Minimum Statistics

Rainer Martin

- 01 Jan 2001

TL;DR: An unbiased noise power estimator based on minimum statistics is derived and its statistical properties and its performance in the context of spectral subtraction are discussed.

680

Proceedings Article•10.1109/ICASSP.1990.115970

Hidden Markov model decomposition of speech and noise

Andrew Varga, +1 more

- 03 Apr 1990

TL;DR: A technique of signal decomposition using hidden Markov models is described that provides an optimal method of decomposing simultaneous processes and has wide implications for signal separation in general and improved speech modeling in particular.

577