Robust Features for Speech Recognition using Temporal Filtering Technique in the Presence of Impulsive Noise

doi:10.5815/IJIGSP.2014.11.03

Open Access • Journal Article

Hajer Rahali, +2 more

- 08 Oct 2014

- International Journal of Image, Graphics...

- Vol. 6, Iss: 11, pp 17-24

9

TL;DR: A robust feature extractor, dubbed Modified Function Cepstral Coefficients (MODFCC) and based on a gammachirp filterbank, Relative Spectral (RASTA) filtering, and an Autoregressive Moving-Average (ARMA) filter, is introduced to improve the robustness of speech recognition systems in additive noise and real-time reverberant environments.

Citations

Journal Article • 10.15680/IJIRSET.2014.0312034

A Comparative Study of Feature Extraction Techniques for Speech Recognition System

Pratik K. Kurzekar, +4 more

- 15 Dec 2014

- International Journal of Innovative Rese...

TL;DR: Speech processing has wide applications in voice dialing, telephone communication, call routing, domestic appliance control, speech-to-text conversion, text-to-speech conversion, lip synchronization, automation systems, etc.

76

Proceedings Article • 10.1109/CONFLUENCE.2016.7508170

An analysis on LPC, RASTA and MFCC techniques in Automatic Speech recognition system

Kartiki Gupta, +1 more

- 01 Jan 2016

TL;DR: The main objective of this research paper is to briefly summarize the speech recognition system and three feature extraction methods that are an integral part of ASR.

69

Book Chapter • 10.1016/B978-1-85617-678-1.00012-0

Speech and Audio Processing

Hazarathaiah Malepati

- 01 Jan 2010

TL;DR: This chapter discusses sound and audio signals, and then explores how audio data is presented to the processor from a variety of audio converters.

54

Journal Article

Recognition and Classification of Human Behavior in Intelligent Surveillance Systems using Hidden Markov Model

Adeleh Farzad, +1 more

- 08 Nov 2015

- International Journal of Image, Graphics...

TL;DR: A high-accuracy human action classification and recognition method using a hidden Markov model classifier, which classifies the investigated behaviors and detects abnormal actions with higher accuracy than other abnormal-behavior detection methods reported in previous works.

9

Journal Article • 10.5815/ijmecs.2022.03.03

Enhanced Deep Hierarchal GRU & BILSTM using Data Augmentation and Spatial Features for Tamil Emotional Speech Recognition

Johnathan Fernandes, +1 more

- 08 Jun 2022

- International Journal of Modern Educatio...

9

References

Journal Article • 10.1109/89.326616

RASTA processing of speech

Hynek Hermansky, +1 more

- 01 Oct 1994

- IEEE Transactions on Speech and Audio Pr...

TL;DR: The theoretical and experimental foundations of the RASTA method are reviewed, the relationship with human auditory perception is discussed, the original method is extended to combinations of additive noise and convolutional noise, and an application is shown to speech enhancement.

2.1K

Proceedings Article

The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions

David Pearce, +1 more

- 01 Jan 2000

TL;DR: A database designed to evaluate the performance of speech recognition algorithms in noisy conditions is described, and recognition results are presented for the first standard DSR feature extraction scheme, which is based on a cepstral analysis.

2K

Journal Article • 10.1016/J.CSL.2010.06.003

The subspace Gaussian mixture model-A structured model for speech recognition

Daniel Povey, +12 more

- 01 Apr 2011

- Computer Speech & Language

TL;DR: A new approach to speech recognition, in which all Hidden Markov Model states share the same Gaussian Mixture Model (GMM) structure with the same number of Gaussians in each state, appears to give better results than a conventional model.

323

Proceedings Article • 10.1109/ICASSP.2010.5495570

Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring

Chanwoo Kim, +1 more

- 14 Mar 2010

TL;DR: A new robust feature extraction algorithm based on a modified approach to power bias subtraction combined with applying a threshold to the power spectral density is presented, showing better performance than the previous implementation.

127

Proceedings Article • 10.21437/INTERSPEECH.2012-221

Noise robust pitch tracking by subband autocorrelation classification

Daniel P. W. Ellis, +1 more

- 01 Jan 2012

TL;DR: Training on various types of noisy speech recordings leads to a great increase in performance over state-of-the-art algorithms, according to both the traditional Gross Pitch Error (GPE) measure, and a proposed novel Pitch Tracking Error (PTE) which more fully reflects the accuracy of both pitch estimation/extraction and voicing detection in a single measure.

89