Proceedings Article10.21437/INTERSPEECH.2015-246
Sparse coding based features for speech units classification.
Pulkit Sharma,Vinayak Abrol,A. D. Dileep,Anil Kumar Sao +3 more
- 06 Sep 2015
- pp 712-715
12
TL;DR: In this paper, the training data belonging to each class is clustered into multiple clusters, and a principal component analysis (PCA) based dictionary is learnt for each cluster, where coefficients corresponding to middle principal components can effectively discriminate among different speech units.
read more
Abstract: In this work, we propose sparse representation based features for speech units classification tasks. In order to effectively capture the variations in a speech unit, the proposed method employs multiple class specific dictionaries. Here, the training data belonging to each class is clustered into multiple clusters, and a principal component analysis (PCA) based dictionary is learnt for each cluster. It has been observed that coefficients corresponding to middle principal components can effectively discriminate among different speech units. Exploiting this observation, we propose to use a transformation function known as weighted decomposition (WD) of principal components, which is used to emphasize the discriminative information present in the PCA-based dictionary. In this paper, both raw speech samples and mel frequency cepstral coefficients (MFCC) are used as an initial representation for feature extraction. For comparison, various popular dictionary learning techniques such as K-singular value decomposition (KSVD), simultaneous codeword optimization (SimCO) and greedy adaptive dictionary (GAD) are also employed in the proposed framework. The effectiveness of the proposed features is demonstrated using continuous density hidden Markov model (CDHMM) based classifiers for (i) classification of isolated utterances of E-set of English alphabet, (ii) classification of consonant-vowel (CV) segments in Hindi language and (iii) classification of phoneme from TIMIT phonetic corpus.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Greedy dictionary learning for kernel sparse representation based classifier
TL;DR: Compared to the existing state-of-the-art methods, the proposed method has much less computational complexity, but performs similar for various pattern classification tasks.
21
Voiced/nonvoiced detection in compressively sensed speech signals
TL;DR: The proposed novel unsupervised voiced/nonvoiced (V/NV) detection method attempts to exploit the fact that there is significant glottal activity during production of voiced speech while the same is not true for nonvoiced speech, and provides compelling evidence of the effectiveness of sparse feature vector for V/NV detection.
21
Sparse coding based features for speech units classification
TL;DR: Both raw speech samples and mel frequency cepstral coefficients are used as an initial representation for feature extraction and a transformation function known as weighted decomposition (WD) of principal components is used to emphasize the discriminative information present in the PCA-based dictionary.
17
Greedy double sparse dictionary learning for sparse representation of speech signals
TL;DR: A greedy double sparse (DS) dictionary learning algorithm for speech signals, where the dictionary is the product of a predefined base dictionary, and a sparse matrix, and it is shown that the dictionary can be learned efficiently in the coefficient domain rather than the signal domain.
14
Reducing footprint of unit selection based text-to-speech system using compressed sensing and sparse representation
TL;DR: Experimental studies on two different Indian languages suggest that CS/SR based footprint reduction methods can be used as an alternative to existing compression methods employed in USS system.
9
Related Papers (5)
Vinayak Abrol,Pulkit Sharma,Anil Kumar Sao +2 more
- 25 Aug 2013
[...]
Ivana Tosic,Pascal Frossard +1 more