Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification

doi:10.1109/ICASSP.2013.6639093

Open AccessProceedings Article10.1109/ICASSP.2013.6639093

Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification

Qian Zhang, +2 more

- 26 May 2013

- pp 7363-7367

26

TL;DR: Variations to supervector pre-processing for phone recognition-support vector machines (PRSVM) based dialect identification are explored and a newly proposed dialect salience measure is applied in supervector dimension selection and compared to a common N-gram frequency based selection.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.18653/V1/W15-3205

Natural Language Processing for Dialectical Arabic: A Survey

Abdulhadi Shoufan, +1 more

- 01 Jul 2015

TL;DR: This paper presents a wide literature review of natural language processing for dialectical Arabic and identifies relevant contributions that address a specific NLP aspect for a specific dialect.

...read moreread less

118

•Journal Article•10.1109/ACCESS.2021.3059504

Systematic Literature Review of Dialectal Arabic: Identification and Detection

Ashraf Elnagar, +4 more

- 15 Feb 2021

- IEEE Access

TL;DR: The authors conducted a systematic literature review that is intended to give insight into the most and least popular research areas, dialects, machine learning approaches, neural network input features, data types, datasets, system evaluation criteria, publication venues, and publication trends.

...read moreread less

60

Proceedings Article•10.21437/ODYSSEY.2016-19

Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems.

Maryam Najafian, +3 more

- 21 Jun 2016

TL;DR: This paper demonstrates that the relatively simple i-vector and phonotactic fused system with recognition accuracy of 84.87% outperforms the i- vector fused results reported in literature, by 4.7%.

...read moreread less

40

Proceedings Article•10.21437/INTERSPEECH.2017-576

Dialect Recognition Based on Unsupervised Bottleneck Features.

Qian Zhang, +1 more

- 20 Aug 2017

TL;DR: An unsupervised BNF extraction diagram is proposed in this study, which is derived from the traditional structure but trained with an estimated phonetic label, all without the need of a secondary transcribed corpus.

...read moreread less

16

Journal Article•10.1007/S00034-017-0724-1

Acoustic Feature Analysis and Discriminative Modeling for Language Identification of Closely Related South-Asian Languages

Farah Adeeba, +1 more

- 01 Aug 2018

- Circuits Systems and Signal Processing

TL;DR: Gaussian mixture model with universal background model (GMM-UBM)-based and I-vector-based language identification approaches are investigated and the results show that GMM-UBm is more effective than the I- vector for language identification of short duration test utterances.

...read moreread less

14

...

Expand

References

•Journal Article•10.1109/TSA.1996.481450

Comparison of four approaches to automatic language identification of telephone speech

M.A. Zissman

- 01 Jan 1996

- IEEE Transactions on Speech and Audio Pr...

TL;DR: Four approaches for automatic language identification of speech utterances are compared: Gaussian mixture model (GMM) classification; single-language phone recognition followed by languaged dependent, interpolated n-gram language modeling (PRLM); parallel PRLM, which uses multiple single- language phone recognizers, each trained in a different language; and languagedependent parallel phone recognition (PPR).

...read moreread less

750

•Proceedings Article

Approaches to Language Identification using Gaussian Mixture Models and Shifted Delta Cepstral Features

Pedro A. Torres-Carrasquillo, +6 more

- 01 Jan 2002

TL;DR: Two GMM-based approaches to language identification that use shifted delta cepstra (SDC) feature vectors to achieve LID performance comparable to that of the best phone-based systems are described.

...read moreread less

481

Journal Article•10.1109/TASL.2006.876860

A Vector Space Modeling Approach to Spoken Language Identification

Haizhou Li, +2 more

- 01 Jan 2007

- IEEE Transactions on Audio, Speech, and ...

TL;DR: The proposed VSM approach leads to a discriminative classifier backend, which is demonstrated to give superior performance over likelihood-based n-gram language modeling (LM) backend for long utterances.

...read moreread less

269

•Proceedings Article

Phonetic Speaker Recognition with Support Vector Machines

William M. Campbell, +4 more

- 09 Dec 2003

TL;DR: A new phone- based SVM speaker recognition approach that halves the error rate of conventional phone-based approaches is introduced and a new kernel based upon a linearization of likelihood ratio scoring is derived.

...read moreread less

156

Proceedings Article•10.1109/ICASSP.1996.543236

Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech

M.A. Zissman, +3 more

- 07 May 1996

TL;DR: A dialect identification technique is described that takes as input extemporaneous, conversational speech spoken in Latin American Spanish and produces as output a hypothesis of the dialect, which could be extended easily to other dialects (and languages) as well.

...read moreread less

115

...

Expand

Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification

Chat with Paper

AI Agents for this Paper

Citations

Natural Language Processing for Dialectical Arabic: A Survey

Systematic Literature Review of Dialectal Arabic: Identification and Detection

Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems.

Dialect Recognition Based on Unsupervised Bottleneck Features.

Acoustic Feature Analysis and Discriminative Modeling for Language Identification of Closely Related South-Asian Languages

References

Comparison of four approaches to automatic language identification of telephone speech

Approaches to Language Identification using Gaussian Mixture Models and Shifted Delta Cepstral Features

A Vector Space Modeling Approach to Spoken Language Identification

Phonetic Speaker Recognition with Support Vector Machines

Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech

Related Papers (5)

Experiments with Lattice-based PPRLM Language Identification

QMDIS: QCRI-MIT advanced dialect identification system

Sichuan dialect speech recognition with deep LSTM network

Word based dialect classification using extreme learning machines

Novel Techniques for Dialectal Arabic Speech Recognition