Non-Linear and Non-Conventional Speech Processing: Alternative Techniques
TL;DR: This special issue aims to cover some problems related to non-linear and non-conventional speech processing and a selected choice of papers based on the presentations delivered at NOLISP’09 has given rise to this issue of Cognitive Computation.
read more
Abstract: This special issue aims to cover some problems related to non-linear and non-conventional speech processing. The origin of this volume is in the ISCA Tutorial and Research Workshop on Non-Linear Speech Processing, NOLISP’09, held at the Universitat de Vic (Catalonia, Spain) on June 25–27, 2009. The series of NOLISP workshops started in 2003 has become a biannual event whose aim is to discuss alternative techniques for speech processing that, in a sense, do not fit into mainstream approaches. A selected choice of papers based on the presentations delivered at NOLISP’09 has given rise to this issue of Cognitive Computation. The papers hereinafter deal with the following topics: What would happen if we were able to overcome the limitations of audiovisual human systems? The answer could be found in the paper by V. Espinosa-Duro, M. Faundez-Zanuy, and J. Mekyska. A growing recurrent self-organizing model for phoneme recognition is presented by Ch. Jlassi, N. Arous, and N. Ellouze. An algorithm for voiced/unvoiced decision and pitch estimation from speech signals based on classification of peaks using the auto-correlation of the speech multi-scale product is presented by M. A. Ben Massoud, A. Bouzid, and N. Elloze. A feature reduction system based on the Discriminative Common Vector is proposed by C.M. Travieso, M. del Pozo, M.A. Ferrer, and J. B. Alonso. A new approach to the diagnosis of Ataxia SCA-2 by application of independent component analysis to a set of data obtained by electro-oculography is presented by R. V. Garcia, F. Rojas, C. G. Puntonet, B. San Roman, L. Velazquez, and R. Rodriguez. A formant tracking technique based on Fourier ridges detection aiming at improving the performance of these algorithms is presented by I. Jemaa, K. Ouni and Y. Laprie A new architecture for vocabulary independent keyword detection for cognitive virtual agents (SEMAINE system) is proposed by M. Wollmer, F. Eyber, A. Graves, B. Schuller, and G. Rigoll. Their word spotting model is composed of a dynamic-Bayesian network and a bidirectional long–short-term memory recurrent neural network. An insight into non-linear transformations for improving the performance of an entropy-based voice activity detector is made in the paper by two guest editors (J. S. and V.Z.). By no means has the range of topics in this Special Issue covered all directions in non-linear speech processing. We would rather consider it a sample presenting several areas in progress. In any case, we hope that they will be of interest for the readers of Cognitive Computation. Our thanks are due to the Editor-in-Chief, Dr. Amir Hussain, for permanent support in preparation of this issue. Of course, this issue would have been impossible without the contributions of all authors. On the other hand, anonymous referees have also done a great job by carefully J. Sole-Casals (&) V. Zaiats Department of Digital Technologies and Information, Escola Politecnica Superior, Universitat de Vic, c/. de la Laura, 13, 08500 Vic (Barcelona), Spain e-mail: jordi.sole@uvic.cat
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
On the selection of non-invasive methods based on speech analysis oriented to automatic Alzheimer disease diagnosis.
Karmele López-de-Ipiña,Jesus-Bernardino Alonso,Carlos M. Travieso,Jordi Solé-Casals,Harkaitz Egiraun,Marcos Faundez-Zanuy,Aitzol Ezeiza,Nora Barroso,Miriam Ecay-Torres,Pablo Martinez-Lage,Unai Martinez de Lizardui +10 more
TL;DR: Artificial Neural Networks (ANN) have been used for the automatic classification of the two classes (AD and control subjects) and results were very satisfactory and promising for early diagnosis and classification of AD patients.
A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios
TL;DR: An advanced real-time speech processing front-end aimed at automatically reducing the distortions introduced by room reverberation in distant speech signals, also considering the presence of background noise is proposed to achieve a significant improvement in speech quality for each speaker.
15
Enhancing the Feature Extraction Process for Automatic Speech Recognition with Fractal Dimensions
TL;DR: The Fractal Dimension of the observed time series is combined with the traditional MFCCs in the feature vector in order to enhance the performance of two different ASR systems.