Open Access
Using cepstral coefficients for Inhalation pause detection in spontaneous speech
Anders Sjöström,Johan Frid,Merle Horne +2 more
- 01 Jan 2005
- Vol. 1, pp 143-146
TL;DR: The method is most suited for signals with low noise and high average intensity (studio recording) but can also be used on noisier recordings with lower average intensity, albeit with poorer results.
read more
Abstract: A method for recognizing inhalations in spontaneous speech is presented. It is similar to the template matching technique; a distance measure is calculated between a reference sound and an equally long portion of the same sound being tracked. A feature representation consisting of the standard Mel Frequency Cepstral Coefficients (MFCC), obtained by performing a discrete Cosine Transform of the mel-scaled filterbank spectrum is used. MFCC's are calculated every 5 ms. The comparison is then done by computing the euclidian distance between the cepstral coefficients of each frame of the two sounds. A low distance value means that the two compared inhalations are likely to be similar. The method can detect inhalations in both male and female spontaneous speech. The method is most suited for signals with low noise and high average intensity (studio recording) but can also be used on noisier recordings with lower average intensity, albeit with poorer results. (Less)
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
References
Speaking: From Intention to Articulation
Thomas Berg,Willem J. M. Levelt +1 more
TL;DR: In Speaking, Willem "Pim" Levelt, Director of the Max-Planck-Institut fur Psycholinguistik, accomplishes the formidable task of covering the entire process of speech production, from constraints on conversational appropriateness to articulation and self-monitoring of speech.
6.5K
•Book
Human memory : theory and practice
Alan D. Baddeley
- 01 Jan 1990
TL;DR: This book discusses memory, Emotion and Cognition, and the role of Memory in Cognition - Working Memory, as well as Implicit Memory and Recollection.
3.4K
Breathing patterns during spontaneous speech
TL;DR: The data provide novel insight into associations between physiological and linguistic factors in the control of speech breathing, and are suggestive of the existence of neural planning of the respiratory system, in anticipation of the demands of the utterance.
168