Temporal Envelope of Time-Compressed Speech Represented in the Human Auditory Cortex
Kirill V. Nourski, Richard A. Reale, Hiroyuki Oya, Hiroto Kawasaki, Christopher K. Kovach, Haiming Chen, Matthew A. Howard, John F. Brugge
Abstract: Speech comprehension relies on temporal cues contained in the speech envelope, and the auditory cortex has been implicated as playing a critical role in encoding this temporal information. We investigated auditory cortical responses to speech stimuli in subjects undergoing invasive electrophysiological monitoring for pharmacologically refractory epilepsy. Recordings were made from multicontact electrodes implanted in Heschl's gyrus (HG). Speech sentences, time compressed from 0.75 to 0.20 of natural speaking rate, elicited average evoked potentials (AEPs) and increases in event-related band power (ERBP) of cortical high-frequency (70–250 Hz) activity. Cortex of posteromedial HG, the presumed core of human auditory cortex, represented the envelope of speech stimuli in the AEP and ERBP. Envelope following in ERBP, but not in AEP, was evident in both language-dominant and -nondominant hemispheres for relatively high degrees of compression where speech was not comprehensible. Compared to posteromedial HG, responses from anterolateral HG—an auditory belt field—exhibited longer latencies, lower amplitudes, and little or no time locking to the speech envelope. The ability of the core auditory cortex to follow the temporal speech envelope over a wide range of speaking rates leads us to conclude that such capacity in itself is not a limiting factor for speech comprehension.
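The two quantities named in the abstract, the temporal envelope and the event-related band power (ERBP) of 70–250 Hz activity, can be illustrated with a minimal sketch. This is not the authors' analysis pipeline: it swaps in an FFT-mask band-pass filter and an FFT-based Hilbert envelope as simple stand-ins, and the function names, the decibel normalization, and the baseline window are assumptions made for illustration only.

```python
import numpy as np


def analytic_envelope(x):
    """Envelope as the magnitude of the analytic signal (FFT-based Hilbert transform)."""
    n = len(x)
    spectrum = np.fft.fft(x)
    # Keep DC (and Nyquist for even n), double positive frequencies, zero negative ones.
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(spectrum * weights))


def bandpass_fft(x, fs, lo, hi):
    """Crude band-pass filter: zero all FFT bins outside [lo, hi] Hz."""
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    spectrum = np.fft.rfft(x)
    spectrum[(freqs < lo) | (freqs > hi)] = 0.0
    return np.fft.irfft(spectrum, n=len(x))


def erbp_db(trial, fs, baseline_samples, lo=70.0, hi=250.0):
    """Band power over time, in dB relative to the mean power of a pre-stimulus baseline.

    Assumption: the first `baseline_samples` samples of `trial` precede stimulus onset.
    """
    power = analytic_envelope(bandpass_fft(trial, fs, lo, hi)) ** 2
    power = np.maximum(power, np.finfo(float).tiny)  # guard against log10(0)
    baseline = power[:baseline_samples].mean()
    return 10.0 * np.log10(power / baseline)
```

Under these assumptions, envelope following could be quantified by correlating the output of `erbp_db` with the stimulus envelope returned by `analytic_envelope`, for example with `np.corrcoef`, at each compression ratio.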