TL;DR: The same principles that govern visual perception can explain many seemingly disparate auditory phenomena, and similarity suggests that the same neural mechanisms control attention and influence perception across different sensory modalities.
TL;DR: The neural substrates underlying auditory feedback control of speech were investigated using a combination of functional magnetic resonance imaging (fMRI) and computational modeling, revealing increased activity in bilateral superior temporal cortex during shifted feedback and increased influence of bilateral auditory cortical areas on right frontal areas during shifted speech.
TL;DR: It is proposed that the multisensory perception of flavor may be indicative of the fact that the taxonomy currently used to define the authors' senses is simply not appropriate.
TL;DR: Peripheral hearing deficits are likely to cause a number of interrelated problems that challenge the ability of HL listeners to communicate in social settings requiring selective attention.
Abstract: A common complaint among listeners with hearing loss (HL) is that they have difficulty communicating in common social settings. This article reviews how normal-hearing listeners cope in such settings, especially how they focus attention on a source of interest. Results of experiments with normal-hearing listeners suggest that the ability to selectively attend depends on the ability to analyze the acoustic scene and to form perceptual auditory objects properly. Unfortunately, sound features important for auditory object formation may not be robustly encoded in the auditory periphery of HL listeners. In turn, impaired auditory object formation may interfere with the ability to filter out competing sound sources. Peripheral degradations are also likely to reduce the salience of higher-order auditory cues such as location, pitch, and timbre, which enable normal-hearing listeners to select a desired sound source out of a sound mixture. Degraded peripheral processing is also likely to increase the time required to form auditory objects and focus selective attention so that listeners with HL lose the ability to switch attention rapidly (a skill that is particularly important when trying to participate in a lively conversation). Finally, peripheral deficits may interfere with strategies that normal-hearing listeners employ in complex acoustic settings, including the use of memory to fill in bits of the conversation that are missed. Thus, peripheral hearing deficits are likely to cause a number of interrelated problems that challenge the ability of HL listeners to communicate in social settings requiring selective attention.
TL;DR: This work finds that scene analysis starts much earlier in the auditory pathways than previously reported, and reveals that subcortical structures may already contribute to the analysis of auditory scenes.
TL;DR: The current work demonstrates that grouping can be strongly dependent on culture, and suggests that native language can exert an influence on general auditory perception at a basic level.
Abstract: Many aspects of perception are known to be shaped by experience, but others are thought to be innate universal properties of the brain. A specific example comes from rhythm perception, where one of the fundamental perceptual operations is the grouping of successive events into higher-level patterns, an operation critical to the perception of language and music. Grouping has long been thought to be governed by innate perceptual principles established a century ago. The current work demonstrates instead that grouping can be strongly dependent on culture. Native English and Japanese speakers were tested for their perception of grouping of simple rhythmic sequences of tones. Members of the two cultures showed different patterns of perceptual grouping, demonstrating that these basic auditory processes are not universal but are shaped by experience. It is suggested that the observed perceptual differences reflect the rhythms of the two languages, and that native language can exert an influence on general auditory perception at a basic level.
TL;DR: It is shown that neural correlates of auditory awareness in informational masking emerge between early and late stages of processing within the auditory cortex, presumably from the primary auditory cortex.
Abstract: Our ability to detect target sounds in complex acoustic backgrounds is often limited not by the ear's resolution, but by the brain's information-processing capacity. The neural mechanisms and loci of this “informational masking” are unknown. We combined magnetoencephalography with simultaneous behavioral measures in humans to investigate neural correlates of informational masking and auditory perceptual awareness in the auditory cortex. Cortical responses were sorted according to whether or not target sounds were detected by the listener in a complex, randomly varying multi-tone background known to produce informational masking. Detected target sounds elicited a prominent, long-latency response (50–250 ms), whereas undetected targets did not. In contrast, both detected and undetected targets produced equally robust auditory middle-latency, steady-state responses, presumably from the primary auditory cortex. These findings indicate that neural correlates of auditory awareness in informational masking emerge between early and late stages of processing within the auditory cortex.
TL;DR: This is the first study that demonstrates existence of a global tinnitus network of long-range cortical connections outside the central auditory system and proposes that this global extend of the tinn Titus network is crucial for the continuos perception of theTinnitus tone and a therapeutical intervention that is able to change this network should result in relief of tinnitis.
Abstract: Background
Tinnitus is an auditory phantom perception that is most likely generated in the central nervous system. Most of the tinnitus research has concentrated on the auditory system. However, it was suggested recently that also non-auditory structures are involved in a global network that encodes subjective tinnitus. We tested this assumption using auditory steady state responses to entrain the tinnitus network and investigated long-range functional connectivity across various non-auditory brain regions.
Methods and Findings
Using whole-head magnetoencephalography we investigated cortical connectivity by means of phase synchronization in tinnitus subjects and healthy controls. We found evidence for a deviating pattern of long-range functional connectivity in tinnitus that was strongly correlated with individual ratings of the tinnitus percept. Phase couplings between the anterior cingulum and the right frontal lobe and phase couplings between the anterior cingulum and the right parietal lobe showed significant condition x group interactions and were correlated with the individual tinnitus distress ratings only in the tinnitus condition and not in the control conditions.
Conclusions
To the best of our knowledge this is the first study that demonstrates existence of a global tinnitus network of long-range cortical connections outside the central auditory system. This result extends the current knowledge of how tinnitus is generated in the brain. We propose that this global extend of the tinnitus network is crucial for the continuos perception of the tinnitus tone and a therapeutical intervention that is able to change this network should result in relief of tinnitus.
TL;DR: Tone-deaf individuals, who cannot consciously perceive pitch differences, can paradoxically reproduce pitch intervals in correct directions, suggesting that multiple neural pathways have evolved for sound perception and production, so that pitch information sufficient for intact speech can be obtained separately from pathways necessary for conscious perception.
TL;DR: This prospective longitudinal study tested dynamic auditory and visual processing, speech-in-noise perception, phonological ability and orthographic ability in 62 five-year-old preschool children and demonstrated that dynamic auditory processing was related to speech perception, which itself wasrelated to phonological awareness.
TL;DR: The McGurk effect paradigm was used to examine the developmental onset of inter-language differences between Japanese and English in auditory-visual speech perception and showed that the degree of visual influence was low and equivalent for Japanese andEnglish language 6-year-olds, and increased over age for English language participants.
Abstract: The McGurk effect paradigm was used to examine the developmental onset of inter-language differences between Japanese and English in auditory-visual speech perception. Participants were asked to identify syllables in audiovisual (with congruent or discrepant auditory and visual components), audio-only, and video-only presentations at various signal-to-noise levels. In Experiment 1 with two groups of adults, native speakers of Japanese and native speakers of English, the results on both percent visually influenced responses and reaction time supported previous reports of a weaker visual influence for Japanese participants. In Experiment 2, an additional three age groups (6, 8, and 11 years) in each language group were tested. The results showed that the degree of visual influence was low and equivalent for Japanese and English language 6-year-olds, and increased over age for English language participants, especially between 6 and 8 years, but remained the same for Japanese participants. This may be related to the fact that English language adults and older children processed visual speech information relatively faster than auditory information whereas no such inter-modal differences were found in the Japanese participants’ reaction times.
TL;DR: Using functional magnetic resonance imaging, cortical visual motion area MT+/V5 responded to auditory motion in two rare subjects who had been blind since early childhood and whose vision was partially recovered in adulthood, suggesting that cross-modal plasticity can be influenced by the normal functional specialization of a cortical region.
Abstract: Using functional magnetic resonance imaging, we found that cortical visual motion area MT+/V5 responded to auditory motion in two rare subjects who had been blind since early childhood and whose vision was partially recovered in adulthood. Visually normal control subjects did not show similar auditory responses. These auditory responses in MT+ were specific to motion compared with other complex auditory stimuli including frequency sweeps and speech. Thus, MT+ developed motion-specific responses to nonvisual input, suggesting that cross-modal plasticity can be influenced by the normal functional specialization of a cortical region. Regarding sight recovery after early blindness, our results further demonstrate that cross-modal responses coexist with regained visual responses within the visual cortex.
TL;DR: These analyses established the importance of distinguishing between the accuracy of music perception and the appraisal of musical stimuli when using music listening as an index of implant success, and limitations in the utility of speech perception in predicting musical perception and appraisal underscore the utility in music perception as an alternative outcome measure for evaluating implant outcomes.
Abstract: The research examined whether performance by adult cochlear implant recipients on a variety of recognition and appraisal tests derived from real-world music could be predicted from technological, demographic, and life experience variables, as well as speech recognition scores. A representative sample of 209 adults implanted between 1985 and 2006 participated. Using multiple linear regression models and generalized linear mixed models, sets of optimal predictor variables were selected that effectively predicted performance on a test battery that assessed different aspects of music listening. These analyses established the importance of distinguishing between the accuracy of music perception and the appraisal of musical stimuli when using music listening as an index of implant success. Importantly, neither device type nor processing strategy predicted music perception or music appraisal. Speech recognition performance was not a strong predictor of music perception, and primarily predicted music perception when the test stimuli included lyrics. Additionally, limitations in the utility of speech perception in predicting musical perception and appraisal underscore the utility of music perception as an alternative outcome measure for evaluating implant outcomes. Music listening background, residual hearing (i.e., hearing aid use), cognitive factors, and some demographic factors predicted several indices of perceptual accuracy or appraisal of music.
TL;DR: These findings refine the knowledge of the circuitry underlying fear memory but they also provide novel insights into the neural substrates that govern the degree to which acquired fear of a tone generalizes to other tones.
Abstract: Although the lateral nucleus of the amygdala (LA) is essential for conditioned auditory fear memory, an emerging theme is that plasticity in multiple brain regions contributes to fear memory formation. The LA receives direct projections from the auditory thalamus, specifically the medial division of the medial geniculate nucleus (MGm) and adjacent posterior intralaminar nucleus (PIN). While traditionally viewed as a simple relay structure, mounting evidence implicates the thalamus in diverse cognitive processes. We investigated the role of plasticity in the MGm/PIN in auditory fear memory. First we found that auditory fear conditioning (but not control manipulations) increased the levels of activated CREB in both the MGm and PIN. Next, using viral vectors, we showed that exogenously increasing CREB in this region specifically enhanced formation of an auditory conditioned fear memory without affecting expression of an auditory fear memory, formation of a contextual fear memory, or basic auditory processing. Interestingly, mice with increased CREB levels in the MGm/PIN also showed broad auditory fear generalization (in contrast to control mice, they exhibited fear responses to tones of other frequencies). Together, these results implicate CREB-mediated plasticity in the MGm/PIN in both the formation and generalization of conditioned auditory fear memory. Not only do these findings refine our knowledge of the circuitry underlying fear memory but they also provide novel insights into the neural substrates that govern the degree to which acquired fear of a tone generalizes to other tones.
TL;DR: These findings correlated with parameter estimates obtained from a quantitative model of time estimation, and provide evidence of a fundamental deficit in temporal auditory precision in schizophrenia.
TL;DR: The anatomical and functional bases of somatosensory influences on auditory processing in the normal brainstem and midbrain are outlined and how interactions between the auditory and somatoensory system are modified through deafness is explored.
Abstract: Purpose This review outlines the anatomical and functional bases of somatosensory influences on auditory processing in the normal brainstem and midbrain. It then explores how interactions between t...
TL;DR: The findings suggest that auditory comprehension deficits are predominantly semantic in nature, even following acute left hemisphere disruption, which supports the hypothesis that the right hemisphere is capable of speech sound processing in the intact brain.
TL;DR: A biologically plausible model of how the auditory system is able to parse complex acoustic scenes into meaningful auditory objects and streams under adverse conditions is presented, where the role of cortical mechanisms in organizing complex auditory scenes is explored.
Abstract: Sound systems and speech technologies can benefit greatly from a deeper understanding of how the auditory system, and particularly the auditory cortex, is able to parse complex acoustic scenes into meaningful auditory objects and streams under adverse conditions. In the current work, a biologically plausible model of this process is presented, where the role of cortical mechanisms in organizing complex auditory scenes is explored. The model consists of two stages: (i) a feature analysis stage that maps the acoustic input into a multidimensional cortical representation and (ii) an integrative stage that recursively builds up expectations of how streams evolve over time and reconciles its predictions with the incoming sensory input by sorting it into different clusters. This approach yields a robust computational scheme for speaker separation under conditions of speech or music interference. The model can also emulate the archetypal streaming percepts of tonal stimuli that have long been tested in human subjects. The implications of this model are discussed with respect to the physiological correlates of streaming in the cortex as well as the role of attention and other top-down influences in guiding sound organization.
TL;DR: This article examined temporal processing in relation to Chinese reading acquisition and impairment and found that visual but not auditory temporal processing contributed unique variance to Chinese character recognition even with other cognitive measures controlled, suggesting it may be as important a correlate of reading ability in Chinese as in alphabetic scripts.
Abstract: This study examined temporal processing in relation to Chinese reading acquisition and impairment. The performances of 26 Chinese primary school children with developmental dyslexia on tasks of visual and auditory temporal order judgement, rapid naming, visual-orthographic knowledge, morphological, and phonological awareness were compared with those of 26 reading level ability controls (RL) and 26 chronological age controls (CA). Dyslexic children performed worse than the CA group but similar to the RL group on measures of accurate processing of auditory and visual-order stimuli, rapid naming, morphological awareness, and phonological awareness and a minority performed worse on the two temporal processing tasks. However, hierarchical regression analyses revealed that visual but not auditory temporal processing contributed unique variance to Chinese character recognition even with other cognitive measures controlled, suggesting it may be as important a correlate of reading ability in Chinese as in alphabetic scripts.
TL;DR: The present study showed that attenuation of sensation occurred both when participants themselves performed a goal-directed action and when they observed experimenter performing the same action, although they clearly reported that the tones were produced by other during action observation and by themselves during their own action.
TL;DR: The hypothesis that auditory hallucinations are internally generated speech perceptions that are lateralized to the left temporal lobe, in the peri-Sylvian region is advanced, which predicts that hallucinating patients should have problems identifying a simultaneously presented external speech sound, as measured through performance on the dichotic listening (DL) paradigm with consonant–vowel syllables.
Abstract: In this article we review research in our laboratory on auditory hallucinations using behavioral and MRI measure. The review consists of both previously published and new data that for the fi rst time is presented together in a cohesive way. Auditory hallucinations are among the most common symptoms in schizophrenia, affecting more than 70% of the patients. We here advance the hypothesis that auditory hallucinations are internally generated speech perceptions that are lateralized to the left temporal lobe, in the peri-Sylvian region. From this we predict that hallucinating patients should have problems identifying a simultaneously presented external speech sound, as measured through performance on the dichotic listening (DL) paradigm with consonant-vowel syllables, since this technique lateralizes the stimulus input. Across a series of behavioral experiments, we have shown that patients with schizophrenia who experience frequent auditory hallucinations fail to demonstrate an expected right ear advantage on the dichotic listening test. Absence of a right ear advantage is indicative of a functional defi cit in the left peri-Sylvian region. The results also revealed that patients with ongoing auditory hallucinations were more impaired than patients with previous hallucinations, and that a higher score on the hallucination item in a standard symptom rating scale (BPRS) correlated negatively with number of correct reports for the right ear stimulus. Moreover, we have found that schizophrenia patients fail to shift attention to the left ear stimulus, when explicitly instructed to focus on the right or left ear stimulus only, thus showing a defi cit in inhibition of attention and response-inhibition. The behavioral DL data are substantiated in two MR morphometry studies that revealed signifi cant reductions in grey matter density in the left peri-Sylvian region in hallucinating patients, and patients with reduced left temporal lobe grey matter density. Hallucinating patients also failed to show a right ear advantage in the dichotic listening test. Ongoing fMRI studies are focused on the underlying synaptic and molecular mechanisms by investigating the effects of the glutamate antagonist drug memantine on auditory perception and speech lateralization, and examination of temporal cortex-specifi c gene expression in the left peri-Sylvian region.
TL;DR: In this article, the authors present the case for object perception in vision and argue that an analog of object perception occurs in auditory perception, and propose a notion of an auditory object that is stronger than just that of an intentional object of audition, but does not identify auditory objects with the ordinary material objects we see.
Abstract: Vision has been the primary focus of naturalistic philosophical research concerning perception and perceptual experience. Guided by visual experience and vision science, many philosophers have focused upon theoretical issues dealing with the perception of objects. Recently, however, hearing researchers have discussed auditory objects. I present the case for object perception in vision, and argue that an analog of object perception occurs in auditory perception. I propose a notion of an auditory object that is stronger than just that of an intentional object of audition, but that does not identify auditory objects with the ordinary material objects we see. 1. Objects in Perception Humans understand the world in terms of objects. We take the environment to be populated by things like forks and bottles and steaks. Whether or not the world contains any such items, medium-sized dry goods are one central component to our conceptual schemes. Objects also feature in how we perceive the world to be. Several initial considerations support this intuitive claim. Birds attract and hold our attention as we track them in flight. So, if we can attend only to what we perceive, we visually perceive material objects. Objects are subjects of empirical beliefs formed on the strength of visual experience. If perceptual experiences constrain the structure and content of thoughts, the experience of objects explains the structure and content of thoughts about objects. Common actions, such as reaching for a spatula or swinging a racket at a ball, target objects. If the details of what we do in such cases depend upon characteristics of what we see, perception targets objects. Thus, attention, perceptual belief, and action hint that experience not merely causes cognition aimed at objects, but that it does so because objects figure among the things we perceive. An object-involving structure for perception helps explain the object-involving structures of attention, belief, and action. This much seems compelling, at least when we focus upon vision. Seeing, however, typically is presented as an exemplar of perceiving. Given the prominence of objects in visual perception, it is tempting to think that all perceiving concerns objects, their features, and their
TL;DR: This review emphasizes recent work on two aspects of auditory processing: auditory spatial processing and the role of the auditory cortex in both nonhuman primates and human subjects and temporal processing.
Abstract: Interest has recently surged in the neural mechanisms of audition, particularly with regard to functional imaging studies in human subjects. This review emphasizes recent work on two aspects of auditory processing. The first explores auditory spatial processing and the role of the auditory cortex in both nonhuman primates and human subjects. The interactions with visual stimuli, the ventriloquism effect, and the ventriloquism aftereffect are also reviewed. The second aspect is temporal processing. Studies investigating temporal integration, forward masking, and gap detection are reviewed, as well as examples from the birdsong system and echolocating bats.
TL;DR: N100 and MMN were found to be less affected by age than N2b and P300, and the higher influence of aging on late cognitive processes than on the perceptual and pre-attentive ones.
TL;DR: The coding of complex sounds in the early auditory system has a 'standard model' based on the known physiology of the cochlea and main brainstem pathways, but between the early and late auditory system, the role of primary auditory cortex (A1) is still debated.
TL;DR: Research bridging the study of general auditory processing and speech perception shows that the latter is constrained and influenced by operating characteristics of the auditory system and that the understanding of the processes involved in speech perception is enhanced by study within a more general framework.
Abstract: The complexities of the acoustic speech signal pose many significant challenges for listeners. Although perceiving speech begins with auditory processing, investigation of speech perception has progressed mostly independently of study of the auditory system. Nevertheless, a growing body of evidence demonstrates that cross-fertilization between the two areas of research can be productive. We briefly describe research bridging the study of general auditory processing and speech perception, showing that the latter is constrained and influenced by operating characteristics of the auditory system and that our understanding of the processes involved in speech perception is enhanced by study within a more general framework. The disconnect between the two areas of research has stunted the development of a truly interdisciplinary science, but there is an opportunity for great strides in understanding with the development of an integrated field of auditory cognitive science.
TL;DR: It is suggested that both altered sensory experience and improvised acoustic cues contribute to the auditory-visual speech perception in cochlear-implant users.
Abstract: The present study evaluated auditory-visual speech perception in cochlear-implant users as well as normal-hearing and simulated-implant controls to delineate relative contributions of sensory experience and cues. Auditory-only, visual-only, or auditory-visual speech perception was examined in the context of categorical perception, in which an animated face mouthing ∕ba∕, ∕da∕, or ∕ga∕ was paired with synthesized phonemes from an 11-token auditory continuum. A three-alternative, forced-choice method was used to yield percent identification scores. Normal-hearing listeners showed sharp phoneme boundaries and strong reliance on the auditory cue, whereas actual and simulated implant listeners showed much weaker categorical perception but stronger dependence on the visual cue. The implant users were able to integrate both congruent and incongruent acoustic and optical cues to derive relatively weak but significant auditory-visual integration. This auditory-visual integration was correlated with the duration of the implant experience but not the duration of deafness. Compared with the actual implant performance, acoustic simulations of the cochlear implant could predict the auditory-only performance but not the auditory-visual integration. These results suggest that both altered sensory experience and improvised acoustic cues contribute to the auditory-visual speech perception in cochlear-implant users.
TL;DR: A multivariate partial least-squares model was built to relate the ratings of unpleasantness with an auditory representation derived from a model of processing in the auditory pathway, and it was observed that the existence region corresponded to spectral frequencies between 2500 and 5500 Hz, and temporal modulations in the range 1-16 Hz.
Abstract: Certain sounds, for example, the squeal of chalk on a blackboard, are typically perceived as highly unpleasant. This study addressed the question of what aspects of the auditory representation of such sounds are associated with judgments of unpleasantness. Participants rated the perceived unpleasantness of a large number of sounds that included “griding” and other unpleasant sounds. A multivariate partial least-squares (PLS) model was then built to relate the ratings of unpleasantness with an auditory representation derived from a model of processing in the auditory pathway. The “existence region” of unpleasantness in the auditory space of frequency-temporal modulation was determined after the PLS model had been validated by predicting the unpleasantness of novel sounds from the auditory representation. It was observed that the existence region corresponded to spectral frequencies between 2500 and 5500 Hz, and temporal modulations in the range 1–16 Hz.