Journal Article10.1038/NATURE04485
Efficient auditory coding
706
TL;DR: It is shown that, for natural sounds, the complete acoustic waveform can be represented efficiently with a nonlinear model based on a population spike code, which shows striking similarities to time-domain cochlear filter estimates, have a frequency-bandwidth dependence similar to that of auditory nerve fibres, and yield significantly greater coding efficiency than conventional signal representations.
read more
Abstract: Efficient coding theory posits that sensory systems are under strong evolutionary and developmental pressures to utilize highly efficient codes (Barlow, 1961; Atick, 1992; Simoncelli and Olshausen, 2001; Laughlin and Sejnowski, 2003). Using information theory, the basis of modern telecommunications, we have found that mammalian hearing follows this efficient coding principle. Neurons in the inner ear and the "spikes" with which they communicate form an efficient code for natural sounds in the environment (Smith and Lewicki, 2004a, 2005a, 2006). This shows for the first time that the theoretical principle of efficient coding can account for the detailed form of the auditory code, a significant milestone in developing a theoretical understanding of sensory coding. Additionally, the results of applying the same technique to speech coding suggest that the acoustics of speech are optimally adapted to this mammalian auditory code (Smith and Lewicki, 2004b, 2005b). Beyond these scientific issues, we show that a "spike"-like code may also lead to improvements to applications such as digital audio compression and telecommunications.
In addition to our theoretical research, we sought to demonstrate efficient coding in human perception behaviorally. In a pair of experiments, we applied efficient coding theory to the problem of speech perception in individuals using cochlear implants (CI), for which there exist vast individual differences in spectral resolution and speech perception (Zeng et al., 2004b). We present a machine-learning method for CI filterbank design based on the efficient-coding hypothesis. Further, we describe a pair of experiments which evaluate this approach using noise-excited vocoder speech (Shannon et al., 1995). Participants' recognition of continuous speech and isolated syllables is significantly more accurate for speech filtered through the theoretically-motivated efficient-coding filterbank relative to the standard cochleotopic filterbank, particularly for speech transients. These findings offer insight in CI design and provide behavioral evidence for efficient coding in human perception.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Book
Bayesian Reasoning and Machine Learning
David Barber
- 12 Mar 2012
TL;DR: Comprehensive and coherent, this hands-on text develops everything from basic reasoning to advanced techniques within the framework of graphical models, and develops analytical and problem-solving skills that equip them for the real world.
•Proceedings Article
Unsupervised feature learning for audio classification using convolutional deep belief networks
Honglak Lee,Peter T. Pham,Yan Largman,Andrew Y. Ng +3 more
- 07 Dec 2009
TL;DR: In this paper, the authors apply convolutional deep belief networks to audio data and empirically evaluate them on various audio classification tasks and show that the learned features correspond to phones/phonemes.
An Intelligent Fault Diagnosis Method Using Unsupervised Feature Learning Towards Mechanical Big Data
TL;DR: A two-stage learning method inspired by the idea of unsupervised feature learning that uses artificial intelligence techniques to learn features from raw data for intelligent diagnosis of machines that reduces the need of human labor and makes intelligent fault diagnosis handle big data more easily.
1.1K
•Book
What Is Stochastic Resonance? Definitions, Misconceptions, Debates, and Its Relevance to Biology
Mark D. McDonnell,Derek Abbott +1 more
- 01 Jan 2009
TL;DR: This work challenges neuroscientists and biologists to embrace a very broad definition of stochastic resonance in terms of signal-processing “noise benefits”, and to devise experiments aimed at verifying that random variability can play a functional role in the brain, nervous system, or other areas of biology.
Sparse Representation of Sounds in the Unanesthetized Auditory Cortex
TL;DR: The results represent the first quantitative evidence for sparse representations of sounds in the unanesthetized auditory cortex, and are compatible with a model in which most neurons are silent much of the time, and in which representations are composed of small dynamic subsets of highly active neurons.
References
Matching pursuits with time-frequency dictionaries
Stéphane Mallat,Zhifeng Zhang +1 more
TL;DR: The authors introduce an algorithm, called matching pursuit, that decomposes any signal into a linear expansion of waveforms that are selected from a redundant dictionary of functions, chosen in order to best match the signal structures.
10.2K
Advances in neural information processing systems 11
Peter Sollich
- 01 Jan 1999
TL;DR: It is shown that sorting single-trial ERP epochs in order of reaction time and plotting the potentials in 2-D clearly reveals underlying patterns of response variability linked to performance, and a new visualization tool, the 'ERP image', is proposed for investigating variability in latencies and amplitudes of event-evoked responses in spontaneous EEG or MEG records.
3K
Natural image statistics and neural representation
TL;DR: It has long been assumed that sensory neurons are adapted to the statistical properties of the signals to which they are exposed, but recent developments in statistical modeling have enabled researchers to study more sophisticated statistical models for visual images, to validate these models empirically against large sets of data, and to begin experimentally testing the efficient coding hypothesis.
Adaptive greedy approximations
TL;DR: A notion of the coherence of a signal with respect to a dictionary is derived from the characterization of the approximation errors of a pursuit from their statistical properties, which can be obtained from the invariant measure of the pursuit.
1.3K
Communication in Neuronal Networks
TL;DR: The authors are beginning to understand some of the geometric, biophysical, and energy constraints that have governed the evolution of cortical networks and how the brain exploits the adaptability of biological systems to reconfigure in response to changing needs.