Efficient auditory coding

doi:10.1038/NATURE04485

Journal Article10.1038/NATURE04485

Efficient auditory coding

James L. McClelland, +2 more

- 01 Jan 2006

- Nature

- Vol. 439, Iss: 7079, pp 978-982

706

TL;DR: It is shown that, for natural sounds, the complete acoustic waveform can be represented efficiently with a nonlinear model based on a population spike code, which shows striking similarities to time-domain cochlear filter estimates, have a frequency-bandwidth dependence similar to that of auditory nerve fibres, and yield significantly greater coding efficiency than conventional signal representations.

Abstract: Efficient coding theory posits that sensory systems are under strong evolutionary and developmental pressures to utilize highly efficient codes (Barlow, 1961; Atick, 1992; Simoncelli and Olshausen, 2001; Laughlin and Sejnowski, 2003). Using information theory, the basis of modern telecommunications, we have found that mammalian hearing follows this efficient coding principle. Neurons in the inner ear and the "spikes" with which they communicate form an efficient code for natural sounds in the environment (Smith and Lewicki, 2004a, 2005a, 2006). This shows for the first time that the theoretical principle of efficient coding can account for the detailed form of the auditory code, a significant milestone in developing a theoretical understanding of sensory coding. Additionally, the results of applying the same technique to speech coding suggest that the acoustics of speech are optimally adapted to this mammalian auditory code (Smith and Lewicki, 2004b, 2005b). Beyond these scientific issues, we show that a "spike"-like code may also lead to improvements to applications such as digital audio compression and telecommunications. In addition to our theoretical research, we sought to demonstrate efficient coding in human perception behaviorally. In a pair of experiments, we applied efficient coding theory to the problem of speech perception in individuals using cochlear implants (CI), for which there exist vast individual differences in spectral resolution and speech perception (Zeng et al., 2004b). We present a machine-learning method for CI filterbank design based on the efficient-coding hypothesis. Further, we describe a pair of experiments which evaluate this approach using noise-excited vocoder speech (Shannon et al., 1995). Participants' recognition of continuous speech and isolated syllables is significantly more accurate for speech filtered through the theoretically-motivated efficient-coding filterbank relative to the standard cochleotopic filterbank, particularly for speech transients. These findings offer insight in CI design and provide behavioral evidence for efficient coding in human perception.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book

Bayesian Reasoning and Machine Learning

David Barber

- 12 Mar 2012

TL;DR: Comprehensive and coherent, this hands-on text develops everything from basic reasoning to advanced techniques within the framework of graphical models, and develops analytical and problem-solving skills that equip them for the real world.

...read moreread less

1.8K

•Proceedings Article

Unsupervised feature learning for audio classification using convolutional deep belief networks

Honglak Lee, +3 more

- 07 Dec 2009

TL;DR: In this paper, the authors apply convolutional deep belief networks to audio data and empirically evaluate them on various audio classification tasks and show that the learned features correspond to phones/phonemes.

...read moreread less

1.2K

Journal Article•10.1109/TIE.2016.2519325

An Intelligent Fault Diagnosis Method Using Unsupervised Feature Learning Towards Mechanical Big Data

Yaguo Lei, +4 more

- 19 Jan 2016

- IEEE Transactions on Industrial Electron...

TL;DR: A two-stage learning method inspired by the idea of unsupervised feature learning that uses artificial intelligence techniques to learn features from raw data for intelligent diagnosis of machines that reduces the need of human labor and makes intelligent fault diagnosis handle big data more easily.

...read moreread less

1.1K

•Book

What Is Stochastic Resonance? Definitions, Misconceptions, Debates, and Its Relevance to Biology

Mark D. McDonnell, +1 more

- 01 Jan 2009

TL;DR: This work challenges neuroscientists and biologists to embrace a very broad definition of stochastic resonance in terms of signal-processing “noise benefits”, and to devise experiments aimed at verifying that random variability can play a functional role in the brain, nervous system, or other areas of biology.

...read moreread less

817

•Journal Article•10.1371/JOURNAL.PBIO.0060016

Sparse Representation of Sounds in the Unanesthetized Auditory Cortex

Tomáš Hromádka, +2 more

- 29 Jan 2008

- PLOS Biology

TL;DR: The results represent the first quantitative evidence for sparse representations of sounds in the unanesthetized auditory cortex, and are compatible with a model in which most neurons are silent much of the time, and in which representations are composed of small dynamic subsets of highly active neurons.

...read moreread less

686

...

Expand

References

Journal Article•10.1109/78.258082

Matching pursuits with time-frequency dictionaries

Stéphane Mallat, +1 more

- 01 Aug 1993

- IEEE Transactions on Signal Processing

TL;DR: The authors introduce an algorithm, called matching pursuit, that decomposes any signal into a linear expansion of waveforms that are selected from a redundant dictionary of functions, chosen in order to best match the signal structures.

...read moreread less

10.2K

Advances in neural information processing systems 11

Peter Sollich

- 01 Jan 1999

TL;DR: It is shown that sorting single-trial ERP epochs in order of reaction time and plotting the potentials in 2-D clearly reveals underlying patterns of response variability linked to performance, and a new visualization tool, the 'ERP image', is proposed for investigating variability in latencies and amplitudes of event-evoked responses in spontaneous EEG or MEG records.

...read moreread less

3K

Journal Article•10.1146/ANNUREV.NEURO.24.1.1193

Natural image statistics and neural representation

Eero P. Simoncelli, +1 more

- 01 Jan 2001

- Annual Review of Neuroscience

TL;DR: It has long been assumed that sensory neurons are adapted to the statistical properties of the signals to which they are exposed, but recent developments in statistical modeling have enabled researchers to study more sophisticated statistical models for visual images, to validate these models empirically against large sets of data, and to begin experimentally testing the efficient coding hypothesis.

...read moreread less

2.6K

Journal Article•10.1007/BF02678430

Adaptive greedy approximations

Geoffrey M. Davis, +2 more

- 01 Mar 1997

- Constructive Approximation

TL;DR: A notion of the coherence of a signal with respect to a dictionary is derived from the characterization of the approximation errors of a pursuit from their statistical properties, which can be obtained from the invariant measure of the pursuit.

...read moreread less

1.3K

•Journal Article•10.1126/SCIENCE.1089662

Communication in Neuronal Networks

Simon B. Laughlin, +2 more

- 26 Sep 2003

- Science

TL;DR: The authors are beginning to understand some of the geometric, biophysical, and energy constraints that have governed the evolution of cortical networks and how the brain exploits the adaptability of biological systems to reconfigure in response to changing needs.

...read moreread less

1K

...

Expand

Efficient auditory coding

Chat with Paper

AI Agents for this Paper

Citations

Bayesian Reasoning and Machine Learning

Unsupervised feature learning for audio classification using convolutional deep belief networks

An Intelligent Fault Diagnosis Method Using Unsupervised Feature Learning Towards Mechanical Big Data

What Is Stochastic Resonance? Definitions, Misconceptions, Debates, and Its Relevance to Biology

Sparse Representation of Sounds in the Unanesthetized Auditory Cortex

References

Matching pursuits with time-frequency dictionaries

Advances in neural information processing systems 11

Natural image statistics and neural representation

Adaptive greedy approximations

Communication in Neuronal Networks

Related Papers (5)

Emergence of simple-cell receptive field properties by learning a sparse code for natural images

Possible Principles Underlying the Transformations of Sensory Messages

Natural image statistics and neural representation

Sparse Coding with an Overcomplete Basis Set: A Strategy Employed by V1 ?

Some informational aspects of visual perception.