Dataset Information

Hierarchical spike coding of sound.

ABSTRACT: Natural sounds exhibit complex statistical regularities at multiple scales. Acoustic events underlying speech, for example, are characterized by precise temporal and frequency relationships, but they can also vary substantially according to the pitch, duration, and other high-level properties of speech production. Learning this structure from data while capturing the inherent variability is an important first step in building auditory processing systems, as well as understanding the mechanisms of auditory perception. Here we develop Hierarchical Spike Coding, a two-layer probabilistic generative model for complex acoustic structure. The first layer consists of a sparse spiking representation that encodes the sound using kernels positioned precisely in time and frequency. Patterns in the positions of first layer spikes are learned from the data: on a coarse scale, statistical regularities are encoded by a second-layer spiking representation, while fine-scale structure is captured by recurrent interactions within the first layer. When fit to speech data, the second layer acoustic features include harmonic stacks, sweeps, frequency modulations, and precise temporal onsets, which can be composed to represent complex acoustic events. Unlike spectrogram-based methods, the model gives a probability distribution over sound pressure waveforms. This allows us to use the second-layer representation to synthesize sounds directly, and to perform model-based denoising, on which we demonstrate a significant improvement over standard methods.

SUBMITTER: Karklin Y

PROVIDER: S-EPMC4209850 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Hierarchical spike coding of sound.

Karklin Yan Y Ekanadham Chaitanya C Simoncelli Eero P EP

Advances in neural information processing systems 20120101

Natural sounds exhibit complex statistical regularities at multiple scales. Acoustic events underlying speech, for example, are characterized by precise temporal and frequency relationships, but they can also vary substantially according to the pitch, duration, and other high-level properties of speech production. Learning this structure from data while capturing the inherent variability is an important first step in building auditory processing systems, as well as understanding the mechanisms o ...[more]

PMID: 25356065

Dataset Information

Hierarchical spike coding of sound.

Publications

Hierarchical spike coding of sound.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Spike-timing-based computation in sound localization.
| S-EPMC2978676 | biostudies-literature

Population rate-coding predicts correctly that human sound localization depends on sound intensity.
| S-EPMC6802950 | biostudies-literature

Spike-rate coding and spike-time coding are affected oppositely by different adaptation mechanisms.
| S-EPMC2819463 | biostudies-literature

Sound of Daily Living Identification Based on Hierarchical Situation Audition.
| S-EPMC10098573 | biostudies-literature

Opponent Coding of Sound Location (Azimuth) in Planum Temporale is Robust to Sound-Level Variations.
| S-EPMC4677988 | biostudies-literature

Hierarchical Inference in Sound Change: Words, Sounds, and Frequency of Use.
| S-EPMC8387583 | biostudies-literature

Disruption of hierarchical predictive coding during sleep.
| S-EPMC4371991 | biostudies-literature

Non-isomorphism in efficient coding of complex sound properties.
| S-EPMC3210183 | biostudies-other

Exploring the sound-modulated delay in tomato ripening through expression analysis of coding and non-coding RNAs.
| S-EPMC6324751 | biostudies-literature

Neural coding of prior expectations in hierarchical intention inference.
| S-EPMC5430911 | biostudies-literature