Dataset Information

Speech synthesis from ECoG using densely connected 3D convolutional neural networks.

ABSTRACT: OBJECTIVE:Direct synthesis of speech from neural signals could provide a fast and natural way of communication to people with neurological diseases. Invasively-measured brain activity (electrocorticography; ECoG) supplies the necessary temporal and spatial resolution to decode fast and complex processes such as speech production. A number of impressive advances in speech decoding using neural signals have been achieved in recent years, but the complex dynamics are still not fully understood. However, it is unlikely that simple linear models can capture the relation between neural activity and continuous spoken speech. APPROACH:Here we show that deep neural networks can be used to map ECoG from speech production areas onto an intermediate representation of speech (logMel spectrogram). The proposed method uses a densely connected convolutional neural network topology which is well-suited to work with the small amount of data available from each participant. MAIN RESULTS:In a study with six participants, we achieved correlations up to r??=??0.69 between the reconstructed and original logMel spectrograms. We transfered our prediction back into an audible waveform by applying a Wavenet vocoder. The vocoder was conditioned on logMel features that harnessed a much larger, pre-existing data corpus to provide the most natural acoustic output. SIGNIFICANCE:To the best of our knowledge, this is the first time that high-quality speech has been reconstructed from neural recordings during speech production using deep neural networks.

SUBMITTER: Angrick M

PROVIDER: S-EPMC6822609 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Speech synthesis from ECoG using densely connected 3D convolutional neural networks.

Angrick Miguel M Herff Christian C Mugler Emily E Tate Matthew C MC Slutzky Marc W MW Krusienski Dean J DJ Schultz Tanja T

Journal of neural engineering 20190304 3

<h4>Objective</h4>Direct synthesis of speech from neural signals could provide a fast and natural way of communication to people with neurological diseases. Invasively-measured brain activity (electrocorticography; ECoG) supplies the necessary temporal and spatial resolution to decode fast and complex processes such as speech production. A number of impressive advances in speech decoding using neural signals have been achieved in recent years, but the complex dynamics are still not fully underst ...[more]

PMID: 30831567

Dataset Information

Speech synthesis from ECoG using densely connected 3D convolutional neural networks.

Publications

Speech synthesis from ECoG using densely connected 3D convolutional neural networks.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A Multi-Scale Densely Connected Convolutional Neural Network for Automated Thyroid Nodule Classification.
| S-EPMC9160335 | biostudies-literature

Fully automated condyle segmentation using 3D convolutional neural networks.
| S-EPMC9709043 | biostudies-literature

A novel 1-D densely connected feature selection convolutional neural network for heart sounds classification.
| S-EPMC8756246 | biostudies-literature

Understanding structure-guided variant effect predictions using 3D convolutional neural networks.
| S-EPMC10354367 | biostudies-literature

High precision protein functional site detection using 3D convolutional neural networks.
| S-EPMC6499237 | biostudies-literature

Predicting the target landscape of kinase inhibitors using 3D convolutional neural networks.
| S-EPMC10508635 | biostudies-literature

Single-Shot 3D Shape Reconstruction Using Structured Light and Deep Convolutional Neural Networks.
| S-EPMC7374384 | biostudies-literature

RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks.
| S-EPMC6258470 | biostudies-literature

Zebrafish tracking using convolutional neural networks.
| S-EPMC5314376 | biostudies-literature

HiC-GNN: A generalizable model for 3D chromosome reconstruction using graph convolutional neural networks.
| S-EPMC9842867 | biostudies-literature