Dataset Information

Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates.

ABSTRACT: This paper presents a novel approach for indoor acoustic source localization using microphone arrays, based on a Convolutional Neural Network (CNN). In the proposed solution, the CNN is designed to directly estimate the three-dimensional position of a single acoustic source using the raw audio signal as the input information and avoiding the use of hand-crafted audio features. Given the limited amount of available localization data, we propose, in this paper, a training strategy based on two steps. We first train our network using semi-synthetic data generated from close talk speech recordings. We simulate the time delays and distortion suffered in the signal that propagate from the source to the array of microphones. We then fine tune this network using a small amount of real data. Our experimental results, evaluated on a publicly available dataset recorded in a real room, show that this approach is able to produce networks that significantly improve existing localization methods based on SRP-PHAT strategies and also those presented in very recent proposals based on Convolutional Recurrent Neural Networks (CRNN). In addition, our experiments show that the performance of our CNN method does not show a relevant dependency on the speaker's gender, nor on the size of the signal window being used.

SUBMITTER: Vera-Diaz JM

PROVIDER: S-EPMC6210564 | biostudies-other | 2018 Oct

REPOSITORIES: biostudies-other

ACCESS DATA

Publications

Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates.

Vera-Diaz Juan Manuel JM Pizarro Daniel D Macias-Guarasa Javier J

Sensors (Basel, Switzerland) 20181012 10

This paper presents a novel approach for indoor acoustic source localization using microphone arrays, based on a Convolutional Neural Network (CNN). In the proposed solution, the CNN is designed to directly estimate the three-dimensional position of a single acoustic source using the raw audio signal as the input information and avoiding the use of hand-crafted audio features. Given the limited amount of available localization data, we propose, in this paper, a training strategy based on two ste ...[more]

PMID: 30322007

Dataset Information

Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates.

Publications

Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Earthquake source characterization by machine learning algorithms applied to acoustic signals.
| S-EPMC8630080 | biostudies-literature

DISCO: A deep learning ensemble for uncertainty-aware segmentation of acoustic signals.
| S-EPMC10370718 | biostudies-literature

Fast and accurate annotation of acoustic signals with deep neural networks.
| S-EPMC8560090 | biostudies-literature

Towards deep learning with segregated dendrites.
| S-EPMC5716677 | biostudies-literature

Susceptibility to audio signals during autonomous driving.
| S-EPMC6089411 | biostudies-literature

Characterization of Deep Learning-Based Speech-Enhancement Techniques in Online Audio Processing Applications.
| S-EPMC10181690 | biostudies-literature

Deep learning velocity signals allow quantifying turbulence intensity.
| S-EPMC7968843 | biostudies-literature

Parallel Chords: an audio-visual analytics design for parallel coordinates.
| S-EPMC11567997 | biostudies-literature

Fully end-to-end deep-learning-based diagnosis of pancreatic tumors.
| S-EPMC7778580 | biostudies-literature

Bat detective-Deep learning tools for bat acoustic signal detection.
| S-EPMC5843167 | biostudies-literature