About

Log in?

DTU users get better search results including licensed content and discounts on order fees.

Anyone can log in and get personalized features such as favorites, tags and feeds.

Log in as DTU user Log in as non-DTU user No thanks

DTU Findit

Conference paper ยท Journal article

Semi-supervised source localization in reverberant environments using deep generative modeling

From

University of California at San Diego1

Bar-Ilan University2

Department of Electrical Engineering, Technical University of Denmark3

Acoustic Technology, Department of Electrical Engineering, Technical University of Denmark4

We present a method for acoustic source localization in reverberant environments based on semi-supervised machine learning (ML) with deep generative models. Source localization in the presence of reverberation remains a major challenge, which recent ML techniques have shown promise in addressing. Despite often large data volumes, the number of labels available for supervised learning in reverberant environments is usually small.

In semi-supervised learning, ML systems are trained using many examples with only few labels, with the goal of exploiting the natural structure of the data. We use variational autoencoders (VAEs), which are generative neural networks (NNs) that rely on explicit probabilistic representations, to model the latent distribution of reverberant acoustic data.

VAEs consist of an encoder NN, which maps complex input distributions to simpler parametric distributions (e.g., Gaussian), and a decoder NN which approximates the training examples. The VAE is trained to generate the phase of relative transfer functions (RTFs) between two microphones in reverberant environments, in parallel with a DOA classifier, on both labeled and unlabeled RTF samples.

The performance this VAE-based approach is compared with conventional and ML-based localization in simulated and real-world scenarios.

Language: English
Publisher: Acoustical Society of America
Year: 2020
Pages: 2662-2662
Proceedings: 179th Meeting of the Acoustical Society of America
ISSN: 00014966 , 01630962 and 15208524
Types: Conference paper and Journal article
DOI: 10.1121/1.5147419
ORCIDs: Fernandez Grande, Efren
Keywords

Abstracts

DTU users get better search results including licensed content and discounts on order fees.

Log in as DTU user

Access

Analysis