Conference paper
Audio-visual scene analysis in reverberant multi-talker environments
Normal-hearing listeners can accurately localize sound sources even in reverberant multi-talker environments (e.g., Kopčo, 2010; Weller, 2016). Weller et al. (2016) showed that subjects can accurately analyse reverberant multi-talker scenes with up to four simultaneous talkers. While multi-talker scene analysis has mainly been investigated with auditory information alone, the addition of visual information might influence the subjects’ perception.
To investigate the visual influence, audio-visual scenes with a varying number of talkers and degrees of reverberation were considered in the present study. The acoustic information was provided using a spherical loudspeaker array and the visual information was provided using head-tracked virtual reality glasses.
The visual information represented various possible talker locations, and the subjects were asked to identify the number of talkers and their specific locations. To identify individual talkers, subjects had to label the visual locations with headlines corresponding to the topic of each talker’s speech. It was hypothesized that the addition of visual information improves subjects’ ability to analyse complex auditory scenes, while reverberation impairs overall performance.
Language: | English |
---|---|
Publisher: | Deutsche Gesellschaft für Akustik e.V. |
Year: | 2019 |
Pages: | 3890-3896 |
Proceedings: | 23rd International Congress on Acoustics |
ISBN: | 3939296155 (ISBN-10); 9783939296157 (ISBN-13) |
Types: | Conference paper |
ORCIDs: | Ahrens, Axel and Dau, Torsten |