About

Log in?

DTU users get better search results including licensed content and discounts on order fees.

Anyone can log in and get personalized features such as favorites, tags and feeds.

Log in as DTU user Log in as non-DTU user No thanks

DTU Findit

Conference paper

Castsearch - Context Based Spoken Document Retrieval

From

Cognitive Systems, Department of Informatics and Mathematical Modeling, Technical University of Denmark1

Department of Informatics and Mathematical Modeling, Technical University of Denmark2

The paper describes our work on the development of a system for retrieval of relevant stories from broadcast news. The system utilizes a combination of audio processing and text mining. The audio processing consists of a segmentation step that partitions the audio into speech and music. The speech is further segmented into speaker segments and then transcribed using an automatic speech recognition system, to yield text input for clustering using non-negative matrix factorization (NMF).

We find semantic topics that are used to evaluate the performance for topic detection. Based on these topics we show that a novel query expansion can be performed to return more intelligent search results. We also show that the query expansion helps overcome errors of the automatic transcription

Language: English
Publisher: IEEE
Year: 2007
Pages: IV-93-IV-96
Proceedings: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing
ISBN: 1424407273 , 9781424407279 , 1424407281 and 9781424407286
ISSN: 2379190x and 15206149
Types: Conference paper
DOI: 10.1109/ICASSP.2007.367171
ORCIDs: Mølgaard, Lasse Lohilahti and Hansen, Lars Kai

DTU users get better search results including licensed content and discounts on order fees.

Log in as DTU user

Access

Analysis