Automated Speech and Audio Analysis for Semantic Access to Multimedia

Jong, F.M.G. de; Ordelman, R.; Huijbregts, M.

bookPart

2006

The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content and, as a consequence, improve the effectiveness of conceptual access tools. This paper gives an overview of the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques are presented, including the alignment of speech and text resources, large vocabulary speech recognition, keyword spotting, and speaker classification. The applicability of the techniques is discussed from a media-crossing perspective. Their added value and potential contribution to the content value chain are illustrated by the description of two complementary demonstrators for browsing broadcast news archives. © Springer-Verlag Berlin Heidelberg 2006.

Topics

Audio processing; Automatic speech; Content value chains; Multimedia contents; Speaker classification; Digital storage; Speech recognition; Text processing; Tools; Semantics

TNO Identifier

483007

Repository link

https://resolver.tno.nl/uuid:9067181f-e941-458a-b2cd-7406f203b860

DOI

https://dx.doi.org/10.1007/11930334_18

ISBN

9783540493358

Publisher

Springer

Source title

Semantic Multimedia, Proceedings First International Conference on Semantics and Digital Media Technologies, SAMT 2006, 6-8 December 2006, Athens, Greece

Editor(s)

Avrithis, Y.

Place of publication

Berlin : [etc]

Pages

226-240

Files

To receive the publication files, please send an e-mail request to TNO Repository.
