Automated Speech and Audio Analysis for Semantic Access to Multimedia

bookPart
The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques will be presented, including the alignment of speech and text resources, large vocabulary speech recognition, key word spotting and speaker classification. The applicability of techniques will be discussed from a media crossing perspective. The added value of the techniques and their potential contribution to the content value chain will be illustrated by the description of two (complementary) demonstrators for browsing broadcast news archives. © Springer-Verlag Berlin Heidelberg 2006.
TNO Identifier
483007
ISBN
9783540493358
Publisher
Springer
Source title
Semantic Multimedia, Proceedings First International Conference on Semantics and Digital Media Technologies, SAMT 2006, 6-8 December 2006, Athens, Greece
Editor(s)
Avrithis, Y.
Place of publication
Berlin : [etc]
Pages
226-240
Files
To receive the publication files, please send an e-mail request to TNO Repository.