Title
NIST and NFI-TNO evaluations of automatic speaker recognition
Author
van Leeuwen, D.A.
Martin, A.F.
Przybocki, M.A.
Bouten, J.S.
TNO Defensie en Veiligheid
Contributor
Campell, J.P. (editor)
Mason, J. (editor)
Ortega-Garcia, J. (editor)
Publication year
2006
Abstract
In the past years, several text-independent speaker recognition evaluation campaigns have taken place. This paper reports on results of the NIST evaluation of 2004 and the NFI-TNO forensic speaker recognition evaluation held in 2003, and reflects on the history of the evaluation campaigns. The effects of speech duration, training handsets, transmission type, and gender mix show expected behaviour on the DET curves. New results on the influence of language show an interesting dependence of the DET curves on the accent of speakers. We also report on a number of statistical analysis techniques that have recently been introduced in the speaker recognition community, as well as a new application of the analysis of deviance analysis. These techniques are used to determine that the two evaluations held in 2003, by NIST and NFI-TNO, are of statistically different difficulty to the speaker recognition systems.
Subject
Acoustics and Audiology
Evaluation
Pattern recognition systems
Speech synthesis
Statistical methods
Automatic speaker recognition
Speech duration
Training handsets
Speech recognition
To reference this document use:
http://resolver.tudelft.nl/uuid:38395bad-33a7-49e4-9fbf-55724b72dedc
DOI
https://doi.org/10.1016/j.csl.2005.07.001
TNO identifier
239206
ISSN
0885-2308
Source
Computer Speech and Language, 20 (2-3 SPEC. ISS.), 128-158
Bibliographical note
Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04, 31 May 2004 through 3 June 2004, Conference
Document type
article