Speaker adaptation in the NIST Speaker Recognition Evaluation 2004

Leeuwen, D.A. van

Speaker adaptation in the NIST Speaker Recognition Evaluation 2004

other

2005

Leeuwen, D.A. van

New in the 2004 edition of the NIST Speaker Recognition Evaluation (SRE) was the condition where unsupervised adaptation of speaker models is allowed. Despite the promising results on development test material, hardly any beneficial results were
obtained in the Evaluation itself. An analysis is made why this was the case, and it appears that a mimimum level of performance is essential to obtain results using adaptation that improve on the performance without adaptation. Further, the system
should be well calibrated. For the conditions with 8 conversation sides we have been able to find improvement using unsupervised adaptation using the NIST 2004 evaluation, both
for an UBM/GMM adaptation methodology, and a novel SVM adaptation methodology. The minimum DCF for a fused system drops from 0.259 for the unadapted condition to 0.231 for the adapted condition.

Topics

Speaker recognition Automatic speech recognition Speech Mathematical models Performance Speech analysis Speech communication Adaptation methodology Speaker Recognition Evaluation (SRE)Test materials Speech recognition

TNO Identifier

15991

Repository link

https://resolver.tno.nl/uuid:393f3642-5323-42b8-8341-ac126ac37dfa

Source title

9th European Conference on Speech Communication and Technology, 4 September 2005 through 8 September 2005, Lisbon Eurospeech

Pages

1981 - 1984

Files

To receive the publication files, please send an e-mail request to TNO Repository.

Speaker adaptation in the NIST Speaker Recognition Evaluation 2004

Make TNO yours!