Speaker adaptation in the NIST Speaker Recognition Evaluation 2004

other
New in the 2004 edition of the NIST Speaker Recognition Evaluation (SRE) was a condition in which unsupervised adaptation of speaker models is allowed. Despite promising results on development test material, hardly any beneficial results were obtained in the Evaluation itself. We analyze why this was the case. It appears that a minimum level of performance is essential before adaptation can improve on the performance obtained without adaptation. Further, the system should be well calibrated. For the conditions with 8 conversation sides we have been able to find an improvement from unsupervised adaptation on the NIST 2004 evaluation data, both for a UBM/GMM adaptation methodology and for a novel SVM adaptation methodology. The minimum DCF for a fused system drops from 0.259 for the unadapted condition to 0.231 for the adapted condition.
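The minimum DCF quoted above is the lowest detection cost reachable over all decision thresholds. The following is a minimal sketch of how such a figure can be computed from trial scores; it is not the authors' scoring code. The function name `min_dcf` and the list-based inputs are illustrative assumptions. The default cost parameters (C_miss = 10, C_fa = 1, P_target = 0.01) are the standard NIST SRE values of that period; whether the figures in the abstract are scaled or normalized is not stated in this record.

```python
def min_dcf(target_scores, nontarget_scores,
            c_miss=10.0, c_fa=1.0, p_target=0.01):
    """Return the minimum detection cost over all decision thresholds.

    Hypothetical helper for illustration: a trial is accepted when its
    score is greater than or equal to the threshold.
    """
    if not target_scores or not nontarget_scores:
        raise ValueError("need at least one target and one non-target score")

    # Candidate thresholds: every observed score, plus one value above the
    # maximum so that "reject every trial" is also considered.
    all_scores = sorted(set(target_scores) | set(nontarget_scores))
    thresholds = all_scores + [all_scores[-1] + 1.0]

    best = float("inf")
    for t in thresholds:
        # Miss: a target trial scored below the threshold.
        p_miss = sum(s < t for s in target_scores) / len(target_scores)
        # False alarm: a non-target trial scored at or above the threshold.
        p_fa = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        cost = c_miss * p_miss * p_target + c_fa * p_fa * (1.0 - p_target)
        best = min(best, cost)
    return best
```

Because the minimum is taken over thresholds after the fact, it measures discrimination only. The cost actually incurred at the threshold a system commits to in advance can be higher, which is one reason calibration matters when adaptation decisions depend on the system's own scores.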
TNO Identifier
15991
Source title
9th European Conference on Speech Communication and Technology (Eurospeech), 4-8 September 2005, Lisbon
Pages
1981 - 1984