Searched for: subject%3A%22Speech%22
(61 - 80 of 140)


document
Matějka, P. (author), Burget, L. (author), Schwarz, P. (author), Glembek, O. (author), Karafiát, M. (author), Grézl, F. (author), Černocký, J. (author), van Leeuwen, D.A. (author), Brümmer, N. (author), Strasheim, A. (author), TNO Defensie en Veiligheid (author)
This paper describes the STBU 2006 speaker recognition system, which performed well in the NIST 2006 speaker recognition evaluation. STBU is a consortium of four partners: Spescom Data Voice (South Africa), TNO (Netherlands), BUT (Czech Republic) and University of Stellenbosch (South Africa). The primary system is a combination of three main kinds of...
conference paper 2007
document
Beerends, J.G. (author), Busz, B. (author), Oudshoorn, P. (author), van Vugt, J. (author), Ahmed, K. (author), Niamut, O. (author), TNO Informatie- en Communicatietechnologie (author)
The authors discuss the way we perceive the quality of a speech signal and how different degradations contribute to the overall perceived speech (listening) quality. More specifically, ITU-T Recommendation P.862 (perceptual evaluation of speech quality-PESQ), which provides a perceptual modeling approach with which the subjectively perceived...
article 2007
document
de Korte, E.M. (author), van Lingen, P. (author), TNO Kwaliteit van Leven (author)
A comparative, experimental study with repeated measures has been conducted to evaluate the effect of the use of speech recognition on working postures, productivity and the perception of user friendliness. Fifteen subjects performed a standardised task, first with keyboard and mouse and, after a six-week training period, with speech recognition...
article 2006
document
de Jong, F.M.G. (author), Ordelman, R. (author), Huijbregts, M. (author), TNO Informatie- en Communicatietechnologie (author)
The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted...
bookPart 2006
document
Kim, D.S. (author), Beerends, J.G. (author), Ghitza, O. (author), Kroon, P. (author), Rix, A. (author), TNO Informatie- en Communicatietechnologie (author)
article 2006
document
van Leeuwen, D.A. (author), Huijbregts, Marijn (author), TNO Defensie en Veiligheid (author)
We describe the systems submitted to the NIST RT06s evaluation for the Speech Activity Detection (SAD) and Speaker Diarization (SPKR) tasks. For speech activity detection, a new analysis methodology is presented that generalizes the Detection Error Tradeoff analysis commonly used in speaker detection tasks. The speaker diarization systems are...
conference paper 2006
document
van Leeuwen, D.A. (author), Brümmer, Niko (author), TNO Defensie en Veiligheid (author)
This paper describes two new approaches to spoken language recognition. These were both successfully applied in the NIST 2005 Language Recognition Evaluation. The first approach extends the Gaussian Mixture Model technique with channel dependency, which results in actual detection costs (CDET) of 0.095 in NIST LRE-2005, and which should be...
conference paper 2006
document
Rix, A.W. (author), Beerends, J.G. (author), Kim, D.-S. (author), Kroon, P. (author), Ghitza, O. (author), TNO Informatie- en Communicatietechnologie (author)
In the past few years, objective quality assessment models have become increasingly used for assessing or monitoring speech and audio quality. By measuring perceived quality on an easily-understood subjective scale, such as listening quality (excellent, good, fair, poor, bad), these methods provide a quick and repeatable way to estimate customer...
article 2006
document
Kessens, J.M. (author), TNO Defensie en Veiligheid (author)
This study shows that recognition of speech with foreign accents improves when pronunciation variants are added and when the acoustic models are adapted with speech recordings of these foreign accents.
conference paper 2006
document
Brümmer, Niko (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
Recent publications have examined the topic of calibration of confidence scores in the field of (binary-hypothesis) speaker detection. We extend this topic to the case of multiple-hypothesis language recognition. We analyze the structure of multiple-hypothesis recognition problems to show that any such problem subsumes a multitude of derived sub...
conference paper 2006
document
van Leeuwen, D.A. (author), Martin, A.F. (author), Przybocki, M.A. (author), Bouten, J.S. (author), TNO Defensie en Veiligheid (author)
In the past years, several text-independent speaker recognition evaluation campaigns have taken place. This paper reports on results of the NIST evaluation of 2004 and the NFI-TNO forensic speaker recognition evaluation held in 2003, and reflects on the history of the evaluation campaigns. The effects of speech duration, training handsets,...
article 2006
document
van Wijngaarden, S.J. (author), Verhave, J.A. (author), TNO Defensie en Veiligheid (author)
Traffic tunnels are generally hostile acoustic environments, both in terms of reverberation and ambient noise levels. Public address (PA) systems used to convey spoken warnings must meet stringent design requirements in order to produce sufficiently intelligible speech. To be able to predict PA system performance at tunnel design time, two di...
article 2006
document
van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
The TNO speaker diarization system is based on a standard BIC segmentation and clustering algorithm. Since correct speech detection appears to be essential for the NIST Rich Transcription speaker diarization evaluation measure, we have developed a speech activity detector (SAD) as well. This is based on decoding the speech signal using...
conference paper 2006
document
Möller, S. (author), Krebber, J. (author), Smeele, P. (author), TNO Defensie en Veiligheid (author)
This paper describes four experiments which have been carried out to evaluate the speech output component of the INSPIRE spoken dialogue system, providing speech control for different devices located in a “smart” home environment. The aim is to quantify the impact of different factors on the quality of the system, when addressed either in the...
article 2006
document
TNO Defensie en Veiligheid (author), Boer, L.C. (author), van Balken, J.S. (author), van Wijngaarden, S.J. (author)
The traffic controller can direct road users in a tunnel with public address announcements. The messages are recorded in advance at studio quality and then played back at the push of a button. Motorists will understand such a message more readily, and the traffic controller's workload also decreases, which benefits safety particularly in...
report 2006
document
van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
The TNO speaker diarization system is based on a standard BIC segmentation and clustering algorithm. Since correct speech detection appears to be essential for the NIST Rich Transcription speaker diarization evaluation measure, we have developed a speech activity detector (SAD) as well. This is based on decoding the speech...
conference paper 2005
document
Truong, K.P. (author), Neri, A. (author), de Wet, F. (author), Cucchiarini, C. (author), Strik, H. (author), TNO Defensie en Veiligheid (author)
In this paper, we present an acoustic-phonetic approach to automatic pronunciation error detection. Classifiers using techniques such as Linear Discriminant Analysis and Decision Trees were developed for three sounds that are frequently pronounced incorrectly by L2-learners of Dutch: /a/, /y/ and /x/. This paper will focus mainly on the problems...
conference paper 2005
document
Leurdijk, A. (author), Limonard, S. (author), TNO Informatie- en Communicatietechnologie (author)
NM2 (New Media for a New Millennium) develops tools for interactive, personalised and non-linear audio-visual content that will be tested in seven pilot productions. This paper looks at the market potential for these productions from a technological, a business and a users' perspective. It shows that digital broadcast networks and broadband...
conference paper 2005
document
van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
New in the 2004 edition of the NIST Speaker Recognition Evaluation (SRE) was the condition where unsupervised adaptation of speaker models is allowed. Despite the promising results on development test material, hardly any beneficial results were obtained in the Evaluation itself. An analysis is made of why this was the case, and it appears that a...
conference paper 2005
document
Truong, K.P. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In the context of detecting ‘paralinguistic events’ with the aim to make classification of the speaker’s emotional state possible, a detector was developed for one of the most obvious ‘paralinguistic events’, namely laughter. Gaussian Mixture Models were trained with Perceptual Linear Prediction features, pitch&energy, pitch&voicing and...
conference paper 2005