Searched for: subject%3A%22Speech%255C%2Brecognition%22
(1 - 20 of 51)

Pages

document
van Leeuwen, D.A. (author)
This paper investigates the task of linking speakers across multiple recordings, which can be accomplished by speaker clustering. Various aspects are considered, such as computational complexity, on/offline approaches, and evaluation measures but also speaker recognition approaches. It has not been the aim of this study to optimize clustering...
conference paper 2019
document
Heiligers, M.J.C. (author), Huizing, A.G. (author)
conference paper 2018
document
Uguru, J.O. (author), Beerends, J.G. (author), Ebem, D. (author)
This study examines the level of speech recognition of English and Igbo utterances by 70 grade four children. The children, whose mother tongue is Igbo and aged between 8 and 10 years, had English monosyllabic words as well as Igbo monosyllabic and disyllabic words dictated to them in noisy and quiet classrooms. The results show that in noise,...
article 2017
document
Gallardo, L.F. (author), Möller, S. (author), Beerends, J. (author)
The performance of automatic speech recognition based on coded-decoded speech heavily depends on the quality of the transmitted signals, determined by channel impairments. This paper examines relationships between speech recognition performance and measurements of speech quality and intelligibility over transmission channels. Different to...
conference paper 2017
document
Lefter, I. (author), Burghouts, G.J. (author), Rothkrantz, L.J.M. (author)
This paper investigates how speech and gestures convey stress, and how they can be used for automatic stress recognition. As a first step, we look into how humans use speech and gestures to convey stress. In particular, for both speech and gestures, we distinguish between stress conveyed by the intended semantic message (e.g. spoken words for...
article 2016
document
Lefter, I. (author), Rothkrantz, L.J.M. (author), Wiggers, P. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
We explore possibilities for enhancing the generality, portability and robustness of emotion recognition systems by combining data-bases and by fusion of classifiers. In a first experiment, we investigate the performance of an emotion detection system tested on a certain database given that it is trained on speech from either the same database,...
conference paper 2010
document
Larson, M. (author), Ordelman, R. (author), Metze, F. (author), Kraaij, W. (author), de Jong, F. (author), TNO Informatie- en Communicatietechnologie (author)
When multimedia content has a speech track, a whole array of techniques involving speech recognition and analysis becomes available for indexing and structuring and can provide users with improved access and search. The set of new domains standing to benefit from these techniques encompasses talkshows, lectures, meetings, interviews, debates,...
conference paper 2010
document
Truong, K.P. (author), van Leeuwen, D.A. (author), Neerincx, M.A. (author), de Jong, F.M.G. (author), TNO Defensie en Veiligheid (author)
In this paper, we describe emotion recognition experiments carried out for spontaneous affective speech with the aim to compare the added value of annotation of felt emotion versus annotation of perceived emotion. Using speech material available in the TNO-GAMING corpus (a corpus containing audiovisual recordings of people playing videogames),...
conference paper 2009
document
Huijbregts, M. (author), van Leeuwen, D.A. (author), de Jong, F.M.G. (author), TNO Defensie en Veiligheid (author)
In this paper we present a method for combining multiple diarization systems into one single system by applying a majority voting scheme. The voting scheme selects the best segmentation purely on basis of the output of each system. On our development set of NIST Rich Transcription evaluation meetings the voting method improves our system on all...
conference paper 2009
document
Zekveld, A.A. (author), Kramer, S.E. (author), Kessens, J.M. (author), Vlaming, M.S.M.G. (author), Houtgast, T. (author), TNO Defensie en Veiligheid (author)
This study examined the subjective benefit obtained from automatically generated captions during telephone-speech comprehension in the presence of babble noise. Short stories were presented by telephone either with or without captions that were generated offline by an automatic speech recognition (ASR) system. To simulate online ASR, the word...
article 2009
document
van Leeuwen, D.A. (author), Kessens, J. (author), Sanders, E. (author), van den Heuvel, H. (author), TNO Defensie en Veiligheid (author)
In this paper we report the results of a Dutch speech recognition system evaluation held in 2008. The evaluation contained material in two domains: Broadcast News (BN) and Conversational Telephone Speech (CTS) and in two main accent regions (Flemish and Dutch). In total 7 sites submitted recognition results to the evaluation, totalling 58...
conference paper 2009
document
Larson, M. (author), Ordelman, R. (author), de Jong, F. (author), Kraaij, W. (author), Kohler, J. (author), TNO Informatie- en Communicatietechnologie (author)
conference paper 2009
document
Orr, R. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this study, we explore a human benchmark in language recognition, for the purpose of comparing human performance to machine performance in the context of the NIST LRE 2007. Humans are categorised in terms of language proficiency, and performance is presented per proficiency. Themain challenge in this work is the design of a test and...
conference paper 2009
document
Huijbregts, M. (author), van Leeuwen, D.A. (author), de Jong, F.M.G. (author), TNO Defensie en Veiligheid (author)
In this paper we present the two-pass speaker diarization system that we developed for the NIST RT09s evaluation. In the first pass of our system a model for speech overlap detection is generated automatically. This model is used in two ways to reduce the diarization errors due to overlapping speech. First, it is used in a second diarization...
conference paper 2009
document
van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this paper we propose a framework for measuring the overall performance of an automatic speaker recognition system using a set of trials of a heterogeneous evaluation such as NIST SRE- 2008, which combines several acoustic conditions in one evaluation. We do this by weighting trials of different conditions according to their relative...
conference paper 2009
document
Zekveld, A.A. (author), Kramer, S.E. (author), Kessens, J.M. (author), Vlaming, M.S.M.G. (author), Houtgast, T. (author), TNO Kwaliteit van Leven (author)
OBJECTIVES: The aim of this study was to evaluate the benefit that listeners obtain from visually presented output from an automatic speech recognition (ASR) system during listening to speech in noise. DESIGN: Auditory-alone and audiovisual speech reception thresholds (SRTs) were measured. The SRT is defined as the speech-to-noise ratio at which...
article 2008
document
Truong, K.P. (author), Neerincx, M.A. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
We investigated inter-observer agreement and the reliability of self-reported emotion ratings (i.e., self-raters judging their own emotions) in spontaneous multimodal emotion data. During a multiplayer video game, vocal and facial expressions were recorded (including the game content itself) and were annotated by the players themselves on...
conference paper 2008
document
Raaijmakers, S. (author), Truong, K.P. (author), TNO Defensie en Veiligheid (author)
We developed acoustic and lexical classifiers, based on a boosting algorithm, to assess the separability on arousal and valence dimensions in spontaneous emotional speech. The spontaneous emotional speech data was acquired by inviting subjects to play a first-person shooter video game. Our acoustic classifiers performed significantly better than...
conference paper 2008
document
Truong, K.P. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this paper, we present a detection approach and an ‘open-set’ detection evaluation methodology for automatic emotion recognition in speech. The traditional classification approach does not seem to be suitable and flexible enough for typical emotion recognition tasks. For example, classification does not have an appropriate way to cope with ...
conference paper 2007
document
van Leeuwen, D.A. (author), Brümmer, N. (author), TNO Defensie en Veiligheid (author)
In the evaluation of speaker recognition systems - an important part of speaker classification [1], the trade-off between missed speakers and false alarms has always been an important diagnostic tool. NIST has defined the task of speaker detection with the associated Detection Cost Function (DCF) to evaluate performance, and introduced the DET...
conference paper 2007
Searched for: subject%3A%22Speech%255C%2Brecognition%22
(1 - 20 of 51)

Pages