Searched for: subject%3A%22Speech%22
(41 - 60 of 140)

Pages

document
Cremers, A.H.M. (author), de Jong, J.G.M. (author), van Balken, J.S. (author), TNO Defensie en Veiligheid (author)
One of the major challenges in current user interface research and development is the accommodation of diversity in users and contexts of use in order to improve the self-efficacy of citizens. A common banking service, which should be designed for diversity, is the Automated Teller Machine (ATM). This paper describes the various user-centered...
conference paper 2008
document
van Wijngaarden, S.J. (author), Drullman, R. (author), TNO Defensie en Veiligheid (author)
Although the speech transmission index STI is a well-accepted and standardized method for objective prediction of speech intelligibility in a wide range of environments and applications, it is essentially a monaural model. Advantages of binaural hearing in speech intelligibility are disregarded. In specific conditions, this leads to considerable...
article 2008
document
van Leeuwen, D.A. (author), Konečný, M. (author), TNO Defensie en Veiligheid (author)
In this paper we describe the AMIDA speaker dizarization system as it was submitted to the NIST Rich Transcription evaluation 2007 for conference room data. This is done in the context of the history of this system and other speaker diarization systems. One of the goals of our system is to have as little tunable parameters as possible, while...
conference paper 2008
document
Terband, H. (author), Drullman, R. (author), TNO Defensie en Veiligheid (author)
A procedure was developed for the automated measurement of the speech reception threshold in stationary noise (SRTn), which can be administered by the subjects themselves using a computer. The procedure was based on the SRTn test for Dutch developed by Plomp and Mimpen [(1979). “Improving the reliability of testing the speech reception threshold...
article 2008
document
Zekveld, A.A. (author), Kramer, S.E. (author), Kessens, J.M. (author), Vlaming, M.S.M.G. (author), Houtgast, T. (author), TNO Kwaliteit van Leven (author)
OBJECTIVES: The aim of this study was to evaluate the benefit that listeners obtain from visually presented output from an automatic speech recognition (ASR) system during listening to speech in noise. DESIGN: Auditory-alone and audiovisual speech reception thresholds (SRTs) were measured. The SRT is defined as the speech-to-noise ratio at which...
article 2008
document
Truong, K.P. (author), Neerincx, M.A. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
We investigated inter-observer agreement and the reliability of self-reported emotion ratings (i.e., self-raters judging their own emotions) in spontaneous multimodal emotion data. During a multiplayer video game, vocal and facial expressions were recorded (including the game content itself) and were annotated by the players themselves on...
conference paper 2008
document
Raaijmakers, S. (author), Truong, K.P. (author), TNO Defensie en Veiligheid (author)
We developed acoustic and lexical classifiers, based on a boosting algorithm, to assess the separability on arousal and valence dimensions in spontaneous emotional speech. The spontaneous emotional speech data was acquired by inviting subjects to play a first-person shooter video game. Our acoustic classifiers performed significantly better than...
conference paper 2008
document
TNO Defensie en Veiligheid (author), Mohamoud, A.A. (author), Maris, M.G. (author)
In this paper, an emotion classification system based on speech signals is presented. The classifier can identify the most common emotions, namely anger, neutral, happiness and fear. The algorithm computes a number of acoustic features which are fed into the classifier based on a pattern recognition approach. The classification system is of...
conference paper 2008
document
Truong, K.P. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this paper, we present a detection approach and an ‘open-set’ detection evaluation methodology for automatic emotion recognition in speech. The traditional classification approach does not seem to be suitable and flexible enough for typical emotion recognition tasks. For example, classification does not have an appropriate way to cope with ...
conference paper 2007
document
van Leeuwen, D.A. (author), Brümmer, N. (author), TNO Defensie en Veiligheid (author)
In the evaluation of speaker recognition systems - an important part of speaker classification [1], the trade-off between missed speakers and false alarms has always been an important diagnostic tool. NIST has defined the task of speaker detection with the associated Detection Cost Function (DCF) to evaluate performance, and introduced the DET...
conference paper 2007
document
Strik, H. (author), Truong, K. (author), de Wet, F. (author), Cucchiarini, C. (author), TNO Defensie en Veiligheid (author)
Providing feedback on pronunciation errors in computer assisted language learning systems requires that pronunciation errors be detected automatically. In the present study we compare four types of classifiers that can be used for this purpose: two acoustic-phonetic classifiers (one of which employs linear-discriminant analysis (LDA)), a...
conference paper 2007
document
van Leeuwen, D.A. (author), Truong, K.P. (author), TNO Defensie en Veiligheid (author)
This paper introduces a detection methodology for recognition technologies in speech for which it is dif cult to obtain an abundance of non-target classes. An example is language recognition, where we would like to be able to measure the detection capability of a single target language without confounding with the modeling capability of non...
conference paper 2007
document
Truong, K. (author), van Leeuwen, D. (author), TNO Defensie en Veiligheid (author)
Emotions can be recognized by audible paralinguistic cues in speech. By detecting these paralinguistic cues that can consist of laughter, a trembling voice, coughs, changes in the intonation contour etc., information about the speaker’s state and emotion can be revealed. This paper describes the development of a gender-independent laugh detector...
article 2007
document
Merkx, P.A.B. (author), Truong, K.P. (author), Neerincx, M.A. (author), TNO Defensie en Veiligheid (author)
To develop an annotated database of spontaneous, multimodal, emotional expressions, recordings were made of facial and vocal expressions of emotions while participants were playing a multiplayer first-person shooter (fps) computer game. During a replay of the session, participants scored their own emotions by assigning values to them on an...
conference paper 2007
document
Truong, K.P. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this paper, we introduce a visual analysis method to assess the discriminability and confusiability between emotions according to automatic emotion classifiers. The degree of acoustic similarities between emotions can be defined in terms of distances that are based on pair-wise emotion discrimination experiments. By employing Multidimensional...
conference paper 2007
document
Pigeon, S. (author), Shen, W. (author), Lawson, A. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
This paper describes the speech database that has a central role in the Interspeech 2007 special session "Novel techniques for the NATO non-native Air Traffic Communications database." The rationale for recording and distributing this common research object is given, and details about the acquisition and annotation are given, as well as some...
conference paper 2007
document
Truong, K.P. (author), van Leeuwen, D.A. (author), Neerincx, M.A. (author), TNO Defensie en Veiligheid (author)
Two unobtrusive modalities for automatic emotion recognition are discussed: speech and facial expressions. First, an overview is given of emotion recognition studies based on a combination of speech and facial expressions. We will identify difficulties concerning data collection, data fusion, system evaluation and emotion annotation that one is...
conference paper 2007
document
Kessens, J.M. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this paper, we describe N-best 2008, the first Large Vocabulary Speech Recognition (LVCSR) benchmark evaluation held for the Dutch language. Both the accent as spoken in the Netherlands (Northern-Dutch) and in Belgium (Southern-Dutch or Flemish), will be evaluated. The evaluation tasks are broadcast news (BN) and conversational telephone...
conference paper 2007
document
Möller, S. (author), Smeele, P. (author), Boland, H. (author), Krebber, J. (author), TNO Industrie en Techniek (author)
In the present paper, we investigate the validity and reliability of de-facto evaluation standards, defined for measuring or predicting the quality of the interaction with spoken dialogue systems. Two experiments have been carried out with a dialogue system for controlling domestic devices. During these experiments, subjective judgments of...
article 2007
document
Truong, K.P. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this study, we investigated automatic laughter segmentation in meetings. We first performed laughterspeech discrimination experiments with traditional spectral features and subsequently used acousticphonetic features. In segmentation, we used Gaussian Mixture Models that were trained with spectral features. For the evaluation of the laughter...
conference paper 2007
Searched for: subject%3A%22Speech%22
(41 - 60 of 140)

Pages