Searched for: subject:"Speech"
(21 - 40 of 140)

document
de Jong, A.P.J. (author), Tak, S. (author), Toet, A. (author), Schultz, S. (author), Wijbenga, J.P. (author), van Erp, J.B.F. (author)
Several interaction techniques have been proposed to enable transfer of information between different displays in heterogeneous multi-display environments. However, it is not clear whether subjective user preference for these different techniques depends on the nature of the displays between which information is transferred. We explore...
conference paper 2013
document
van der Sluis, F. (author), Dijkstra, T. (author), van den Broek, E.L. (author)
This study explores the feasibility of sensitive machines; that is, machines with empathic abilities, at least to some extent. A signal processing and machine learning pipeline is presented that is used to analyze data from two studies in which 25 Post-Traumatic Stress Disorder (PTSD) patients participated. The feasibility of speech as a stress...
conference paper 2012
document
Ebem, D.U. (author), Beerends, J.G. (author), van Vugt, J. (author), Schmidmer, C. (author), Kooij, R.E. (author), Uguru, J.O. (author)
The extent to which the modeling used in objective speech quality algorithms depends on the cultural background of listeners as well as on the language characteristics, using American English and Igbo, an African tone language, is investigated. Two different approaches were used in order to separate behavioral aspects from speech signal aspects....
article 2011
document
Lefter, I. (author), Rothkrantz, L.J.M. (author), Wiggers, P. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
We explore possibilities for enhancing the generality, portability and robustness of emotion recognition systems by combining data-bases and by fusion of classifiers. In a first experiment, we investigate the performance of an emotion detection system tested on a certain database given that it is trained on speech from either the same database,...
conference paper 2010
document
Larson, M. (author), Ordelman, R. (author), Metze, F. (author), Kraaij, W. (author), de Jong, F. (author), TNO Informatie- en Communicatietechnologie (author)
When multimedia content has a speech track, a whole array of techniques involving speech recognition and analysis becomes available for indexing and structuring and can provide users with improved access and search. The set of new domains standing to benefit from these techniques encompasses talkshows, lectures, meetings, interviews, debates,...
conference paper 2010
document
Koelewijn, T. (author), Bronkhorst, A. (author), Theeuwes, J. (author), TNO Defensie en Veiligheid (author)
Multisensory integration and crossmodal attention have a large impact on how we perceive the world. Therefore, it is important to know under what circumstances these processes take place and how they affect our performance. So far, no consensus has been reached on whether multisensory integration and crossmodal attention operate independently...
article 2010
document
Popescu-Belis, A. (author), Poller, P. (author), Kilgour, J. (author), Boertjes, E. (author), Carletta, J. (author), Castronovo, S. (author), Fapso, M. (author), Flynn, M. (author), Nanchen, A. (author), Wilson, T. (author), de Wit, J. (author), Yazdan, M. (author)
The AMIDA Automatic Content Linking Device (ACLD) monitors a conversation using automatic speech recognition (ASR), and uses the detected words to retrieve documents that are of potential use to the participants in the conversation. The document set that is available includes project related documents such as reports, memos or emails, as well as...
conference paper 2009
document
Truong, K.P. (author), van Leeuwen, D.A. (author), Neerincx, M.A. (author), de Jong, F.M.G. (author), TNO Defensie en Veiligheid (author)
In this paper, we describe emotion recognition experiments carried out for spontaneous affective speech with the aim to compare the added value of annotation of felt emotion versus annotation of perceived emotion. Using speech material available in the TNO-GAMING corpus (a corpus containing audiovisual recordings of people playing videogames),...
conference paper 2009
document
Neerincx, M.A. (author), Cremers, A.H.M. (author), Kessens, J.M. (author), van Leeuwen, D.A. (author), Truong, K.P. (author), TNO Defensie en Veiligheid (author)
This paper presents a methodology to apply speech technology for compensating sensory, motor, cognitive and affective usage difficulties. It distinguishes (1) an analysis of accessibility and technological issues for the identification of context-dependent user needs and corresponding opportunities to include speech in multimodal user interfaces...
article 2009
document
Huijbregts, M. (author), van Leeuwen, D.A. (author), de Jong, F.M.G. (author), TNO Defensie en Veiligheid (author)
In this paper we present a method for combining multiple diarization systems into one single system by applying a majority voting scheme. The voting scheme selects the best segmentation purely on basis of the output of each system. On our development set of NIST Rich Transcription evaluation meetings the voting method improves our system on all...
conference paper 2009
document
Beerends, J.G. (author), van Buuren, R. (author), van Vugt, J. (author), Verhave, J. (author), TNO Defensie en Veiligheid TNO Informatie- en Communicatietechnologie (author)
The relation between subjective and objective speech intelligibility measurements is researched. For a large series of speech degradations, noise, linear and nonlinear distortions (speech codecs), intelligibility tests were carried out using short CVC words. In the subjective domain the percentage correctly identified words is taken as the...
article 2009
document
Zekveld, A.A. (author), Kramer, S.E. (author), Kessens, J.M. (author), Vlaming, M.S.M.G. (author), Houtgast, T. (author), TNO Defensie en Veiligheid (author)
This study examined the subjective benefit obtained from automatically generated captions during telephone-speech comprehension in the presence of babble noise. Short stories were presented by telephone either with or without captions that were generated offline by an automatic speech recognition (ASR) system. To simulate online ASR, the word...
article 2009
document
van Leeuwen, D.A. (author), Kessens, J. (author), Sanders, E. (author), van den Heuvel, H. (author), TNO Defensie en Veiligheid (author)
In this paper we report the results of a Dutch speech recognition system evaluation held in 2008. The evaluation contained material in two domains: Broadcast News (BN) and Conversational Telephone Speech (CTS) and in two main accent regions (Flemish and Dutch). In total 7 sites submitted recognition results to the evaluation, totalling 58...
conference paper 2009
document
Larson, M. (author), Ordelman, R. (author), de Jong, F. (author), Kraaij, W. (author), Kohler, J. (author), TNO Informatie- en Communicatietechnologie (author)
conference paper 2009
document
Orr, R. (author), van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this study, we explore a human benchmark in language recognition, for the purpose of comparing human performance to machine performance in the context of the NIST LRE 2007. Humans are categorised in terms of language proficiency, and performance is presented per proficiency. The main challenge in this work is the design of a test and...
conference paper 2009
document
Huijbregts, M. (author), van Leeuwen, D.A. (author), de Jong, F.M.G. (author), TNO Defensie en Veiligheid (author)
In this paper we present the two-pass speaker diarization system that we developed for the NIST RT09s evaluation. In the first pass of our system a model for speech overlap detection is generated automatically. This model is used in two ways to reduce the diarization errors due to overlapping speech. First, it is used in a second diarization...
conference paper 2009
document
van Leeuwen, D.A. (author), TNO Defensie en Veiligheid (author)
In this paper we propose a framework for measuring the overall performance of an automatic speaker recognition system using a set of trials of a heterogeneous evaluation such as NIST SRE-2008, which combines several acoustic conditions in one evaluation. We do this by weighting trials of different conditions according to their relative...
conference paper 2009
document
TNO Defensie en Veiligheid (author), Truong, K.P. (author)
doctoral thesis 2009
document
TNO Informatie- en communicatietechnologie (author), Beerends, J.G. (author), van Buuren, R.A. (author), van Vugt, J.M. (author), Verhave, J.A. (author)
Several measurement techniques exist to quantify the intelligibility of a speech transmission chain. In the objective domain, the Articulation Index [1] and the Speech Transmission Index STI [2], [3], [4], [5] have been standardized for predicting intelligibility. The STI uses a signal that contains spectro-temporal characteristics similar to...
conference paper 2009
document
Looije, R. (author), Melder, W.A. (author), Neerincx, M.A. (author), TNO Defensie en Veiligheid (author)
Learning a process control task, such as tactical picture compilation in the Navy, is difficult, because the students have to spend their limited cognitive resources both on the complex task itself and the act of learning. In addition to the resource limits, motivation can be reduced when learning progress is slow. Intelligent Virtual Agents may...
conference paper 2008