Searched for: subject:"speech"
(1 - 20 of 140)

document
Saravanan, A. (author), Tsfasman, A. (author), Neerincx, M.A. (author), Oertel, C. (author)
In ongoing and consecutive conversations with persons, a social robot has to determine which aspects to remember and how to address them in the conversation. In the health domain, important aspects concern the health-related goals, the experienced progress (expressed sentiment) and the ongoing motivation to pursue them. Despite the progress in...
conference paper 2022
document
Braat-Eggen, E. (author), Reinten, J. (author), Hornikx, M. (author), Kohlrausch, A. (author)
Students can be disturbed by background noise while working in an open-plan study environment. To improve the acoustic quality of open-plan study environments a study was done on the influence of different sound scenarios on students working on a typical student task, “studying for an exam”. Three sound scenarios and a quiet reference sound...
article 2021
document
Berkhoff, A. (author)
conference paper 2020
document
Beerends, J.G. (author), Neumann, N.M.P. (author), van den Broek, E.L. (author), Casanovas, A.L. (author), Menendez, J.T. (author), Schmidmer, C. (author), Berger, J. (author)
With the introduction of fullband speech coding the question arises what role frequency components above 14 kHz play in speech quality assessment. On the one hand, our results show that bandwidth limitation from 24 kHz down to 14 kHz is not audible to even the most critical subject. On the other hand, 14-24 kHz band limited, audible levels of...
article 2020
document
Počta, P. (author), Beerends, J.G. (author)
This paper assesses the perceived listening quality of customer support waiting loops using both a subjective experiment as well as an objective perceptual assessment with POLQA (ITU-T Rec. P.863) and VISQOL. A modified version of the methodology defined in the ITU-T Rec. P.835 was derived and used in this paper to subjectively assess the...
article 2019
document
van Leeuwen, D.A. (author)
This paper investigates the task of linking speakers across multiple recordings, which can be accomplished by speaker clustering. Various aspects are considered, such as computational complexity, on/offline approaches, and evaluation measures but also speaker recognition approaches. It has not been the aim of this study to optimize clustering...
conference paper 2019
document
Gallardo, L.F. (author), Mittag, G. (author), Möller, S. (author), Beerends, J.G. (author)
In telephone conversations, transmitted speech of good to excellent quality is desired for enhanced Quality of Experience and to sustain lasting customer loyalty. Subjective mean opinion scores account for perceived transmitted quality, while instrumental models, such as POLQA, are able to estimate the subjective judgments. To perform subjective...
conference paper 2018
document
van der Sluis, F. (author), Van Den Broek, E.L. (author), Van Drunen, A. (author), Beerends, J.G. (author)
Available bandwidth is still a limiting factor for mobile communication applications. Multisensory communication has already been identified as a possibility to moderate this limitation. One of the strengths of mobile communication lies in its combination of visual and auditory modalities. However, one of the most salient features of mobile...
conference paper 2018
document
Uilenburg, N. (author), Wiefferink, K. (author), Verkerk, P.H. (author), van Denderen, M. (author), van Schie, C. (author), Oudesluys-Murphy, A.M. (author)
Purpose: A screening tool called the VTO Language Screening Instrument (VTO-LSI) was developed to enable more uniform and earlier detection of language impairment. This report, consisting of 2 retrospective studies, focuses on the effects of using the VTO-LSI compared to regular detection procedures. Method: Study 1 retrospectively compared VTO...
article 2018
document
Ding, D. (author), Neerincx, M.A. (author), Brinkman, W.P. (author)
Introduction: Inner voice is estimated to occur during at least a quarter of people’s conscious waking life. Much research asserts that inner voice plays various important roles in cognitive functions, such as self-regulation and self-reflection. Virtual cognitions are a stream of simulated thoughts people can hear while immersed in a...
article 2018
document
Heiligers, M.J.C. (author), Huizing, A.G. (author)
conference paper 2018
document
Friedman, D. (author), Brouwer, A.M. (author), Nijholt, A. (author)
In principle, brain-computer interfaces (BCIs) hold the promise for being the ultimate intelligent interfaces - what could surpass an interface that is able to interpret your thoughts and preferences, in real time, and behave accordingly? In practice, it is still not quite clear if and how BCIs can contribute to or replace existing interaction...
conference paper 2017
document
Počta, P. (author), Beerends, J.G. (author)
This paper deals with the intelligibility of speech coded by the STANAG 4591 standard codec, including packet loss, using synthesized speech input. Both subjective and objective assessments are used. It is shown that this codec significantly degrades intelligibility when compared to a standard narrowband filtered version of the synthesized...
article 2017
document
Uguru, J.O. (author), Beerends, J.G. (author), Ebem, D. (author)
This study examines the level of speech recognition of English and Igbo utterances by 70 grade four children. The children, whose mother tongue is Igbo and aged between 8 and 10 years, had English monosyllabic words as well as Igbo monosyllabic and disyllabic words dictated to them in noisy and quiet classrooms. The results show that in noise,...
article 2017
document
Gallardo, L.F. (author), Möller, S. (author), Beerends, J. (author)
The performance of automatic speech recognition based on coded-decoded speech heavily depends on the quality of the transmitted signals, determined by channel impairments. This paper examines relationships between speech recognition performance and measurements of speech quality and intelligibility over transmission channels. Different to...
conference paper 2017
document
Lefter, I. (author), Burghouts, G.J. (author), Rothkrantz, L.J.M. (author)
This paper investigates how speech and gestures convey stress, and how they can be used for automatic stress recognition. As a first step, we look into how humans use speech and gestures to convey stress. In particular, for both speech and gestures, we distinguish between stress conveyed by the intended semantic message (e.g. spoken words for...
article 2016
document
Bronkhorst, A.W. (author)
How do we recognize what one person is saying when others are speaking at the same time? This review summarizes widespread research in psychoacoustics, auditory scene analysis, and attention, all dealing with early processing and selection of speech, which has been stimulated by this question. Important effects occurring at the peripheral and...
article 2015
document
Počta, P. (author), Beerends, J.G. (author)
This paper investigates the impact of different telephone channels, represented by impairments as introduced by modern telecommunication networks (e.g. speech coding, bandwidth limitation, packet loss, etc.), on the intelligibility of synthesized speech. Both subjective and objective assessments are used. Two different speech intelligibility...
article 2015
document
Tielman, M. (author), van Meggelen, M. (author), Neerincx, M.A. (author), Brinkman, W.P. (author)
Internet-based guided self-therapy systems provide a novel method for Post-Traumatic Stress Disorder patients to follow therapy at home with the assistance of a virtual coach. One of the main challenges for such a coach is assisting patients with recollecting their traumatic memories, a vital part of therapy. In this paper, an ontology-based...
conference paper 2015
document
Lefter, I. (author), Burghouts, G.J. (author), Rothkrantz, L.J.M. (author)
Stressful situations are likely to occur at human-operated service desks, as well as at human-computer interfaces used in the public domain. Automatic surveillance can help by notifying when extra assistance is needed. Human communication is inherently multimodal, e.g. speech, gestures, facial expressions. It is expected that automatic surveillance...
article 2014