Conference
Perception and Interactive Technologies
About: Perception and Interactive Technologies is an academic conference. The conference publishes majorly in the area(s): User interface & Perception. Over the lifetime, 63 publications have been published by the conference receiving 975 citations.
Papers
16 Jun 2008
TL;DR: Two different strategies for robust discrimination of non-verbal vocalisations such as laughter, breathing, hesitation, and consent are discussed: dynamic modelling by a broad selection of diverse acoustic Low-Level-Descriptors vs. static modelling by projection of these via statistical functionals onto a 0.6k feature space with subsequent de-correlation.
Abstract: Non-verbal vocalisations such as laughter, breathing, hesitation, and consent play an important role in the recognition and understanding of human conversational speech and spontaneous affect. In this contribution we discuss two different strategies for robust discrimination of such events: dynamic modelling by a broad selection of diverse acoustic Low-Level-Descriptors vs. static modelling by projection of these via statistical functionals onto a 0.6k feature space with subsequent de-correlation. As classifiers we employ Hidden Markov Models, Conditional Random Fields, and Support Vector Machines, respectively. For discussion of extensive parameter optimisation test-runs with respect to features and model topology, 2.9k non-verbals are extracted from the spontaneous Audio-Visual Interest Corpus. 80.7% accuracy can be reported with, and 92.6% without a garbage model for the discrimination of the named classes.
60 citations
19 Jun 2006
TL;DR: This work uses eye trackers for improved visual communication through gaze guidance and for Augmentative and Alternative Communication (AAC) and is interested in less invasive eye tracking techniques.
Abstract: Many eye-tracking systems either require the user to keep their head still or involve cameras or other equipment mounted on the user's head. While acceptable for research applications, these limitations make the systems unsatisfactory for prolonged use in interactive applications. Since the goal of our work is to use eye trackers for improved visual communication through gaze guidance [1,2] and for Augmentative and Alternative Communication (AAC) [3], we are interested in less invasive eye tracking techniques.
56 citations
19 Jun 2006
TL;DR: This paper analyzes various methods for fusing physiological and vocal information and compares the recognition results of the bimodal recognition approach with theresults of the unimodal approach.
Abstract: Recently, there has been a significant amount of work on the recognition of emotions from visual, verbal or physiological information. Most approaches to emotion recognition so far concentrate, however, on a single modality while work on the integration of multimodal information, in particular on fusing physiological signals with verbal or visual data, is scarce. In this paper, we analyze various methods for fusing physiological and vocal information and compare the recognition results of the bimodal recognition approach with the results of the unimodal approach.
56 citations
16 Jun 2008
TL;DR: The results show that standardized questionnaires are applicable only to a limited extent and indicate the need for the development of a valid and reliable questionnaire covering the usability and quality of multimodal systems.
Abstract: Different questionnaires assessing the usability of two multimodal systems and one unimodal system were compared. Each participant (N=21) performed several tasks with each device and was afterwards asked to rate the system by filling out different questionnaires. The results show that standardized questionnaires are applicable only to a limited extent. Despite some concordance, the results differ considerably and thus indicate the need for the development of a valid and reliable questionnaire covering the usability and quality of multimodal systems.
43 citations
19 Jun 2006
TL;DR: It is demonstrated that a simple implementation of the proposed methods without extensive training sessions or calibration is sufficient to accurately detect the head pose for human-computer interaction.
Abstract: In this contribution we extend existing methods for head pose estimation and investigate the use of local image phase for gaze detection. Moreover we describe how a small database of face images with given ground truth for head pose and gaze direction was acquired. With this database we compare two different computational approaches for extracting the head pose. We demonstrate that a simple implementation of the proposed methods without extensive training sessions or calibration is sufficient to accurately detect the head pose for human-computer interaction. Furthermore, we propose how eye gaze can be extracted based on the outcome of local filter responses and the detected head pose. In all, we present a framework where different approaches are combined to a single system for extracting information about the attentional state of a person.
36 citations
Performance Metrics
| Year | Papers |
|---|---|
| 2008 | 37 |
| 2006 | 26 |