Conference

Perception and Interactive Technologies

About: Perception and Interactive Technologies is an academic conference. The conference publishes majorly in the area(s): User interface & Perception. Over the lifetime, 63 publications have been published by the conference receiving 975 citations.

...read moreread less

Topics: User interface, Perception, Computer science, Visual perception, Dialog system ...read more

Conference Tools

Create Scientific Poster

Create Conference poster

Create Presentation with AI

Papers published on a yearly basis

Papers

Book Chapter•10.1007/978-3-540-69369-7_12•

Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech

[...]

Björn Schuller¹, Florian Eyben¹, Gerhard Rigoll¹•Institutions (1)

Technische Universität München¹

16 Jun 2008

TL;DR: Two different strategies for robust discrimination of non-verbal vocalisations such as laughter, breathing, hesitation, and consent are discussed: dynamic modelling by a broad selection of diverse acoustic Low-Level-Descriptors vs. static modelling by projection of these via statistical functionals onto a 0.6k feature space with subsequent de-correlation.

...read moreread less

Abstract: Non-verbal vocalisations such as laughter, breathing, hesitation, and consent play an important role in the recognition and understanding of human conversational speech and spontaneous affect. In this contribution we discuss two different strategies for robust discrimination of such events: dynamic modelling by a broad selection of diverse acoustic Low-Level-Descriptors vs. static modelling by projection of these via statistical functionals onto a 0.6k feature space with subsequent de-correlation. As classifiers we employ Hidden Markov Models, Conditional Random Fields, and Support Vector Machines, respectively. For discussion of extensive parameter optimisation test-runs with respect to features and model topology, 2.9k non-verbals are extracted from the spontaneous Audio-Visual Interest Corpus. 80.7% accuracy can be reported with, and 92.6% without a garbage model for the discrimination of the named classes.

...read moreread less

60 citations

Book Chapter•10.1007/11768029_25•

A single-camera remote eye tracker

[...]

André Meyer¹, Martin Böhme¹, Thomas Martinetz¹, Erhardt Barth¹•Institutions (1)

University of Lübeck¹

19 Jun 2006

TL;DR: This work uses eye trackers for improved visual communication through gaze guidance and for Augmentative and Alternative Communication (AAC) and is interested in less invasive eye tracking techniques.

...read moreread less

Abstract: Many eye-tracking systems either require the user to keep their head still or involve cameras or other equipment mounted on the user's head. While acceptable for research applications, these limitations make the systems unsatisfactory for prolonged use in interactive applications. Since the goal of our work is to use eye trackers for improved visual communication through gaze guidance [1,2] and for Augmentative and Alternative Communication (AAC) [3], we are interested in less invasive eye tracking techniques.

...read moreread less

56 citations

Book Chapter•10.1007/11768029_6•

Emotion recognition using physiological and speech signal in short-term observation

[...]

Jonghwa Kim¹, Elisabeth André¹•Institutions (1)

University of Augsburg¹

19 Jun 2006

TL;DR: This paper analyzes various methods for fusing physiological and vocal information and compares the recognition results of the bimodal recognition approach with theresults of the unimodal approach.

...read moreread less

Abstract: Recently, there has been a significant amount of work on the recognition of emotions from visual, verbal or physiological information. Most approaches to emotion recognition so far concentrate, however, on a single modality while work on the integration of multimodal information, in particular on fusing physiological signals with verbal or visual data, is scarce. In this paper, we analyze various methods for fusing physiological and vocal information and compare the recognition results of the bimodal recognition approach with the results of the unimodal approach.

...read moreread less

56 citations

Book Chapter•10.1007/978-3-540-69369-7_32•

Evaluation Methods for Multimodal Systems: A Comparison of Standardized Usability Questionnaires

[...]

Ina Wechsung¹, Anja Naumann¹•Institutions (1)

Deutsche Telekom¹

16 Jun 2008

TL;DR: The results show that standardized questionnaires are applicable only to a limited extent and indicate the need for the development of a valid and reliable questionnaire covering the usability and quality of multimodal systems.

...read moreread less

Abstract: Different questionnaires assessing the usability of two multimodal systems and one unimodal system were compared. Each participant (N=21) performed several tasks with each device and was afterwards asked to rate the system by filling out different questionnaires. The results show that standardized questionnaires are applicable only to a limited extent. Despite some concordance, the results differ considerably and thus indicate the need for the development of a valid and reliable questionnaire covering the usability and quality of multimodal systems.

...read moreread less

43 citations

Book Chapter•10.1007/11768029_2•

Detection of head pose and gaze direction for human-computer interaction

[...]

Ulrich Weidenbacher¹, Georg Layher¹, Pierre Bayerl¹, Heiko Neumann¹•Institutions (1)

University of Ulm¹

19 Jun 2006

TL;DR: It is demonstrated that a simple implementation of the proposed methods without extensive training sessions or calibration is sufficient to accurately detect the head pose for human-computer interaction.

...read moreread less

Abstract: In this contribution we extend existing methods for head pose estimation and investigate the use of local image phase for gaze detection. Moreover we describe how a small database of face images with given ground truth for head pose and gaze direction was acquired. With this database we compare two different computational approaches for extracting the head pose. We demonstrate that a simple implementation of the proposed methods without extensive training sessions or calibration is sufficient to accurately detect the head pose for human-computer interaction. Furthermore, we propose how eye gaze can be extracted based on the outcome of local filter responses and the detected head pose. In all, we present a framework where different approaches are combined to a single system for extracting information about the attentional state of a person.

...read moreread less

36 citations

...

Expand

Performance Metrics

Papers

424

Citations

No. of papers from the Conference in previous years
Year	Papers
2008	37
2006	26