Conference
Workshop on Perceptive User Interfaces
About: Workshop on Perceptive User Interfaces is an academic conference. The conference publishes majorly in the area(s): Computer science & User interface. Over the lifetime, 38 publications have been published by the conference receiving 1946 citations.
Papers
15 Nov 2001
TL;DR: Requirements for real-time barehanded interaction are defined, derived from application scenarios and usability considerations, and a finger-finding and hand-posture recognition algorithm is developed and evaluated.
Abstract: In this paper, we describe techniques for barehanded interaction between human and computer. Barehanded means that no device and no wires are attached to the user, who controls the computer directly with the movements of his/her hand.Our approach is centered on the needs of the user. We therefore define requirements for real-time barehanded interaction, derived from application scenarios and usability considerations. Based on those requirements a finger-finding and hand-posture recognition algorithm is developed and evaluated.To demonstrate the strength of the algorithm, we build three sample applications. Finger tracking and hand posture recognition are used to paint virtually onto the wall, to control a presentation with hand postures, and to move virtual items on the wall during a brainstorming session. We conclude the paper with user tests, which were conducted to prove the usability of bare-hand human computer interaction.
355 citations
15 Nov 2001
TL;DR: A vision-based system that detects head nods and head shakes in real time and can act as a useful and basic interface to a machine is described.
Abstract: Head nods and head shakes are non-verbal gestures used often to communicate intent, emotion and to perform conversational functions. We describe a vision-based system that detects head nods and head shakes in real time and can act as a useful and basic interface to a machine. We use an infrared sensitive camera equipped with infrared LEDs to track pupils. The directions of head movements, determined using the position of pupils, are used as observations by a discrete Hidden Markov Model (HMM) based pattern analyzer to detect when a head nod/shake occurs. The system is trained and tested on natural data from ten users gathered in the presence of varied lighting and varied facial expressions. The system as described achieves a real time recognition accuracy of 78.46% on the test dataset.
176 citations
15 Nov 2001
TL;DR: This paper describes an implemented system that combines multiple sources of knowledge to provide robust early processing for freehand sketching, and one of the most basic steps in converting the original digitized pen strokes in a sketch into the intended geometric objects.
Abstract: Freehand sketching is a natural and crucial part of everyday human interaction, yet is almost totally unsupported by current user interfaces. We are working to combine the exibility and ease of use of paper and pencil with the processing power of a computer, to produce a user interface for design that feels as natural as paper, yet is considerably smarter. One of the most basic steps in accomplishing this is converting the original digitized pen strokes in a sketch into the intended geometric objects. In this paper we describe an implemented system that combines multiple sources of knowledge to provide robust early processing for freehand sketching.
129 citations
15 Nov 2001
TL;DR: The design and implementation of a perceptual user interface for a responsive dialog-box agent that employs real-time computer vision to recognize user acknowledgements from head gestures (e.g., nod = yes) is presented.
Abstract: We present the design and implementation of a perceptual user interface for a responsive dialog-box agent that employs real-time computer vision to recognize user acknowledgements from head gestures (e.g., nod = yes). IBM Pupil-Cam technology together with anthropometric head and face measures are used to first detect the location of the user's face. Salient facial features are then identi ed and tracked to compute the global 2-D motion direction of the head. For recognition, timings of natural gesture motion are incorporated into a state-space model. The interface is presented in the context of an enhanced text editor employing a perceptual dialog-box agent.
94 citations
15 Nov 2001
TL;DR: This work has developed a system capable of estimating participants' focus of attention from multiple cues, and employs an omnidirectional camera to simultaneously track participants' faces around a meeting table and use neural networks to estimate their head poses.
Abstract: Estimating a person's focus of attention is useful for various human-computer interaction applications, such as smart meeting rooms, where a user's goals and intent have to be monitored. In work presented here, we are interested in modeling focus of attention in a meeting situation. We have developed a system capable of estimating participants' focus of attention from multiple cues. We employ an omnidirectional camera to simultaneously track participants' faces around a meeting table and use neural networks to estimate their head poses. In addition, we use microphones to detect who is speaking. The system predicts participants' focus of attention from acoustic and visual information separately, and then combines the output of the audio- and video-based focus of attention predictors. We have evaluated the system using the data from three recorded meetings. The acoustic information has provided 8% error reduction on average compared to using a single modality.
81 citations
Performance Metrics
| Year | Papers |
|---|---|
| 2001 | 38 |