Multimodal Adaptive Interfaces

Open Access

- 01 Jan 1998

13

TL;DR: Depending on the task at hand and the user’s preferences, she will use a combination of speech and gesture in different ways to communicate her intent.

Citations

•Journal Article•10.1016/S0921-8890(03)00012-5

An Approach to Multi-modal Human-Machine Interaction for Intelligent Service Robots

Hans-Joachim Böhme, +6 more

- 31 Jul 2003

- Robotics and Autonomous Systems

TL;DR: A multi-modal scheme for human–robot interaction suited for a wide range of intelligent service robot applications and some reliable methods for vision-based interaction, sound analysis and speech output are developed.

84

Journal Article•10.1109/TSMCC.2004.826278

A social informatics approach to human-robot interaction with a service social robot

Christine L. Lisetti, +3 more

- 01 May 2004

TL;DR: Results indicate that individuals believe emotion and personality capabilities would make service robots more acceptable in everyday roles in human life, prefer that robots communicate via human-like facial expressions, voice, and text-based media, and become more positive about the idea of service and social robots after exposure to the technology.

68

Patent

Automatic pronunciation scoring for language learning

Sunil K. Gupta, +2 more

- 03 Jul 2002

TL;DR: A method and apparatus for generating a pronunciation score: a user phrase intended to conform to a reference phrase is received and processed by at least one of an articulation-scoring engine, a duration-scoring engine, and an intonation-scoring engine to derive the pronunciation score.

38

•Proceedings Article

Learning words from natural audio-visual input.

Deb Roy, +1 more

- 01 Jan 1998

TL;DR: A model of early word learning which learns from natural audio and visual input is presented, which has been successfully implemented to learn words and their audio-visual grounding from camera and microphone input.

27

•Dissertation

Understanding expressive action

Christopher R. Wren, +1 more

- 01 Jan 2000

TL;DR: This dissertation examines a body of sophisticated perceptual mechanisms developed in response to these needs as well as a selection of human-computer interface sketches designed to push the technology forward and explore the possibilities of this novel interface idiom.

17

References

•Book

Elements of information theory

Thomas M. Cover, +1 more

- 01 Jan 1991

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.

52.2K

Journal Article•10.1109/5.18626

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

- 01 Feb 1989

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

24.3K

•Journal Article•10.1109/5.58337

Backpropagation through time: what it does and how to do it

Paul J. Werbos

- 01 Jan 1990

TL;DR: This paper first reviews basic backpropagation, a simple method which is now being widely used in areas like pattern recognition and fault diagnosis, and describes further extensions of this method, to deal with systems other than neural networks, systems involving simultaneous equations or true recurrent networks, and other practical issues which arise with this method.

5.4K

Journal Article•10.1109/89.326616

RASTA processing of speech

Hynek Hermansky, +1 more

- 01 Oct 1994

- IEEE Transactions on Speech and Audio Pr...

TL;DR: The theoretical and experimental foundations of the RASTA method are reviewed, the relationship with human auditory perception is discussed, the original method is extended to combinations of additive noise and convolutional noise, and an application is shown to speech enhancement.

2.1K

Journal Article•10.1145/32206.32212

The vocabulary problem in human-system communication

George W. Furnas, +3 more

- 01 Nov 1987

- Communications of The ACM

TL;DR: It is shown how this fundamental property of language limits the success of various design methodologies for vocabulary-driven interaction, and an optimal strategy, unlimited aliasing, is derived and shown to be capable of several-fold improvements.

1.6K

Related Papers (5)

Talking and Thinking With Our Hands

The role of gesture in communication and thinking

How pairs interact over a multimodal digital table

Gesture and the communicative intention of the speaker

Evaluating multimodal interaction with gestures and speech for point and select tasks