Voice command device

Topic Tools

Papers published on a yearly basis

Papers

Patent•

Automobile personal computer systems

[...]

24 Jan 2000

TL;DR: In this paper, an automobile personal computer system is provided, where a user can wirelessly interact with merchants, communications facilities, information providers, computers at the home or office, and other entities.

...read moreread less

Abstract: An automobile personal computer system is provided. A user of the system may wirelessly interact with merchants, communications facilities, information providers, computers at the home or office, and other entities. Such interactions may involve local wireless links and remote wireless links. Wireless communications may involve satellite transmissions, cellular transmissions, short-range wireless transmissions, etc. Products may be purchased using voice commands or by interacting with displays in the automobile. The automobile's location and functions may be monitored and controlled. Location information and other information particular to the user may be used to target promotions to the user. The user may obtain information on the goods or services available at a merchant while driving and may initiate a purchase transaction for those goods or services.

...read moreread less

1,296 citations

Patent•

Method and system for accessing CRM data via voice

[...]

Shannon Jones, Richard Gorman, Jesse Ambrose, Joseph Harb, Chris Haven - Show less +1 more

4 Jan 2002

TL;DR: In this paper, the system includes a voice recognition unit and a speech processing server that work together to enable users to interact with the system using voice commands guided by navigation context sensitive voice prompts, and provide user-requested data in a verbalized format back to the users.

...read moreread less

Abstract: A system and method for providing access to CRM data via a voice interface. In one embodiment, the system includes a voice recognition unit and a speech processing server that work together to enable users to interact with the system using voice commands guided by navigation context sensitive voice prompts, and provide user-requested data in a verbalized format back to the users. Digitized voice waveform data are processed to determine the voice commands of the user. The system also uses a “grammar” that enables users to retrieve data using intuitive natural language speech queries. In response to such a query, a corresponding data query is generated by the system to retrieve one or more data sets corresponding to the query. The user is then enabled to browse the data that are returned through voice command navigation, wherein the system “reads” the data back to the user using text-to-speech (TTS) conversion.

...read moreread less

1,188 citations

Patent•

System and method for providing and using universally accessible voice and speech data files

[...]

Premkumar V. Uppaluru¹•Institutions (1)

Nuance Communications¹

25 Jan 2002

TL;DR: In this paper, a system and method for universal access to voice-based documents containing information formatted using MIME and HTML standards using customized extensions for voice information access and navigation is presented.

...read moreread less

Abstract: A system and method provides universal access to voice-based documents containing information formatted using MIME and HTML standards using customized extensions for voice information access and navigation. These voice documents are linked using HTML hyper-links that are accessible to subscribers using voice commands, touch-tone inputs and other selection means. These voice documents and components in them are addressable using HTML anchors embedding HTML universal resource locators (URLs) rendering them universally accessible over the Internet. This collection of connected documents forms a voice web. The voice web includes subscriber-specific documents including speech training files for speaker dependent speech recognition, voice print files for authenticating the identity of a user and personal preference and attribute files for customizing other aspects of the system in accordance with a specific subscriber.

...read moreread less

983 citations

Proceedings Article•

Hidden voice commands

[...]

Nicholas Carlini¹, Pratyush Mishra¹, Tavish Vaidya², Yuankai Zhang², Micah Sherr², Clay Shields², David Wagner¹, Wenchao Zhou² - Show less +4 more•Institutions (2)

University of California, Berkeley¹, Georgetown University²

10 Aug 2016

TL;DR: This paper explores in this paper how voice interfaces can be attacked with hidden voice commands that are unintelligible to human listeners but which are interpreted as commands by devices.

...read moreread less

Abstract: Voice interfaces are becoming more ubiquitous and are now the primary input method for many devices. We explore in this paper how they can be attacked with hidden voice commands that are unintelligible to human listeners but which are interpreted as commands by devices. We evaluate these attacks under two different threat models. In the black-box model, an attacker uses the speech recognition system as an opaque oracle. We show that the adversary can produce difficult to understand commands that are effective against existing systems in the black-box model. Under the white-box model, the attacker has full knowledge of the internals of the speech recognition system and uses it to create attack commands that we demonstrate through user testing are not understandable by humans. We then evaluate several defenses, including notifying the user when a voice command is accepted; a verbal challenge-response protocol; and a machine learning approach that can detect our attacks with 99.8% accuracy.

...read moreread less

635 citations

Proceedings Article•10.1145/3133956.3134052•

DolphinAtack: Inaudible Voice Commands

[...]

Guoming Zhang, Chen Yan, Xiaoyu Ji, Taimin Zhang, Tianchen Zhang, Wenyuan Xu - Show less +2 more

31 Aug 2017-arXiv: Cryptography and Security

TL;DR: DolphinAttack as discussed by the authors is a completely inaudible attack that modulates voice commands on ultrasonic carriers (e.g., f > 20 kHz) to achieve inaudibility.

...read moreread less

Abstract: Speech recognition (SR) systems such as Siri or Google Now have become an increasingly popular human-computer interaction method, and have turned various systems into voice controllable systems(VCS). Prior work on attacking VCS shows that the hidden voice commands that are incomprehensible to people can control the systems. Hidden voice commands, though hidden, are nonetheless audible. In this work, we design a completely inaudible attack, DolphinAttack, that modulates voice commands on ultrasonic carriers (e.g., f > 20 kHz) to achieve inaudibility. By leveraging the nonlinearity of the microphone circuits, the modulated low frequency audio commands can be successfully demodulated, recovered, and more importantly interpreted by the speech recognition systems. We validate DolphinAttack on popular speech recognition systems, including Siri, Google Now, Samsung S Voice, Huawei HiVoice, Cortana and Alexa. By injecting a sequence of inaudible voice commands, we show a few proof-of-concept attacks, which include activating Siri to initiate a FaceTime call on iPhone, activating Google Now to switch the phone to the airplane mode, and even manipulating the navigation system in an Audi automobile. We propose hardware and software defense solutions. We validate that it is feasible to detect DolphinAttack by classifying the audios using supported vector machine (SVM), and suggest to re-design voice controllable systems to be resilient to inaudible voice command attacks.

...read moreread less

488 citations

...

Expand

Year	Papers
2025	8
2024	11
2023	78
2022	131
2021	123
2020	168

Topic Tools

Papers published on a yearly basis

Papers

Automobile personal computer systems

Method and system for accessing CRM data via voice

System and method for providing and using universally accessible voice and speech data files

Hidden voice commands

DolphinAtack: Inaudible Voice Commands

Related Topics (5)

Performance Metrics