Patent
Cognitive print speaker modeler
Amsterdam Jeff,Aaron K. Baughman,Hammer Stephen C,Provan David A +3 more
- 31 Oct 2019
2
TL;DR: In this paper, a hierarchical long short term model (LSTM) is used to identify a speaker in a streaming video with audio according to words spoken by the speaker matched to a cognitive print.
read more
Abstract: Aspects of the present invention provide devices that subtitle streaming video with audio and identify a speaker in a streaming video with audio according to words spoken by the speaker matched to a cognitive print. The cognitive print includes traits classified according a hierarchical long short term model (LSTM). The hierarchical LSTM includes layers of LSTMs and each layer corresponds to the classification of one trait. A processor annotates a subtitle of the words spoken by the speaker, which decorates the subtitle with a label representative of the identified speaker, and streams the decorated subtitle with the streaming video with audio.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Dialogue system, vehicle and method for controlling the vehicle
Kim Kye Yoon,Donghee Seok,Dongsoo Shin,Jeong-Eom Lee,Ga Hee Kim,Seona Kim,Park Jung Mi,HeeJin Ro +7 more
- 15 Nov 2018
TL;DR: In this article, a dialogue system, a vehicle and a method for controlling the vehicle is described, which includes: acquiring an utterance and a speech pattern by recognizing a speech when a speech of a plurality of speakers is input through a speech input device.
9
Patent
Label generation method and device and computer readable storage medium
Zhou Chi
- 25 Sep 2020
TL;DR: In this paper, the authors proposed a label generation method and device and a computer readable storage medium, which comprises the following steps: when a to-be-generated video is received, extracting at least one to be-classified video frame from the to- begenerated video; obtaining a fullquantity classification model, and determining a plurality of model scheduling indexes corresponding to the full-quantity classifier for each to-Be-classified classification model in the at least 1 to beclassified video frames, wherein each model scheduling index in the plurality of index represents the importance degree of
References
The NIST Definition of Cloud Computing
Peter Mell,Timothy Grance +1 more
- 28 Sep 2011
TL;DR: This cloud model promotes availability and is composed of five essential characteristics, three service models, and four deployment models.
17.6K
Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks
Tara N. Sainath,Oriol Vinyals,Andrew W. Senior,Hasim Sak +3 more
- 08 Sep 2015
TL;DR: This paper takes advantage of the complementarity of CNNs, LSTMs and DNNs by combining them into one unified architecture, and finds that the CLDNN provides a 4-6% relative improvement in WER over an LSTM, the strongest of the three individual models.
Jointly Modeling Embedding and Translation to Bridge Video and Language
Yingwei Pan,Tao Mei,Ting Yao,Houqiang Li,Yong Rui +4 more
- 27 Jun 2016
TL;DR: Liu et al. as discussed by the authors presented a unified framework, named Long Short-Term Memory with visual-semantic Embedding (LSTM-E), which can simultaneously explore the learning of LSTM and visualsemantic embedding.
Patent
Application of Z-Webs and Z-factors to Analytics, Search Engine, Learning, Recognition, Natural Language, and Other Utilities
Saied Tadayon,Bijan Tadayon +1 more
- 28 Feb 2013
TL;DR: In this paper, the authors introduce Z-webs, including Z-factors and Z-nodes, for the understanding of relationships between objects, subjects, abstract ideas, concepts, or the like, including face, car, images, people, emotions, mood, text, natural language, voice, music, video, locations, formulas, facts, historical data, landmarks, personalities, ownership, family, friends, love, happiness, social behavior, voting behavior, and the like.
398
Dense Captioning with Joint Inference and Visual Context
Linjie Yang,Kevin Tang,Jianchao Yang,Li-Jia Li +3 more
- 01 Jul 2017
TL;DR: Dense captioning is a newly emerging computer vision topic for understanding images with dense language descriptions.
259
Related Papers (5)
Nevenka Dimitrova,Andrew C. Miller,Dongge Li +2 more
- 28 Sep 2004
Shi Jiang,Cao Jianzhong +1 more
- 11 Jan 2017