Cognitive print speaker modeler

Patent

- 31 Oct 2019

2

TL;DR: A hierarchical long short-term memory (LSTM) model is used to identify a speaker in a streaming video with audio by matching the words spoken by the speaker to a cognitive print.

Citations

Patent

Dialogue system, vehicle and method for controlling the vehicle

Kim Kye Yoon, +7 more

- 15 Nov 2018

TL;DR: A dialogue system, a vehicle, and a method for controlling the vehicle are described. The method includes acquiring an utterance and a speech pattern by recognizing speech when the speech of a plurality of speakers is input through a speech input device.

9

Patent

Label generation method and device and computer readable storage medium

Zhou Chi

- 25 Sep 2020

TL;DR: A label generation method and device and a computer-readable storage medium are proposed. The method comprises the following steps: when a to-be-generated video is received, extracting at least one to-be-classified video frame from the to-be-generated video; obtaining a full-quantity classification model, and determining a plurality of model scheduling indexes corresponding to the full-quantity classification model for each to-be-classified video frame in the at least one to-be-classified video frame, wherein each model scheduling index in the plurality of model scheduling indexes represents the importance degree of ...

References

Report•10.6028/NIST.SP.800-145

The NIST Definition of Cloud Computing

Peter Mell, +1 more

- 28 Sep 2011

TL;DR: This cloud model promotes availability and is composed of five essential characteristics, three service models, and four deployment models.

17.6K

Proceedings Article•10.1109/ICASSP.2015.7178838

Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks

Tara N. Sainath, +3 more

- 08 Sep 2015

TL;DR: This paper takes advantage of the complementarity of CNNs, LSTMs and DNNs by combining them into one unified architecture, and finds that the CLDNN provides a 4-6% relative improvement in WER over an LSTM, the strongest of the three individual models.

1.9K

Proceedings Article•10.1109/CVPR.2016.497

Jointly Modeling Embedding and Translation to Bridge Video and Language

Yingwei Pan, +4 more

- 27 Jun 2016

TL;DR: This paper presents a unified framework, named Long Short-Term Memory with visual-semantic Embedding (LSTM-E), which simultaneously explores the learning of LSTM and visual-semantic embedding.

704

Patent

Application of Z-Webs and Z-factors to Analytics, Search Engine, Learning, Recognition, Natural Language, and Other Utilities

Saied Tadayon, +1 more

- 28 Feb 2013

TL;DR: This patent introduces Z-webs, including Z-factors and Z-nodes, for understanding relationships between objects, subjects, abstract ideas, and concepts, including face, car, images, people, emotions, mood, text, natural language, voice, music, video, locations, formulas, facts, historical data, landmarks, personalities, ownership, family, friends, love, happiness, social behavior, voting behavior, and the like.

398