Patent
System, method and computer program product for performing unstructured information management and automatic text analysis, and providing multiple document views derived from different document tokenizations
Andrei Z. Broder,David Carmel,Arthur Charles Ciccolo,David A. Ferrucci,Yoelle Maarek,Yosi Mass,Aya Soffer,Wlodek Zadrozny +7 more
- 30 May 2003
167
TL;DR: In this paper, the authors present a system architecture, components and a searching technique for an unstructured information management system (UIMS), which is provided as middleware for the effective management and interchange of unstructuring information over a wide array of information sources.
read more
Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. Also disclosed is system, method and computer program product to process document data. The method includes inputting a document and operating at least one text analysis engine that comprises a plurality of coupled annotators for tokenizing document data for identifying and annotating a particular type of semantic content. Operating the at least one text analysis engine generates a plurality of views of a document, where each of the plurality of views are derived from a different tokenization of the document. The method further includes storing the plurality of views in a common data structure associated with the document.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Robotic catheter system
Daniel T. Wallace,Frederic H. Moll,Robert G. Younge,Kenneth M. Martin,Gregory J. Stahler,David F. Moore,Daniel T. Adams,Michael R. Zinn,Gunter Niemeyer +8 more
- 03 Jul 2006
TL;DR: In this paper, an elongate guide instrument has a base, distal end, and a working lumen, wherein the guide instrument base is operatively coupled to the interface.
1.1K
Patent
Full text query and search systems and methods of use
Yuanhua Tang,Qianjin Hu,Yonghong Yang +2 more
- 25 Oct 2005
TL;DR: The Shannon Information score as discussed by the authors is a ranking system that uses p-values to represent the likelihood that a hit is due to random matches, and users can specify the parameters that determine hits and their ranking based on phrase matches and sentence similarities.
519
Patent
System and method for providing answers to questions
Eric W. Brown,David A. Ferrucci,Adam Lally,Wlodek Zadrozny +3 more
- 14 May 2008
TL;DR: In this paper, a system, method and computer program product for providing answers to questions based on any corpus of data is presented, which facilitates generating a number of candidate passages from the corpus that answer an input query, and finds the correct resulting answer by collecting supporting evidence from the multiple passages.
419
Patent
Questions and answers generation
Pablo Ariel Duboué,David A. Ferrucci,David C. Gondek,James W. Murdock,Wlodek Zadrozny +4 more
- 15 Mar 2010
TL;DR: In this article, a system, method and/or computer program product for automatically generating questions and answers based on any corpus of data is presented, given a collection of textual documents, automatically generating collections of questions about the documents together with answers to those questions.
371
Patent
Providing answers to questions using multiple models to score candidate answers
Eric W. Brown,David A. Ferrucci,James W. Murdock +2 more
- 23 Sep 2011
TL;DR: In one embodiment, the method comprises receiving an input query; conducting a search to identify candidate answers to the input query, and producing a plurality of scores for each of the candidate answers as discussed by the authors.
312
References
Patent
Graphic user interface for database system
Andrew J. Szabo
- 23 Dec 1996
TL;DR: In this paper, a graphical user interface method for representing a search of a database, providing a plurality of stylized Venn diagrams each representing an intersection of at least two sets, is presented.
881
Patent
Integration platform for heterogeneous databases
Matthew Morgenstern
- 04 Nov 1997
TL;DR: A method for processing heterogeneous data including high level specifications to drive program generation of information mediators, inclusion of structured file formats (also referred to as data interface languages) in a uniform manner with heterogeneous database schema, development of a uniform data description language across a wide range of data schemas and structured formats, and use of annotations to separate out from such specifications.
543
Patent
Natural language information retrieval system and method
Carolina Rubio de Hita,David van den Akker,Erik C. E. Govaers,Frank M. J. Platteau,Kurt Van Deun,Melissa Macpherson,Peter de Bie,Sophie Laviolette +7 more
- 22 Aug 1997
TL;DR: In this article, an information retrieval system that represents the content of a language-based database being searched as well as the user's natural language query is presented. And the system includes a non-real-time development system for automatically creating a database index having one or more contentbased database keywords.
527
Patent
Mechanism and apparatus for using messages to look up documents stored in spaces in a distributed computing environment
Gregory L. Slaughter,Thomas E. Saulpaugh,Bernard A. Traversat,Mohamed M. Abdelaziz,Michael J. Duigou +4 more
- 12 Sep 2000
TL;DR: In this article, a system and method for searching for documents within spaces in a distributed computing environment are provided, where a client sends a lookup message to a space which stores documents, and a set of zero or more documents which match the lookup message are discovered.
458
Patent
Method and system for optimally searching a document database using a representative semantic space
Matthew S. Sommer,Kevin B. Thompson +1 more
- 24 Jan 2005
TL;DR: In this paper, a term-by-document matrix is compiled from a corpus of documents representative of a particular subject matter that represents the frequency of occurrence of each term per document.
425
Related Papers (5)
Yuanhua Tang,Qianjin Hu,Yonghong Yang +2 more
- 25 Oct 2005
John R. Ripley
- 14 Feb 2003
Edward A. Green,Kevin L. Markey,Kreider Mark L +2 more
- 21 Oct 2004