Proceedings Article, DOI: 10.1145/1216295.1216328
Building data integration queries by demonstration
Rattapoom Tuchinda,Pedro Szekely,Craig A. Knoblock +2 more
- 28 Jan 2007
- pp 170-179
TL;DR: A novel approach is introduced that exploits the structure of the relational data source(s) to formulate a set of constraints, which are used in conjunction with partial plans to produce an intelligent query interface that does not require the user to know details about data sources or existing values.
Abstract: The magnitude of data available on the web prompts the need for an easy-to-use query interface that enables users to integrate data from multiple web sources in an intelligent fashion. Past work in the area of databases has resulted in different query interface systems that simplify query formulation. While these approaches reduce the user's effort to compose queries, the user is still required to pick the data sources to use, and the interaction is not guaranteed to yield a non-empty result set. We introduce a novel approach that exploits the structure of the relational data source(s) to formulate a set of constraints. These constraints are used in conjunction with partial plans to produce an intelligent query interface that (a) does not require the user to know details about data sources or existing values, (b) suggests valid inputs to the user, and (c) creates consistent queries that always return values.
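The idea of suggesting only valid inputs can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a single in-memory relation and ignores partial plans and multi-source integration. The `RESTAURANTS` table and the helper names `matching_rows` and `suggest_values` are hypothetical, introduced only for illustration.

```python
from typing import Dict, List, Set

Row = Dict[str, str]

# Hypothetical toy source; the table and its values are illustrative only.
RESTAURANTS: List[Row] = [
    {"city": "Los Angeles", "cuisine": "Thai", "rating": "A"},
    {"city": "Los Angeles", "cuisine": "Mexican", "rating": "B"},
    {"city": "Pasadena", "cuisine": "Thai", "rating": "B"},
]


def matching_rows(rows: List[Row], chosen: Dict[str, str]) -> List[Row]:
    """Return the rows that agree with every attribute value chosen so far."""
    return [r for r in rows if all(r.get(a) == v for a, v in chosen.items())]


def suggest_values(rows: List[Row], chosen: Dict[str, str], attribute: str) -> Set[str]:
    """Return the values of `attribute` that still co-occur with the choices.

    Any suggested value, once picked, leaves at least one matching row,
    so the query built from the choices cannot return an empty result.
    """
    return {r[attribute] for r in matching_rows(rows, chosen) if attribute in r}
```

In this sketch, a user who has already picked `city = "Pasadena"` would be offered only the cuisines that appear in Pasadena rows, so no combination of suggested inputs leads to an empty result set.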
Citations
New Horizons for a Data-Driven Economy: A Roadmap for Usage and Exploitation of Big Data in Europe
José María Cavanillas,Edward Curry,Wolfgang Wahlster +2 more
- 10 Apr 2016
TL;DR: In this article, the authors present the Big Data Opportunity, Big Data Value Chain, Usage and Exploitation of Big Data, and A Roadmap for Big Data Research for the European Commission's BIG project.
End-user programming of mashups with vegemite
James Lin,Jeffrey Wong,Jeffrey Nichols,Allen Cypher,Tessa Lau +4 more
- 08 Feb 2009
TL;DR: The CoScripter web automation tool is extended with a spreadsheet-like environment called Vegemite to automatically populate tables with information collected from various web sites, and a particular strength of this approach is its ability to augment a data set with new values computed by a web site.
Challenges of data integration and interoperability in big data
Anirudh Kadadi,Rajeev Agrawal,Christopher Nyamful,Rahman Atiq +3 more
- 01 Oct 2014
TL;DR: Data integration and data interoperability are complex challenges for organizations deploying big data architectures because of the heterogeneous nature of the data they use.
Query by example
Moshe M. Zloof
TL;DR: Recent years have seen a trend toward serving the non-professional user who has little or virtually no computer or mathematical background.
Integrating spreadsheet data via accurate and low-effort extraction
Zhe Chen,Michael Cafarella +1 more
- 24 Aug 2014
TL;DR: A two-phase semiautomatic system that extracts accurate relational metadata while minimizing user effort, based on an undirected graphical model, that enables downstream spreadsheet integration applications.
References
The Process of Retrieval from Very Long‐Term Memory
TL;DR: In this paper, the authors argue that subjects' recall of the names of their high school classmates can be understood from an information-processing analysis which interprets retrieval as a problem-solving process.
Journal Article
Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach.
TL;DR: A set of tools for extracting data from web sites and transforming it into a structured data format, such as XML, so that the resulting data can be used to build new applications without having to deal with unstructured data.
eTuner: tuning schema matching software using synthetic scenarios
Yoonkyong Lee,Mayssam Sayyadian,AnHai Doan,Arnon Rosenthal +3 more
- 25 Jan 2007
TL;DR: eTuner, an approach to automatically tune schema matching systems, is described, which produced tuned matching systems that achieve higher accuracy than using the systems with currently possible tuning methods.
Accurately and reliably extracting data from the Web: a machine learning approach
Craig A. Knoblock,Kristina Lerman,Steven Minton,Ion Muslea +3 more
- 01 Jan 2003
TL;DR: This paper developed a set of tools for extracting data from web sites and transforming it into a structured data format, such as XML, which can then be used to build new applications without having to deal with unstructured data.
Related Papers (5)
Jeffrey Wong,Jason Hong +1 more
- 29 Apr 2007
David F. Huynh,Robert C. Miller,David R. Karger +2 more
- 11 Nov 2007
S. S. Agrawal
- 01 Jan 2013
Chen Li,Edward Y. Chang +1 more
- 01 Feb 2000
Zachary G. Ives,Alon Halevy +1 more
- 01 Jan 2002