Journal Article10.14778/1920841.1920984
Xplus: a SQL-tuning-aware query optimizer
Herodotos Herodotou,Shivnath Babu +1 more
- 01 Sep 2010
- Vol. 3, Iss: 1, pp 1149-1160
TL;DR: Xplus is designed, implements, and evaluates Xplus, which, to this knowledge, is the first query optimizer to provide this feature, and shows the effectiveness of Xplus on real-life tuning scenarios created using TPC-H queries on a PostgreSQL database.
read more
Abstract: The need to improve a suboptimal execution plan picked by the query optimizer for a repeatedly run SQL query arises routinely. Complex expressions, skewed or correlated data, and changing conditions can cause the optimizer to make mistakes. For example, the optimizer may pick a poor join order, overlook an important index, use a nested-loop join when a hash join would have done better, or cause an expensive, but avoidable, sort to happen. SQL tuning is also needed while tuning multi-tier services to meet service-level objectives. The difficulty of SQL tuning can be lessened considerably if users and higher-level tuning tools can tell the optimizer: "I am not satisfied with the performance of the plan p being used for the query Q that runs repeatedly. Can you generate a (δ%) better plan?" This paper designs, implements, and evaluates Xplus which, to our knowledge, is the first query optimizer to provide this feature. Xplus goes beyond the traditional plan-first-execute-next approach: Xplus runs some (sub)plans proactively, collects monitoring data from the runs, and iterates. A nontrivial challenge is in choosing a small set of plans to run. Xplus guides this process efficiently using an extensible architecture comprising SQL-tuning experts with different goals, and a policy to arbitrate among the experts. We show the effectiveness of Xplus on real-life tuning scenarios created using TPC-H queries on a PostgreSQL database.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Proceedings Article
Starfish: A Self-tuning System for Big Data Analytics.
Herodotos Herodotou,Harold Lim,Gang Luo,Nedyalko Borisov,Liang Dong,Fatma Bilgen Cetin,Shivnath Babu +6 more
- 01 Jan 2011
TL;DR: Starfish is introduced, a self-tuning system for big data analytics that builds on Hadoop while adapting to user needs and system workloads to provide good performance automatically, without any need for users to understand and manipulate the many tuning knobs in Hadoops.
DBSherlock: A Performance Diagnostic Tool for Transactional Databases
Dong Young Yoon,Ning Niu,Barzan Mozafari +2 more
- 26 Jun 2016
TL;DR: A practical tool for assisting DBAs in quickly and reliably diagnosing performance problems in an OLTP database is presented, which is substantially more accurate than the state-of-the-art algorithm in finding correct explanations.
87
PerfXplain: debugging MapReduce job performance
Nodira Khoussainova,Magdalena Balazinska,Dan Suciu +2 more
- 01 Mar 2012
TL;DR: PerfXplain provides a new query language for articulating performance queries and an algorithm for generating explanations from a log of past MapReduce job executions, based on techniques related to decision-tree building.
•Book
Massively Parallel Databases and Mapreduce Systems
Shivnath Babu,Herodotos Herodotou +1 more
- 13 Nov 2013
TL;DR: This monograph covers the design principles and core features of systems for analyzing very large datasets using massively-parallel computation and storage techniques on large clusters of nodes.
•Proceedings Article
Determining Essential Statistics for Cost Based Optimization of an ETL Workflow
Ramanujam Halasipuram,Prasad M. Deshpande,Sriram Padmanabhan +2 more
- 01 Jan 2014
TL;DR: This paper proposes an optimization framework to choose a set of statistics to collect for a given workflow, using which the optimizer can estimate the cost of any alternative plan for the workflow, and experimentally demonstrates the effective and efficiency of the proposed algorithms.
References
Access path selection in a relational database management system
P. Griffiths Selinger,Morton M. Astrahan,Donald D. Chamberlin,Raymond A. Lorie,T. G. Price +4 more
- 30 May 1979
TL;DR: System R as mentioned in this paper is an experimental database management system developed to carry out research on the relational model of data, which chooses access paths for both simple (single relation) and complex queries (such as joins), given a user specification of desired data as a boolean expression of predicates.
Eddies: continuously adaptive query processing
Ron Avnur,Joseph M. Hellerstein +1 more
- 16 May 2000
TL;DR: This paper introduces a query processing mechanism called an eddy, which continuously reorders operators in a query plan as it runs, and describes the moments of symmetry during which pipelined joins can be easily reordered, and the synchronization barriers that require inputs from different sources to be coordinated.
The EXODUS optimizer generator
Goetz Graefe,David J. DeWitt +1 more
- 01 Dec 1987
TL;DR: In this paper, the authors present a query optimizer for the EXODUS extensible database system, which transforms query trees and selects methods for executing operations according to cost functions associated with the methods.
Tuning database configuration parameters with iTuned
Songyun Duan,Vamsidhar Thummala,Shivnath Babu +2 more
- 01 Aug 2009
TL;DR: ITuned is described, a tool that automates the task of identifying good settings for database configuration parameters and has three novel features: a technique called Adaptive Sampling that proactively brings in appropriate data through planned experiments to find high-impact parameters and high-performance parameter settings.
Robust query processing through progressive optimization
Volker Markl,Vijayshankar Raman,David E. Simmen,Guy M. Lohman,Hamid Pirahesh,Miso Cilimdzic +5 more
- 13 Jun 2004
TL;DR: This work presents an approach to query processing that is extremely robust because it is able to detect and recover from cardinality estimation errors, and calls this approach "progressive query optimization" (POP).
Related Papers (5)
Surajit Chaudhuri,Vivek Narasayya,Ravi Ramamurthy +2 more
- 01 Aug 2008
Shivnath Babu
- 10 Jun 2010
Allison Lee,Mohamed Zait +1 more
- 01 Aug 2008