Proceedings Article10.1109/ICDE51399.2021.00174
Joint Index, Sorting, and Compression Optimization for Memory-Efficient Spatio-Temporal Data Management
Keven Richly,Rainer Schlosser,Martin Boissier +2 more
- 01 Apr 2021
- pp 1901-1906
11
TL;DR: In this paper, a linear programming approach is presented to determine fine-grained configuration decisions for spatio-temporal workloads by dividing the data into partitions of fixed size, and applying the compression, sorting, and index selections on a finegrained level to reflect spatiotemporal access patterns.
read more
Abstract: The wide distribution of location-acquisition technologies has led to large volumes of spatio-temporal data, which are the foundation for a broad spectrum of applications. Based on these applications’ performance requirements, in-memory databases are used to store and process the data. As DRAM capacities are limited and expensive, modern database systems apply various configuration optimizations (e.g., compression) to reduce the memory footprint. The selection of cost and performance balancing configurations is challenging due to the vast amount of possible setups consisting of mutually dependent individual decisions. In this paper, we present a linear programming approach to determine fine-grained configuration decisions for spatio-temporal workloads. By dividing the data into partitions of fixed size, we can apply the compression, sorting, and index selections on a fine-grained level to reflect spatiotemporal access patterns. Our approach jointly optimizes these configurations to maximize performance under a given memory budget. We demonstrate on a real-world dataset that models specifically optimized for spatio-temporal data characteristics allow us to reduce the memory footprint (up to 60% by equal performance) and increase the performance (up to 80% by equal memory size) compared to established rule-based heuristics.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Robust Index Selection for Stochastic Dynamic Workloads
TL;DR: In this article , the authors consider index selection problems accounting for non-standard features, such as multiple potential workloads, different risk-averse objectives, multi-index configurations, reconfiguration costs, and anticipation of dynamic workload scenarios.
Budget-Conscious Fine-Grained Configuration Optimization for Spatio-Temporal Applications
TL;DR: In this paper , the authors proposed different linear programming (LP) models addressing cost dependencies at different levels of accuracy to compute optimized tuning configurations for a given workload and memory budgets, and extended their LP-based approach to incorporate reconfiguration costs as well as a worst-case optimization for potential workload scenarios.
2
Design and Implementation of Key-Value Database for Ship Virtual Test Platform Based on Distributed System
Qingyu Meng,Kejia Zhang,Haiwei Pan,Maocai Yuan,Baoying Ma +4 more
TL;DR: This paper proposes a Key-Value database for a ship virtual test platform, utilizing a distributed system to manage heterogeneous data types, with optimized I/O bandwidth and CPU resource utilization, and a multimodal data fusion mechanism for efficient data analysis.
2
•Proceedings Article
Evaluating Lightweight Integer Compression Algorithms in Column-Oriented In-Memory DBMS.
Linus Heinzl,Ben Hurdelhey,Martin Boissier,Michael Perscheid,Hasso Plattner +4 more
- 01 Jan 2021
Enterprise Platform and Integration Concepts Research at HPI
TL;DR: The Hasso Plattner Institute (HPI) as mentioned in this paper is an independent Faculty of Digital Engineering at the University of Potsdam, Germany that unites computer science research and teaching with the advantages of a privately financed institute and a tuition-free study program.
References
Integrating compression and execution in column-oriented database systems
Daniel J. Abadi,Samuel Madden,Miguel Ferreira +2 more
- 27 Jun 2006
TL;DR: This paper shows how compression schemes not traditionally used in row-oriented DBMSs can be applied to column-oriented systems and evaluates a set of compression schemes and shows that the best scheme depends not only on the properties of the data but also on the nature of the query workload.
Automatic Database Management System Tuning Through Large-scale Machine Learning
Dana Van Aken,Andrew Pavlo,Geoffrey J. Gordon,Bohan Zhang +3 more
- 09 May 2017
TL;DR: An automated approach that leverages past experience and collects new information to tune DBMS configurations and recommends configurations that are as good as or better than ones generated by existing tools or a human expert is presented.
602
DB2 design advisor: integrated automatic physical database design
Daniel C. Zilio,Jun Rao,Sam Lightstone,Guy M. Lohman,Adam J. Storm,Christian Garcia-Arellano,Scott Fadden +6 more
- 31 Aug 2004
TL;DR: The DB2 Design Advisor in IBM DB2® Universal DatabaseTM (DB2 UDB) Version 8.2 for Linux®, UNIX® and Windows® is a tool that, for a given workload, automatically recommends physical design features that are any subset of indexes, materialized query tables (also called materialized views), shared-nothing database partitionings, and multidimensional clustering of tables.
DB2 advisor: an optimizer smart enough to recommend its own indexes
Gary Valentin,M. Zuliani,Daniel C. Zilio,Guy M. Lohman,Alan Skelley +4 more
- 28 Feb 2000
TL;DR: In this paper, the DB2 optimizer is used to tackle the index selection problem, a variation of the knapsack problem, and a user interface for index recommendation is presented.
CoPhy: a scalable, portable, and interactive index advisor for large workloads
Debabrata Dash,Neoklis Polyzotis,Anastasia Ailamaki +2 more
- 01 Mar 2011
TL;DR: This work is the first to reveal that the index tuning problem has a well structured space of solutions, and this space can be explored efficiently with well known techniques from linear optimization.