Interactive data exploration with smart drill-down

doi:10.1109/ICDE.2016.7498300

Open AccessProceedings Article10.1109/ICDE.2016.7498300

Interactive data exploration with smart drill-down

Manas Joglekar, +2 more

- 01 May 2016

- Vol. 2016, pp 906-917

52

TL;DR: It is demonstrated that the underlying optimization problems are NP-HARD, and an algorithm for finding the approximately optimal list of rules to display when the user uses a smart drill-down is described.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/3299887.3299891

Data Lifecycle Challenges in Production Machine Learning: A Survey

Neoklis Polyzotis, +3 more

- 11 Dec 2018

TL;DR: Challenges in data understanding, data validation and cleaning, and data preparation are explored - how different constraints are imposed on the solutions depending on where in the lifecycle of a model the problems are encountered and who encounters them are explored.

...read moreread less

214

Proceedings Article•10.1145/3318464.3383126

Automating Exploratory Data Analysis via Machine Learning: An Overview

Tova Milo, +1 more

- 11 Jun 2020

TL;DR: This tutorial reviews recent lines of work for automating EDA, starting from recommender systems for suggesting a single exploratory action, going through kNN-based classifiers and active-learning methods for predicting users' interestingness preferences, and finally to fully automates EDA using state-of-the-art methods such as deep reinforcement learning and sequence-to-sequence models.

...read moreread less

91

•Proceedings Article•10.1145/3035918.3064013

Database Learning: Toward a Database that Becomes Smarter Every Time

Yongjoo Park, +3 more

- 09 May 2017

TL;DR: The principle of maximum entropy is exploited to produce answers, which are in expectation guaranteed to be more accurate than existing sample-based approximations and which lead to increasingly faster response times for future queries.

...read moreread less

87

Proceedings Article•10.1145/3318464.3389779

Automatically Generating Data Exploration Sessions Using Deep Reinforcement Learning

Ori Bar El, +2 more

- 11 Jun 2020

TL;DR: This work presents ATENA, a system that takes an input dataset and auto-generates a compelling exploratory session, presented in an EDA notebook, using a novel Deep Reinforcement Learning (DRL) architecture to effectively optimize the notebook generation.

...read moreread less

73

•Proceedings Article•10.1145/3035918.3064013

Database Learning: Toward a Database that Becomes Smarter Every Time

Yongjoo Park, +3 more

- 16 Mar 2017

- arXiv: Databases

TL;DR: Verdict as mentioned in this paper exploits the principle of maximum entropy to produce answers, which are in expectation guaranteed to be more accurate than existing sample-based approximations, and conducts extensive experiments on real-world query traces from a large customer of a major database vendor.

...read moreread less

59

...

Expand

References

UCI Machine Learning Repository

A. Asuncion

- 01 Jan 2007

24.3K

•Proceedings Article

Fast Algorithms for Mining Association Rules in Large Databases

Rakesh Agrawal, +1 more

- 12 Sep 1994

TL;DR: Two new algorithms for solving thii problem that are fundamentally different from the known algorithms are presented and empirical evaluation shows that these algorithms outperform theknown algorithms by factors ranging from three for small problems to more than an order of magnitude for large problems.

...read moreread less

12.6K

Journal Article•10.1145/335191.335372

Mining frequent patterns without candidate generation

Jiawei Han, +2 more

- 16 May 2000

TL;DR: This study proposes a novel frequent pattern tree (FP-tree) structure, which is an extended prefix-tree structure for storing compressed, crucial information about frequent patterns, and develops an efficient FP-tree-based mining method, FP-growth, for mining the complete set of frequent patterns by pattern fragment growth.

...read moreread less

7K

Journal Article•10.1023/A:1009726021843

Data cube: a relational aggregation operator generalizing GROUP-BY, CROSS-TAB, and SUB-TOTALS

Jim Gray, +3 more

- 26 Feb 1996

TL;DR: The data cube operator as discussed by the authors generalizes the histogram, cross-tabulation, roll-up, drill-down, and sub-total constructs found in most report writers.

...read moreread less

2.3K

•Journal Article•10.1145/3147.3165

Random sampling with a reservoir

Jeffrey Scott Vitter

- 01 Mar 1985

- ACM Transactions on Mathematical Softwar...

TL;DR: Theoretical and empirical results indicate that Algorithm Z outperforms current methods by a significant margin, and an efficient Pascal-like implementation is given that incorporates these modifications and that is suitable for general use.

...read moreread less

2K

...

Expand

Interactive data exploration with smart drill-down

Chat with Paper

AI Agents for this Paper

Citations

Data Lifecycle Challenges in Production Machine Learning: A Survey

Automating Exploratory Data Analysis via Machine Learning: An Overview

Database Learning: Toward a Database that Becomes Smarter Every Time

Automatically Generating Data Exploration Sessions Using Deep Reinforcement Learning

Database Learning: Toward a Database that Becomes Smarter Every Time

References

UCI Machine Learning Repository

Fast Algorithms for Mining Association Rules in Large Databases

Mining frequent patterns without candidate generation

Data cube: a relational aggregation operator generalizing GROUP-BY, CROSS-TAB, and SUB-TOTALS

Random sampling with a reservoir

Related Papers (5)

Interactive Data Exploration with Smart Drill-Down

SeeDB: efficient data-driven visualization recommendations to support visual analytics

Data cube: a relational aggregation operator generalizing GROUP-BY, CROSS-TAB, and SUB-TOTALS

Intelligent Rollups in Multidimensional OLAP Data

Discovery-Driven Exploration of OLAP Data Cubes