PerfExplorer: A Performance Data Mining Framework For Large-Scale Parallel Computing

doi:10.1109/SC.2005.55

Open AccessProceedings Article10.1109/SC.2005.55

PerfExplorer: A Performance Data Mining Framework For Large-Scale Parallel Computing

Kevin Huck, +1 more

- 12 Nov 2005

- pp 41-41

116

TL;DR: The framework architecture enables the development and integration of data mining operations that will be applied to large-scale parallel performance profiles and is built on a robust parallel performance database (PerfDMF) to access the parallel profiles and save its analysis results.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article

When is nearest neighbor meaningful

Kevin S. Beyer, +3 more

- 01 Jan 1999

- Lecture Notes in Computer Science

TL;DR: In this article, the authors explore the effect of dimensionality on the nearest neighbor problem and show that under a broad set of conditions (much broader than independent and identically distributed dimensions), as dimensionality increases, the distance to the nearest data point approaches the distance of the farthest data point.

...read moreread less

1.9K

•Journal Article•10.1145/3314107

A Survey of Parallel Sequential Pattern Mining

Wensheng Gan, +4 more

- 07 Jun 2019

- ACM Transactions on Knowledge Discovery ...

TL;DR: An in-depth survey of the current status of parallel SPM (PSPM) is investigated and provided, including detailed categorization of traditional serial SPM approaches, and state-of-the art PSPM.

...read moreread less

290

Patent

Frequent Pattern Mining

Shi Han, +3 more

- 27 Apr 2011

TL;DR: This comprehensive reference consists of 18 chapters from prominent researchers in the field of frequent pattern mining, and contains a survey describing key research on the topic, a case study and future directions.

...read moreread less

129

•Proceedings Article•10.5555/3014904.3014967

Caliper: performance introspection for HPC software stacks

David Boehme, +7 more

- 13 Nov 2016

TL;DR: With Caliper, a general abstraction layer is developed to provide performance data collection as a service to applications, runtime systems, libraries, and tools that allows them to share performance data across software stack boundaries.

...read moreread less

86

•Journal Article•10.1145/3314107

A Survey of Parallel Sequential Pattern Mining

Wensheng Gan, +4 more

- 26 May 2018

- arXiv: Databases

TL;DR: In this paper, an in-depth survey of the current status of parallel sequential pattern mining (PSPM) is investigated and provided, including detailed categorization of traditional serial SPM approaches, and state of the art parallel SPM.

...read moreread less

42

...

Expand

References

•Book Chapter•10.1007/3-540-49257-7_15

When Is ''Nearest Neighbor'' Meaningful?

Kevin S. Beyer, +3 more

- 10 Jan 1999

TL;DR: The effect of dimensionality on the "nearest neighbor" problem is explored, and it is shown that under a broad set of conditions, as dimensionality increases, the Distance to the nearest data point approaches the distance to the farthest data point.

...read moreread less

2.5K

•Journal Article

When is nearest neighbor meaningful

Kevin S. Beyer, +3 more

- 01 Jan 1999

- Lecture Notes in Computer Science

TL;DR: In this article, the authors explore the effect of dimensionality on the nearest neighbor problem and show that under a broad set of conditions (much broader than independent and identically distributed dimensions), as dimensionality increases, the distance to the nearest data point approaches the distance of the farthest data point.

...read moreread less

1.9K

Proceedings Article•10.1145/605397.605403

Automatically characterizing large scale program behavior

Timothy Sherwood, +3 more

- 01 Oct 2002

TL;DR: This work quantifies the effectiveness of Basic Block Vectors in capturing program behavior across several different architectural metrics, explores the large scale behavior of several programs, and develops a set of algorithms based on clustering capable of analyzing this behavior.

...read moreread less

1.7K

•Journal Article•10.1177/1094342006064482

The Tau Parallel Performance System

Sameer Shende, +1 more

- 01 May 2006

TL;DR: This paper presents the TAU (Tuning and Analysis Utilities) parallel performance sytem and describes how it addresses diverse requirements for performance observation and analysis.

...read moreread less

1.2K

•Posted Content

Experiments with Random Projection

Sanjoy Dasgupta

- 16 Jan 2013

- arXiv: Learning

TL;DR: Results of random projection as a promising dimensionality reduction technique for learning mixtures of Gaussians are summarized by a wide variety of experiments on synthetic and real data.

...read moreread less

330