A data mining toolset for distributed high- performance platforms
TL;DR: This paper presents an integrated toolset named VEGA (Visual Environment for Grid Applications), which allows a Knowledge Grid user to develop and execute distributed data mining computations in a simple and effective way.
read more
Abstract: Today a large number of scientific and commercial applications often require to analyse large data sets maintained over geographically distributed sites by using the computational power of distributed high-performance environments. Advances in networking technology and computational infrastructure made it possible to construct large-scale distributed computing platforms, called computational grids, that provide dependable, consistent, and pervasive access to high-end computational resources. Grids can play a significant role in providing an effective computational support for distributed data mining applications. Currently we are developing a software system for geographically distributed knowledge discovery applications called KNOWLEDGE GRID,which is designed on top of computational grid mechanisms, provided by grid environments such as Glob us. In this paper we present an integrated toolset named VEGA (Visual Environment for Grid Applications), which allows a Knowledge Grid user to develop and execute distributed data mining computations in a simple and effective way.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Distributed data mining on grids: services, tools, and applications
Mario Cannataro,Antonio Congiusta,Andrea Pugliese,Domenico Talia,Paolo Trunfio +4 more
- 01 Dec 2004
TL;DR: The paper discusses how to design and implement data mining applications by using the KNOWLEDGE GRID tools starting from searching grid resources, composing software and data components, and executing the resulting data mining process on a grid.
Semantics and knowledge grids: building the next-generation grid
Mario Cannataro,Domenico Talia +1 more
TL;DR: This work proposes a comprehensive software architecture for the next-generation grid, which integrates currently available services and components in Semantic Web, Semantic Grid, P2P, and ubiquitous systems.
Distributed data mining services leveraging WSRF
Antonio Congiusta,Domenico Talia,Paolo Trunfio +2 more
- 01 Jan 2007
TL;DR: Design aspects and implementation choices involved in the re-designed and re-implemented Grid Services, a high-level framework providing Grid-based knowledge discovery tools and services, are highlighted.
54
Proteus, a Grid based Problem Solving Environment for Bioinformatics: Architecture and Experiments
Mario Cannataro,Carmela Comito,Filippo Lo Schiavo,Pierangelo Veltri +3 more
- 01 Jan 2004
TL;DR: Considering bioinformatics requirements, PROTEUS is presented, a Grid-based Problem Solving Environment for bioinformics applications that uses ontology to enhance composition of bioInformatics applications.
Grid-Based Data Mining and Knowledge Discovery
Mario Cannataro,Antonio Congiusta,Carlo Mastroianni,Andrea Pugliese,Domenico Talia,Paolo Trunfio +5 more
- 01 Jan 2004
TL;DR: The chapter presents also how to design and implement distributed data mining applications by using the Knowledge Grid tools starting from searching Grid resources, composing software and data elements, and executing the resulting application on a Grid.
25
References
Globus: a Metacomputing Infrastructure Toolkit
Ian Foster,Carl Kesselman +1 more
- 01 Jun 1997
TL;DR: The Globus system is intended to achieve a vertically integrated treatment of application, middleware, and net work, an integrated set of higher level services that enable applications to adapt to heteroge neous and dynamically changing metacomputing environ ments.
•Book
Advances in Distributed and Parallel Knowledge Discovery
Hillol Kargupta,Philip K. Chan +1 more
- 01 Sep 2000
TL;DR: This book presents introductions to DKD and PKD, extensive reviews of the field, and state-of-the-art techniques that address the problem of extracting interesting associations, classifiers, clusters, and other patterns from data.
342
KNOWLEDGE GRID: High Performance Knowledge Discovery Services on the Grid
Mario Cannataro,Domenico Talia,Paolo Trunfio +2 more
- 12 Nov 2001
TL;DR: A software architecture for parallel and distributed knowledge discovery (PDKD) systems that is built on top of computational grid services that provide dependable, consistent, and pervasive access to high-end computational resources.
50