Mining knowledge in astrophysical massive data sets
TL;DR: The present status of the MDS in the Virtual Observatory and what is currently done and planned to bring advanced data mining methodologies in the case of the DAME (DAta Mining and Exploration) project are summarized.
read more
Abstract: Modern scientific data mainly consist of huge data sets gathered by a very large number of techniques and stored in much diversified and often incompatible data repositories. More in general, in the e-science environment, it is considered as a critical and urgent requirement to integrate services across distributed, heterogeneous, dynamic “virtual organizations” formed by different resources within a single enterprise. In the last decade, Astronomy has become an immensely data-rich field due to the evolution of detectors (plates to digital to mosaics), telescopes and space instruments. The Virtual Observatory approach consists of the federation under common standards of all astronomical archives available worldwide, as well as data analysis, data mining and data exploration applications. The main drive behind such an effort is that once the infrastructure is complete, it will allow a new type of multi-wavelength, multi-epoch science, which can only be barely imagined. Data mining, or knowledge discovery in databases, while being the main methodology to extract the scientific information contained in such Massive Data Sets (MDS), poses crucial problems since it has to orchestrate complex problems posed by transparent access to different computing environments, scalability of algorithms, reusability of resources, etc. In the present paper we summarize the present status of the MDS in the Virtual Observatory and what is currently done and planned to bring advanced data mining methodologies in the case of the DAME (DAta Mining and Exploration) project.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
How Predicting The Academic Success of Students of the ESPAM MFL?: A Preliminary Decision Trees Based Study
Jessica Morales Carrillo,Jorge Parraga-Alava +1 more
- 01 Oct 2018
TL;DR: This paper determines academic success of students of the ESPAM MFL through decision trees based algorithms as a preliminary approach using three built classifiers: C5.0, Random Forest and CART which are applied on a dataset with 1086 instances corresponding to personal and academic information about professionalizing subjects of students from the Computer Science Career.
10
DAME: A Web Oriented Infrastructure for Scientific Data Mining & Exploration
Massimo Brescia,Giuseppe Longo,S. George Djorgovski,Stefano Cavuoti,Raffaele D'Abrusco,Ciro Donalek,Alessandro Di Guido,Michelangelo Fiore,Mauro Garofalo,Omar Laurino,Ashish Mahabal,Francesco Manna,Alfonso Nocella,Giovanni D'Angelo,Maurizio Paolillo +14 more
- 01 Nov 2010
TL;DR: DAME (DAta Mining & Exploration) is an innovative, general purpose, Web-based, VObs compliant, distributed data mining infrastructure specialized in Massive Data Sets exploration with machine learning methods.
9
•Book
Data-Rich Astronomy: Mining Synoptic Sky Surveys
Stefano Cavuoti
- 28 Jan 2015
TL;DR: In the last decade a new generation of telescopes and sensors has allowed the production of a very large amount of data and astronomy has become, a data-rich science; this transition is often labeled as: "data revolution" and "data tsunami" as discussed by the authors.
5
Automatic Identification of Hexagonal Pattern Artifacts in Radio Astronomical Surveys
Dina Said,Jeroen Stil,Russ Taylor,Ken Barker +3 more
- 07 Dec 2013
TL;DR: This research presents the first algorithm for Automatic Identification of HPA in RAS (AIHR), which will dramatically reduce the amount of time and effort invested to manually discover HPAs.
•Posted Content
DAME: A Web Oriented Infrastructure for Scientific Data Mining & Exploration
Massimo Brescia,Giuseppe Longo,S. George Djorgovski,Stefano Cavuoti,Raffaele D'Abrusco,Ciro Donalek,Alessandro Di Guido,Michelangelo Fiore,Mauro Garofalo,Omar Laurino,Ashish Mahabal,Francesco Manna,Alfonso Nocella,Giovanni D'Angelo,Maurizio Paolillo +14 more
TL;DR: The DAME (DAta Mining & Exploration) project as mentioned in this paper is an innovative, general purpose, web-based, VObs compliant, distributed data mining infrastructure specialized in Massive Data Sets exploration with machine learning methods.
References
•Posted Content
Astrophysics in S.Co.P.E
Massimo Brescia,Stefano Cavuoti,Giovanni D'Angelo,Raffaele D'Abrusco,Ciro Donalek,Natalia Deniskina,Omar Laurino,Giuseppe Longo +7 more
TL;DR: The most significant results obtained in the first two years of the project are summarized and related to the development of middleware and Data Mining tools for the Virtual Observatory.
6
Astrophysics in S.Co.P.E..
Massimo Brescia,Stefano Cavuoti,Giovanni D'Angelo,Raffaele D'Abrusco,Ciro Donalek,Natalia Deniskina,Omar Laurino,Giuseppe Longo +7 more
- 01 Jan 2008
TL;DR: S.Co.P.E. as discussed by the authors is one of the four projects funded by the Italian Government in order to provide Southern Italy with a distributed computing infrastructure for fundamental science, including astrophysics and observational cosmology.
•Posted Content
The DAME/VO-Neural Infrastructure: an Integrated Data Mining System Support for the Science Community
Massimo Brescia,Anna Corazza,Stefano Cavuoti,Giovanni D'Angelo,Raffaele D'Abrusco,Ciro Donalek,S. George Djorgovski,Natalia Deniskina,Michelangelo Fiore,Mauro Garofalo,Omar Laurino,Giuseppe Longo,Ashish Mahabal,Francesco Manna,Alfonso Nocella,B. Skordovski +15 more
TL;DR: The result of the DAME/VO-Neural project effort will be a service-oriented architecture, obtained by using appropriate standards and incorporating Grid paradigms and restful Web services frameworks where needed, that will have as main target the integration of interdisciplinary distributed systems within and across organizational domains.
2
The Virtual Observatory in Italy: status and prospect.
Riccardo Smareglia,Fabio Pasian,Ugo Becciani,Giovanni Longo,A. Prete Martinez,Alfredo Volpicelli +5 more
- 01 Jan 2006
TL;DR: In this paper, a typical theme to face in this attempt is the need to preserve the patrimony inherent in the acquired data, which must be acquired, described, processed and preserved in order to make possible their reuse from the future generations of researchers and scientific programs whose purposes can be different from those that have originally pushed the acquisition of.
The DAME/VO-Neural Infrastructure: an Integrated Data Mining System Support for the Science Community
M. Brescia,A. Corazza,S. Cavuoti,G. d'Angelo,R. D'Abrusco,C. Donalek,S. G. Djorgovski,N. Deniskina,M. Fiore,M. Garofalo,O. Laurino,G. Longo A. Mahabal,F. Manna,A. Nocella,B. Skordovski +14 more
TL;DR: The DAME/VO-Neural project develops a distributed e-infrastructure for data mining and exploration in massive astronomical datasets, integrating heterogeneous data repositories and services across virtual organizations using Grid paradigms and IVOA standards.