Journal Article10.1145/319382.319388
Machine learning and data mining
681
TL;DR: The eld of data mining addresses the question of how best to use this historical data to discover general regularities and to improve future decisions.
read more
Abstract: Over the past decade many organizations have begun to routinely capture huge volumes of historical data describing their operations, their products, and their customers. At the same time, scientists and engineers in many elds nd themselves capturing increasingly complex experimental datasets, such as the gigabytes of functional MRI data that describe brain activity in humans. The eld of data mining addresses the question of how best to use this historical data to discover general regularities and to improve future decisions.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Machine learning
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
An efficient algorithm for large-scale detection of protein families
TL;DR: This work presents a novel approach called TRIBE-MCL for rapid and accurate clustering of protein sequences into families based on precomputed sequence similarity information that has been rigorously tested and validated on a number of very large databases.
3.9K
Web mining research: a survey
Raymond Kosala,Hendrik Blockeel +1 more
TL;DR: This paper surveys the research in the area of Web mining, point out some confusions regarded the usage of the term Web mining and suggest three Web mining categories, which are then situate some of the research with respect to these three categories.
Rough sets theory for multicriteria decision analysis
TL;DR: The original rough set approach proved to be very useful in dealing with inconsistency problems following from information granulation, but is failing when preference-orders of attribute domains (criteria) are to be taken into account and it cannot handle inconsistencies following from violation of the dominance principle.
1.7K
Materials discovery and design using machine learning
TL;DR: In this paper, the typical mode of and basic procedures for applying machine learning in materials science are outlined and compared, and the current research status is reviewed with regard to applications of ML in material property prediction, in new materials discovery and for other purposes.
1K
References
•Book
C4.5: Programs for Machine Learning
J. Ross Quinlan
- 15 Oct 1992
TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
27.2K
Machine learning
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Programs for Machine Learning
Steven L. Salzberg,Alberto Segre +1 more
- 01 Jan 1994
TL;DR: In his new book, C4.5: Programs for Machine Learning, Quinlan has put together a definitive, much needed description of his complete system, including the latest developments, which will be a welcome addition to the library of many researchers and students.
Learning Bayesian Networks: The Combination of Knowledge and Statistical Data
TL;DR: In this article, a Bayesian approach for learning Bayesian networks from a combination of prior knowledge and statistical data is presented, which is derived from a set of assumptions made previously as well as the assumption of likelihood equivalence, which says that data should not help to discriminate network structures that represent the same assertions of conditional independence.
Data cube: a relational aggregation operator generalizing GROUP-BY, CROSS-TAB, and SUB-TOTALS
Jim Gray,A. Bosworth,A. Lyaman,Hamid Pirahesh +3 more
- 26 Feb 1996
TL;DR: The data cube operator as discussed by the authors generalizes the histogram, cross-tabulation, roll-up, drill-down, and sub-total constructs found in most report writers.
Related Papers (5)
J. Ross Quinlan
- 15 Oct 1992
Jiawei Han,Micheline Kamber,Jian Pei +2 more
- 08 Sep 2000
[...]
Leo Breiman
- 01 Oct 2001
Ian H. Witten,Eibe Frank,Mark Hall +2 more
- 25 Oct 1999