Journal Article10.1142/S0218213001000726
Visualization support for user-centered model selection in knowledge discovery and data mining
18
TL;DR: The knowledge discovery system D2MS is introduced in which several visualization techniques of data and knowledge are developed and integrated into the steps of the knowledge discovery process in order to support the participation of the user.
read more
Abstract: The problem of model selection in knowledge discovery and data mining—the selection of appropriate discovered patterns/models or algorithms to achieve such patterns/models—is generally a difficult task for the user as it requires meta-knowledge on algorithms/models and model performance metrics. Viewing knowledge discovery as a human-centered process that requires an effective collaboration between the user and the discovery system, our work aims to make model selection in knowledge discovery easier and more effective. For such a collaboration, our solution is to give the user the ability to try easily various alternatives and to compare competing models quantitatively and qualitatively. The basic idea of our solution is to integrate data and knowledge visualization with the knowledge discovery process in order to the support the participation of the user. We introduce the knowledge discovery system D2MS in which several visualization techniques of data and knowledge are developed and integrated into the steps of the knowledge discovery process. The visualizers in D2MS greatly help the user gain better insight in each step of the knowledge discovery process as well the relationship between data and discovered knowledge in the whole process.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Stride Period Adaptation for a Biomimetic Running Hexapod
Jonathan K. Karpick,Jorge G. Cham,Jonathan E. Clark,Mark R. Cutkosky +3 more
- 01 Jan 2003
TL;DR: An adaptation strategy for adjusting the stride period in a hexapedal running robot based on measurements of ground contact timing obtained from binary sensors on the robot’s feet is described.
82
Mining hepatitis data with temporal abstraction
Tu Bao Ho,Trong Dung Nguyen,Saori Kawasaki,Si Quang Le,Dung Duc Nguyen,Hideto Yokoi,Katsuhiko Takabayashi +6 more
- 24 Aug 2003
TL;DR: This paper presents a temporal abstraction approach to mining knowledge from this hepatitis database, exploiting hepatitis background knowledge and data analysis and introduces new notions and methods for abstracting short-term changed and long- term changed tests.
Research on Visualization Techniques in Data Mining
Hailiang Jin,Huijie Liu +1 more
- 28 Dec 2009
TL;DR: Current visualization methods applied in data mining are summarized and trends are clarified based on the task and object of visual data mining.
20
Knowledge visualization in hepatitis study
DucDung Nguyen,Tu Bao Ho,Saori Kawasaki +2 more
- 01 Jan 2006
TL;DR: This work presents some developed tools integrated in the data mining system D2MS for appropriately visualizing knowledge, and their usage in hepatitis study, emphasizing on the two rule visualizers, one for individual rule and the other for rule in its relations with the others.
•Journal Article
Temporal abstraction and data mining with visualization of laboratory data.
Katsuhiko Takabayashi,Tu Bao Ho,Hideto Yokoi,Trong Dung Nguyen,Saori Kawasaki,Si Quang Le,Takahiro Suzuki,Osamu Yokosuka +7 more
TL;DR: In the course of evaluating the results by domain experts, even though there were not so remarkable hypotheses, visualization tools made it easier for them to understand the relations of the complicated rules.
References
•Book
C4.5: Programs for Machine Learning
J. Ross Quinlan
- 15 Oct 1992
TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
27.2K
•Proceedings Article
A study of cross-validation and bootstrap for accuracy estimation and model selection
Ron Kohavi
- 20 Aug 1995
TL;DR: The results indicate that for real-word datasets similar to the authors', the best method to use for model selection is ten fold stratified cross validation even if computation power allows using more folds.
•Proceedings Article
Integrating classification and association rule mining
Bing Liu,Wynne Hsu,Yiming Ma +2 more
- 27 Aug 1998
TL;DR: The integration is done by focusing on mining a special subset of association rules, called class association rules (CARs), and shows that the classifier built this way is more accurate than that produced by the state-of-the-art classification system C4.5.
•Book
Feature Selection for Knowledge Discovery and Data Mining
Huan Liu,Hiroshi Motoda +1 more
- 31 Jul 1998
TL;DR: Feature Selection for Knowledge Discovery and Data Mining offers an overview of the methods developed since the 1970's and provides a general framework in order to examine these methods and categorize them and suggests guidelines for how to use different methods under various circumstances.
2.2K
Tidier Drawings of Trees
Edward M. Reingold,J.S. Tilford +1 more
TL;DR: It is shown that various algorithms for producing tidy drawings of trees contain some difficulties that lead to aesthetically unpleasing, wider than necessary drawings, and a new algorithm is presented with comparable time and storage requirements that produces tidier drawings.
575