Task-Based Visual Interactive Modeling: Decision Trees and Rule-Based Classifiers
TL;DR: In this article , the authors provide an overview of available visualizations for decision trees with a focus on how visualizations differ with respect to 16 tasks and investigate the types of visual designs employed, and the quality measures presented.
read more
Abstract: Visual analytics enables the coupling of machine learning models and humans in a tightly integrated workflow, addressing various analysis tasks. Each task poses distinct demands to analysts and decision-makers. In this survey, we focus on one canonical technique for rule-based classification, namely decision tree classifiers. We provide an overview of available visualizations for decision trees with a focus on how visualizations differ with respect to 16 tasks. Further, we investigate the types of visual designs employed, and the quality measures presented. We find that (i) interactive visual analytics systems for classifier development offer a variety of visual designs, (ii) utilization tasks are sparsely covered, (iii) beyond classifier development, node-link diagrams are omnipresent, (iv) even systems designed for machine learning experts rarely feature visual representations of quality measures other than accuracy. In conclusion, we see a potential for integrating algorithmic techniques, mathematical quality measures, and tailored interactive visualizations to enable human experts to utilize their knowledge more effectively.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Towards Natural Language Interfaces for Data Visualization: A Survey.
Leixian Shen,Enya Shen,Yuyu Luo,Xiaocong Yang,Xuming Hu,Xiongshuai Zhang,Zhiwei Tai,Jianmin Wang +7 more
TL;DR: A comprehensive review of visualization-oriented natural language interfaces (V-NLI) can be found in this paper, where the following seven stages are used: query understanding, data transformation, visual mapping, view transformation, human interaction and presentation.
73
Unleashing the Potential of Conversational AI: Amplifying Chat-GPT’s Capabilities and Tackling Technical Hurdles
Vikas Hassija,Arjab Chakrabarti,Anushka Singh,Vinay Chamola,Biplab Sikdar +4 more
TL;DR: The architecture, training process, and challenges associated with large language models, including bias, interpretability, and ethics are discussed, including bias, interpretability, and ethics.
19
Visual Knowledge Discovery with Artificial Intelligence: Challenges and Future Directions
Daryna Grechyna
- 01 Jan 2022
TL;DR: In this article , the authors summarized the current research trend and provided foresight to future research direction in integrating AI/ML and visualization, starting with visualization in ML, visual analytics, visual-enabled machine learning, natural language processing, and multidimensional visualization and AI to illustrate the research trend towards visual knowledge discovery.
Explainable Artificial Intelligence for Drug Discovery and Development - A Comprehensive Survey
Roohallah Alizadehsani,Solomon Sunday Oyelere,Sadiq Hussain,Senthil Kumar Jagatheesaperumal,Renê Ripardo Calixto,Mohamed Rahouti,Mohamad Roshanzamir,Victor Hugo C. de Albuquerque +7 more
14
Towards Natural Language Interfaces for Data Visualization: A Survey
TL;DR: A comprehensive review of the existing V-NLI systems can be found in this article , where the following seven stages are used: query interpretation, data transformation, visual mapping, view transformation, human interaction, dialogue management, and presentation.
References
Random Forests
Leo Breiman
- 01 Oct 2001
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
•Journal Article
Scikit-learn: Machine Learning in Python
Fabian Pedregosa,Gaël Varoquaux,Alexandre Gramfort,Vincent Michel,Bertrand Thirion,Olivier Grisel,Mathieu Blondel,Peter Prettenhofer,Ron Weiss,Vincent Dubourg,Jake Vanderplas,Alexandre Passos,David Cournapeau,Matthieu Brucher,Matthieu Perrot,Edouard Duchesnay +15 more
TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Deep learning
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
67K
Classification and regression trees
TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weakness in two examples.
•Proceedings Article
LightGBM: a highly efficient gradient boosting decision tree
Guolin Ke,Qi Meng,Thomas Finley,Taifeng Wang,Wei Chen,Weidong Ma,Qiwei Ye,Tie-Yan Liu +7 more
- 04 Dec 2017
TL;DR: It is proved that, since the data instances with larger gradients play a more important role in the computation of information gain, GOSS can obtain quite accurate estimation of the information gain with a much smaller data size, and is called LightGBM.