Citations
A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives and opportunities
TL;DR: The performance of 14 different bagging and boosting based ensembles, including XGBoost, LightGBM and Random Forest, is empirically analyzed in terms of predictive capability and efficiency.
397
An Introduction to Machine Learning.
Solveig Badillo, Balazs Banfai, Fabian Birzele, Iakov I. Davydov, Lucy Hutchinson, Tony Kam-Thong, Juliane Siebourg-Polster, Bernhard Steiert, Jitao David Zhang, and 8 more
TL;DR: The foundational ideas of ML are introduced to this community so that readers obtain the essential tools they need to understand publications on the topic and to put applications of ML in molecular biology, pharmacometrics, and clinical pharmacology into perspective.
359
Prediction and behavioral analysis of travel mode choice: A comparison of machine learning and logit models
TL;DR: The best-performing machine-learning model, random forest, has significantly higher predictive accuracy than multinomial logit and mixed logit models, but it produces behaviorally unreasonable arc elasticities and marginal effects when these behavioral outputs are computed with a standard approach.
297
A Survey on Causal Inference
TL;DR: This article presents a comprehensive review of causal inference methods under the potential outcome framework, one of the well-known causal inference frameworks, and discusses and compares both traditional statistical methods and recent machine-learning-enhanced methods.
282
Evaluating time series forecasting models: An empirical study on performance estimation methods
TL;DR: This paper compares different variants of cross-validation and of out-of-sample approaches using two case studies, one with 62 real-world time series and another with three synthetic time series, and shows noticeable differences between the performance estimation methods in the two scenarios.
200