Enhanced Preprocessing Approach Using Ensemble Machine Learning Algorithms for Detecting Liver Disease
Abdul Quadir Md,S. Kulkarni,Christy Jackson Joshua,T. Vaichole,Senthilkumar Mohan,Celestine Iwendi +5 more
TL;DR: In this paper, the authors proposed a novel architecture based on ensemble learning and enhanced preprocessing to predict liver disease using the Indian Liver Patient Dataset (ILPD), and their results are compared to those obtained with existing studies.
read more
Abstract: There has been a sharp increase in liver disease globally, and many people are dying without even knowing that they have it. As a result of its limited symptoms, it is extremely difficult to detect liver disease until the very last stage. In the event of early detection, patients can begin treatment earlier, thereby saving their lives. It has become increasingly popular to use ensemble learning algorithms since they perform better than traditional machine learning algorithms. In this context, this paper proposes a novel architecture based on ensemble learning and enhanced preprocessing to predict liver disease using the Indian Liver Patient Dataset (ILPD). Six ensemble learning algorithms are applied to the ILPD, and their results are compared to those obtained with existing studies. The proposed model uses several data preprocessing methods, such as data balancing, feature scaling, and feature selection, to improve the accuracy with appropriate imputations. Multivariate imputation is applied to fill in missing values. On skewed columns, log1p transformation was applied, along with standardization, min–max scaling, maximum absolute scaling, and robust scaling techniques. The selection of features is carried out based on several methods including univariate selection, feature importance, and correlation matrix. These enhanced preprocessed data are trained on Gradient boosting, XGBoost, Bagging, Random Forest, Extra Tree, and Stacking ensemble learning algorithms. The results of the six models were compared with each other, as well as with the models used in other research works. The proposed model using extra tree classifier and random forest, outperformed the other methods with the highest testing accuracy of 91.82% and 86.06%, respectively, portraying our method as a real-world solution for detecting liver disease.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Can Artificial Intelligence Accelerate Fluid Mechanics Research?
TL;DR: This paper reviewed ML and DL research for fluid dynamics, presents algorithmic challenges and discusses potential future directions and discusses the potential future direction of fluid dynamics research in artificial intelligence for scientific, engineering and biomedical applications.
20
A comparative analysis of boosting algorithms for chronic liver disease prediction
Shahid Mohammad Ganie,Pijush Kanti Dutta Pramanik +1 more
TL;DR: A comparative analysis of boosting algorithms for chronic liver disease prediction finds Gradient Boosting as the most effective algorithm for predicting chronic liver disease with high accuracy and performance.
9
Machine Learning Algorithms for Predicting Mechanical Stiffness of Lattice Structure-Based Polymer Foam
TL;DR: This study revealed the accurate prediction of the mechanical stiffness of lattice parts for the desired set of lattices parameters and recorded corresponding strain deformations.
7
A systematic review on deep learning‐based automated cancer diagnosis models
Ritu Tandon,Shweta Agrawal,Narendra Pal Singh Rathore,Abhinava K. Mishra,Sanjiv Jain +4 more
TL;DR: A systematic review on deep learning‐based automated cancer diagnosis models finds that most researchers achieved appreciable accuracy using convolutional neural network models for automated diagnosis of cancer patients.
7
Development of a Model to Classify Skin Diseases using Stacking Ensemble Machine Learning Techniques
Oluwayemisi Jaiyeoba,Emeka Ogbuju,Owolabi Temitope Yomi,Francisca Oladipo +3 more
- 20 May 2024
TL;DR: The study developed an ensemble machine learning model to classify Erythemato-Squamous Diseases (ESD) with high accuracy. The model achieved an accuracy of 99.30% and outperformed the individual base models.
7
References
•Posted Content
Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning
TL;DR: Different flavors of the bootstrap technique are introduced for estimating the uncertainty of performance estimates, as an alternative to confidence intervals via normal approximation if bootstrapping is computationally feasible.
598
Prediction of fatty liver disease using machine learning algorithms.
Chieh-Chen Wu,Wen-Chun Yeh,Wen-Ding Hsu,Mohaimenul Islam,Phung Anh Nguyen,Tahmina Nasrin Poly,Yao-Chin Wang,Hsuan Chia Yang,Yu-Chuan Jack Li +8 more
TL;DR: Wang et al. as discussed by the authors developed a machine learning model to predict FLD that could assist physicians in classifying high-risk patients and make a novel diagnosis, prevent and manage FLD.
285
An intelligent model for liver disease diagnosis
TL;DR: An intelligent model for the diagnosis of liver diseases which integrates CART and CBR is suggested which can be used as a supporting system in making decisions regarding liver disease diagnosis and treatment.
165
Comparison of Machine Learning Approaches for Prediction of Advanced Liver Fibrosis in Chronic Hepatitis C Patients
Somaya Hashem,Gamal Esmat,Wafaa El-Akel,Shahira M. Habashy,Safaa Abdel Raouf,Mohamed Elhefnawi,Mohamed I. Eladawy,Mahmoud ElHefnawi +7 more
TL;DR: Machine-learning approaches could be used as alternative methods in prediction of the risk of advanced liver fibrosis due to chronic hepatitis C by combining the serum bio-markers and clinical information to develop the classification models.
134
Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China
TL;DR: Novel machine learning techniques may have screening and predictive value for NAFLD by leveraging a set of statistical testing techniques.