Top 6197 papers published in the topic of Support vector machine in 2021

Showing papers on "Support vector machine published in 2021"

Journal Article•10.1016/J.MEASUREMENT.2020.108288•

A hybrid deep transfer learning model with machine learning methods for face mask detection in the era of the COVID-19 pandemic

[...]

Mohamed Loey¹, Gunasekaran Manogaran², Gunasekaran Manogaran³, Mohamed Hamed N. Taha⁴, Nour Eldeen Khalifa⁴ - Show less +1 more•Institutions (4)

Banha University¹, University of California, Davis², Asia University (Taiwan)³, Cairo University⁴

01 Jan 2021-Measurement

TL;DR: A hybrid model using deep and classical machine learning for face mask detection will be presented, and the SVM classifier achieved 99.64 % testing accuracy in RMFD.

...read moreread less

807 citations

Journal Article•10.1016/J.ESWA.2020.114054•

Deep Learning Approaches for COVID-19 Detection Based on Chest X-ray Images.

[...]

Aras Masood Ismael¹, Abdulkadir Sengur²•Institutions (2)

Sulaimani Polytechnic University¹, Fırat University²

01 Feb 2021-Expert Systems With Applications

TL;DR: Results showed the deep approaches to be quite efficient when compared to the local texture descriptors in the detection of COVID-19 based on chest X-ray images.

...read moreread less

Abstract: COVID-19 is a novel virus that causes infection in both the upper respiratory tract and the lungs. The numbers of cases and deaths have increased on a daily basis on the scale of a global pandemic. Chest X-ray images have proven useful for monitoring various lung diseases and have recently been used to monitor the COVID-19 disease. In this paper, deep-learning-based approaches, namely deep feature extraction, fine-tuning of pretrained convolutional neural networks (CNN), and end-to-end training of a developed CNN model, have been used in order to classify COVID-19 and normal (healthy) chest X-ray images. For deep feature extraction, pretrained deep CNN models (ResNet18, ResNet50, ResNet101, VGG16, and VGG19) were used. For classification of the deep features, the Support Vector Machines (SVM) classifier was used with various kernel functions, namely Linear, Quadratic, Cubic, and Gaussian. The aforementioned pretrained deep CNN models were also used for the fine-tuning procedure. A new CNN model is proposed in this study with end-to-end training. A dataset containing 180 COVID-19 and 200 normal (healthy) chest X-ray images was used in the study's experimentation. Classification accuracy was used as the performance measurement of the study. The experimental works reveal that deep learning shows potential in the detection of COVID-19 based on chest X-ray images. The deep features extracted from the ResNet50 model and SVM classifier with the Linear kernel function produced a 94.7% accuracy score, which was the highest among all the obtained results. The achievement of the fine-tuned ResNet50 model was found to be 92.6%, whilst end-to-end training of the developed CNN model produced a 91.6% result. Various local texture descriptors and SVM classifications were also used for performance comparison with alternative deep approaches; the results of which showed the deep approaches to be quite efficient when compared to the local texture descriptors in the detection of COVID-19 based on chest X-ray images.

...read moreread less

706 citations

Journal Article•10.1016/J.PATREC.2020.07.042•

Comparative analysis of image classification algorithms based on traditional machine learning and deep learning

[...]

Pin Wang¹, En Fan², Peng Wang³•Institutions (3)

Shenzhen Polytechnic¹, Shaoxing University², Chinese Academy of Sciences³

01 Jan 2021-Pattern Recognition Letters

TL;DR: The experimental results in this paper show that traditional machine learning has a better solution effect on small sample data sets, and deep learning framework has higher recognition accuracy on large sample data set.

...read moreread less

591 citations

Journal Article•10.1016/J.ESWA.2020.114060•

Machine learning and data mining in manufacturing

[...]

Alican Dogan¹, Derya Birant¹•Institutions (1)

Dokuz Eylül University¹

15 Mar 2021-Expert Systems With Applications

TL;DR: A comprehensive literature review is presented to provide an overview of how machine learning techniques can be applied to realize manufacturing mechanisms with intelligent actions and points to several significant research questions that are unanswered in the recent literature having the same target.

...read moreread less

Abstract: Manufacturing organizations need to use different kinds of techniques and tools in order to fulfill their foundation goals. In this aspect, using machine learning (ML) and data mining (DM) techniques and tools could be very helpful for dealing with challenges in manufacturing. Therefore, in this paper, a comprehensive literature review is presented to provide an overview of how machine learning techniques can be applied to realize manufacturing mechanisms with intelligent actions. Furthermore, it points to several significant research questions that are unanswered in the recent literature having the same target. Our survey aims to provide researchers with a solid understanding of the main approaches and algorithms used to improve manufacturing processes over the past two decades. It presents the previous ML studies and recent advances in manufacturing by grouping them under four main subjects: scheduling, monitoring, quality, and failure. It comprehensively discusses existing solutions in manufacturing according to various aspects, including tasks (i.e., clustering, classification, regression), algorithms (i.e., support vector machine, neural network), learning types (i.e., ensemble learning, deep learning), and performance metrics (i.e., accuracy, mean absolute error). Furthermore, the main steps of knowledge discovery in databases (KDD) process to be followed in manufacturing applications are explained in detail. In addition, some statistics about the current state are also given from different perspectives. Besides, it explains the advantages of using machine learning techniques in manufacturing, expresses the ways to overcome certain challenges, and offers some possible further research directions.

...read moreread less

533 citations

Journal Article•10.1109/ACCESS.2021.3053759•

Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques

[...]

Pronab Ghosh¹, Sami Azam², Mirjam Jonkman², Asif Karim², F. M. Javed Mehedi Shamrat, Eva Ignatious², Shahana Shultana¹, Abhijith Reddy Beeravolu², Friso De Boer² - Show less +5 more•Institutions (2)

Daffodil International University¹, Charles Darwin University²

22 Jan 2021-IEEE Access

TL;DR: In this article, the authors proposed a model that incorporates different methods to achieve effective prediction of heart disease, which used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model.

...read moreread less

Abstract: Cardiovascular diseases (CVD) are among the most common serious illnesses affecting human health. CVDs may be prevented or mitigated by early diagnosis, and this may reduce mortality rates. Identifying risk factors using machine learning models is a promising approach. We would like to propose a model that incorporates different methods to achieve effective prediction of heart disease. For our proposed model to be successful, we have used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model. We have used a combined dataset (Cleveland, Long Beach VA, Switzerland, Hungarian and Stat log). Suitable features are selected by using the Relief, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques. New hybrid classifiers like Decision Tree Bagging Method (DTBM), Random Forest Bagging Method (RFBM), K-Nearest Neighbors Bagging Method (KNNBM), AdaBoost Boosting Method (ABBM), and Gradient Boosting Boosting Method (GBBM) are developed by integrating the traditional classifiers with bagging and boosting methods, which are used in the training process. We have also instrumented some machine learning algorithms to calculate the Accuracy (ACC), Sensitivity (SEN), Error Rate, Precision (PRE) and F1 Score (F1) of our model, along with the Negative Predictive Value (NPR), False Positive Rate (FPR), and False Negative Rate (FNR). The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy while using RFBM and Relief feature selection methods (99.05%).

...read moreread less

434 citations

Journal Article•10.3390/TECHNOLOGIES9030052•

Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance

[...]

Manjurul Ahsan, M. A. Parvez Mahmud, Pritom Kumar Saha, Kishor Datta Gupta, Zahed Siddique - Show less +1 more

1 Jan 2021

TL;DR: CART, along with RS or QT, outperforms all other ML algorithms with 100% accuracy, 100% precision, 99% recall, and 100% F1 score, and the study outcomes demonstrate that the model’s performance varies depending on the data scaling method.

...read moreread less

Abstract: Heart disease, one of the main reasons behind the high mortality rate around the world, requires a sophisticated and expensive diagnosis process. In the recent past, much literature has demonstrated machine learning approaches as an opportunity to efficiently diagnose heart disease patients. However, challenges associated with datasets such as missing data, inconsistent data, and mixed data (containing inconsistent missing data both as numerical and categorical) are often obstacles in medical diagnosis. This inconsistency led to a higher probability of misprediction and a misled result. Data preprocessing steps like feature reduction, data conversion, and data scaling are employed to form a standard dataset—such measures play a crucial role in reducing inaccuracy in final prediction. This paper aims to evaluate eleven machine learning (ML) algorithms—Logistic Regression (LR), Linear Discriminant Analysis (LDA), K-Nearest Neighbors (KNN), Classification and Regression Trees (CART), Naive Bayes (NB), Support Vector Machine (SVM), XGBoost (XGB), Random Forest Classifier (RF), Gradient Boost (GB), AdaBoost (AB), Extra Tree Classifier (ET)—and six different data scaling methods—Normalization (NR), Standscale (SS), MinMax (MM), MaxAbs (MA), Robust Scaler (RS), and Quantile Transformer (QT) on a dataset comprising of information of patients with heart disease. The result shows that CART, along with RS or QT, outperforms all other ML algorithms with 100% accuracy, 100% precision, 99% recall, and 100% F1 score. The study outcomes demonstrate that the model’s performance varies depending on the data scaling method.

...read moreread less

407 citations

Journal Article•10.1007/S40745-021-00344-X•

A Comprehensive Comparative Study of Artificial Neural Network (ANN) and Support Vector Machines (SVM) on Stock Forecasting

[...]

Akshit Kurani¹, Pavan Doshi¹, Aarya Vakharia¹, Manan Shah²•Institutions (2)

Indus University¹, Pandit Deendayal Petroleum University²

02 Jun 2021-Annals of Data Science

TL;DR: Conclusively SVM and ANN played prominent roles in tackling these issue to an extent and can be enhanced with their integration with other novel techniques resulting in hybrid methodologies and will lead students, researchers and financial enthusiasts to more potent approaches for Stock forecasting.

...read moreread less

Abstract: From exchanging budgetary instruments to tracking individual spending plans to detail a business's profit, money-related organisations utilise computational innovation day by day. Here in this paper, we focus on the significance of innovation in accounts such as financial risk management and stock prediction. We discuss two significant algorithms that have a notable role in stock forecasting. Artificial Neural Networks (ANN), as absenteeism of some data points, does not hamper the network functioning. Secondly, Support Vector Machines (SVM) has several features, and due to simple decision boundaries, it avoids over-fitting. The paper first looks at the different technologies applied in stock market prediction. It examines how sentimental analysis, decision trees, moving average algorithm, and data mining is applied in various stock prediction scenarios. The paper covers the recent past studies to explore the concepts and methodologies through which ANN's and SVM's have been used. Additionally, the paper incorporates significant aspects of novel methods and technologies in which ANN as a hybrid model like ANN-MLP, GARCH-MLP, a combination of the Backpropagation algorithm and Multilayer Feed-forward network, yields better results. Simultaneously, SVM's have been successfully applied in stock prediction, giving an accuracy of about 60%–70% for simple SVM, which is further improved by combining methods like Random Forest, Genetic Algorithm more accurate outcomes. Further, we present our thoughts on where SVM's and ANN's stand as prediction algorithms and challenges like the time constraint, current scenarios, data limitation, and cold start problems were raised. Conclusively SVM and ANN played prominent roles in tackling these issue to an extent and can further be enhanced with their integration with other novel techniques resulting in hybrid methodologies. It will lead students, researchers and financial enthusiasts to more potent approaches for Stock forecasting.

...read moreread less

381 citations

Journal Article•10.1016/J.ASEJ.2020.11.011•

Extreme gradient boosting (Xgboost) model to predict the groundwater levels in Selangor Malaysia

[...]

Ahmedbahaaaldin Ibrahem Ahmed Osman¹, Ali Najah Ahmed¹, Ming Fai Chow¹, Yuk Feng Huang², Ahmed El-Shafie³, Ahmed El-Shafie⁴ - Show less +2 more•Institutions (4)

Universiti Tenaga Nasional¹, Universiti Tunku Abdul Rahman², University of Malaya³, United Arab Emirates University⁴

22 Jan 2021-Ain Shams Engineering Journal

TL;DR: The proposed Xgboost model outperformed both the Artificial Neural Network and Support Vector Regression models for all different input combinations and serves as a great benchmark for future groundwater levels prediction using Xg Boost algorithm.

...read moreread less

372 citations

Journal Article•10.3390/S21062222•

MRI-Based Brain Tumor Classification Using Ensemble of Deep Features and Machine Learning Classifiers

[...]

Jaeyong Kang¹, Zahid Ullah¹, Jeonghwan Gwak•Institutions (1)

Korea National University of Transportation¹

22 Mar 2021-Sensors

TL;DR: In this paper, the authors proposed a method for brain tumor classification using an ensemble of deep features and machine learning classifiers, where the top three deep features which perform well on several machine-learning classifiers are selected and concatenated as an ensemble-of-deep features which is then fed into several machine learning classes to predict the final output.

...read moreread less

Abstract: Brain tumor classification plays an important role in clinical diagnosis and effective treatment. In this work, we propose a method for brain tumor classification using an ensemble of deep features and machine learning classifiers. In our proposed framework, we adopt the concept of transfer learning and uses several pre-trained deep convolutional neural networks to extract deep features from brain magnetic resonance (MR) images. The extracted deep features are then evaluated by several machine learning classifiers. The top three deep features which perform well on several machine learning classifiers are selected and concatenated as an ensemble of deep features which is then fed into several machine learning classifiers to predict the final output. To evaluate the different kinds of pre-trained models as a deep feature extractor, machine learning classifiers, and the effectiveness of an ensemble of deep feature for brain tumor classification, we use three different brain magnetic resonance imaging (MRI) datasets that are openly accessible from the web. Experimental results demonstrate that an ensemble of deep features can help improving performance significantly, and in most cases, support vector machine (SVM) with radial basis function (RBF) kernel outperforms other machine learning classifiers, especially for large datasets.

...read moreread less

372 citations

Journal Article•10.1016/J.PETROL.2020.108182•

Application of supervised machine learning paradigms in the prediction of petroleum reservoir properties: Comparative analysis of ANN and SVM models

[...]

Daniel Asante Otchere¹, Tarek Omar Arbi Ganat¹, Raoof Gholami², Syahrir Ridha¹•Institutions (2)

Universiti Teknologi Petronas¹, University of Stavanger²

01 May 2021-Journal of Petroleum Science and Engineering

TL;DR: This review focuses on the most widely used machine learning algorithm employed in the petroleum industry, the Artificial Neural Network (ANN) with different shallow models used in reservoir characterisation, where in most cases based on this review it outperformed the ANN.

...read moreread less

365 citations

Journal Article•10.1186/S13321-020-00479-8•

Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models

[...]

Dejun Jiang¹, Zhenxing Wu¹, Chang-Yu Hsieh², Guangyong Chen, Ben Liao², Zhe Wang¹, Chao Shen¹, Dong-Sheng Cao³, Jian Wu¹, Tingjun Hou¹ - Show less +6 more•Institutions (3)

Zhejiang University¹, Tencent², Central South University³

17 Feb 2021-Journal of Cheminformatics

TL;DR: In this paper, the predictive capacity and computational efficiency of the prediction models developed by eight machine learning (ML) algorithms, including four descriptor-based models (SVM, XGBoost, RF and DNN) and four graph-based methods (GCN, GAT, MPNN and Attentive FP), were extensively tested and compared.

...read moreread less

Abstract: Graph neural networks (GNN) has been considered as an attractive modelling method for molecular property prediction, and numerous studies have shown that GNN could yield more promising results than traditional descriptor-based methods. In this study, based on 11 public datasets covering various property endpoints, the predictive capacity and computational efficiency of the prediction models developed by eight machine learning (ML) algorithms, including four descriptor-based models (SVM, XGBoost, RF and DNN) and four graph-based models (GCN, GAT, MPNN and Attentive FP), were extensively tested and compared. The results demonstrate that on average the descriptor-based models outperform the graph-based models in terms of prediction accuracy and computational efficiency. SVM generally achieves the best predictions for the regression tasks. Both RF and XGBoost can achieve reliable predictions for the classification tasks, and some of the graph-based models, such as Attentive FP and GCN, can yield outstanding performance for a fraction of larger or multi-task datasets. In terms of computational cost, XGBoost and RF are the two most efficient algorithms and only need a few seconds to train a model even for a large dataset. The model interpretations by the SHAP method can effectively explore the established domain knowledge for the descriptor-based models. Finally, we explored use of these models for virtual screening (VS) towards HIV and demonstrated that different ML algorithms offer diverse VS profiles. All in all, we believe that the off-the-shelf descriptor-based models still can be directly employed to accurately predict various chemical endpoints with excellent computability and interpretability.

...read moreread less

Journal Article•10.1109/ACCESS.2021.3064084•

Improving the Prediction of Heart Failure Patients’ Survival Using SMOTE and Effective Data Mining Techniques

[...]

Abid Ishaq¹, Saima Sadiq¹, Muhammad Umer¹, Saleem Ullah¹, Seyedali Mirjalili, Vaibhav Rupapara², Michele Nappi³ - Show less +3 more•Institutions (3)

University of Engineering and Technology, Lahore¹, Florida International University², University of Salerno³

04 Mar 2021-IEEE Access

TL;DR: In this paper, the authors analyzed the heart failure survivors from the dataset of 299 patients admitted in hospital and found significant features and effective data mining techniques that can boost the accuracy of cardiovascular patient's survivor prediction.

...read moreread less

Abstract: Cardiovascular disease is a substantial cause of mortality and morbidity in the world. In clinical data analytics, it is a great challenge to predict heart disease survivor. Data mining transforms huge amounts of raw data generated by the health industry into useful information that can help in making informed decisions. Various studies proved that significant features play a key role in improving performance of machine learning models. This study analyzes the heart failure survivors from the dataset of 299 patients admitted in hospital. The aim is to find significant features and effective data mining techniques that can boost the accuracy of cardiovascular patient’s survivor prediction. To predict patient’s survival, this study employs nine classification models: Decision Tree (DT), Adaptive boosting classifier (AdaBoost), Logistic Regression (LR), Stochastic Gradient classifier (SGD), Random Forest (RF), Gradient Boosting classifier (GBM), Extra Tree Classifier (ETC), Gaussian Naive Bayes classifier (G-NB) and Support Vector Machine (SVM). The imbalance class problem is handled by Synthetic Minority Oversampling Technique (SMOTE). Furthermore, machine learning models are trained on the highest ranked features selected by RF. The results are compared with those provided by machine learning algorithms using full set of features. Experimental results demonstrate that ETC outperforms other models and achieves 0.9262 accuracy value with SMOTE in prediction of heart patient’s survival.

...read moreread less

Journal Article•10.1007/S41870-017-0080-1•

Survey on SVM and their application in image classification

[...]

Mayank Arya Chandra¹, S. S. Bedi¹•Institutions (1)

M. J. P. Rohilkhand University¹

01 Oct 2021-International Journal of Information Technology

TL;DR: The different computational model of SVM and key process for the SVM system development are reviewed and a survey on their applications for image classification is provided.

...read moreread less

Abstract: Life of any living being is impossible if it does not have the ability to differentiate between various things, objects, smell, taste, colors, etc. Human being is a good ability to classify the object easily such as different human face, images. This is time of the machine so we want that machine can do all the work like as a human, this is part of machine learning. Here this paper discusses the some important technique for the image classification. What are the techniques through which a machine can learn for the image classification task as well as perform the classification task with efficiently. The most known technique to learn a machine is SVM. Support Vector machine (SVM) has evolved as an efficient paradigm for classification. SVM has a strongest mathematical model for classification and regression. This powerful mathematical foundation gives a new direction for further research in the vast field of classification and regression. Over the past few decades, various improvements to SVM has appeared, such as twin SVM, Lagrangian SVM, Quantum Support vector machine, least square support vector machine, etc., which will be further discussed in the paper, led to the creation of a new approach for better classification accuracy. For improving the accuracy as well as performance of SVM, we must aware of how a kernel function should be selected and what are the different approaches for parameter selection. This paper reviews the different computational model of SVM and key process for the SVM system development. Furthermore provides survey on their applications for image classification.

...read moreread less

Journal Article•10.3390/S21020446•

An Experimental Analysis of Attack Classification Using Machine Learning in IoT Networks.

[...]

Andrew Churcher¹, Rehmat Ullah², Jawad Ahmad¹, Sadaqat Ur Rehman³, Fawad Masood⁴, Mandar Gogate¹, Fehaid Alqahtani⁵, Boubakr Nour⁶, William J Buchanan¹ - Show less +5 more•Institutions (6)

Edinburgh Napier University¹, Queen's University Belfast², Namal College³, Yangzhou University⁴, United States Naval Academy⁵, Beijing Institute of Technology⁶

10 Jan 2021-Sensors

TL;DR: In this paper, the authors compared several machine learning (ML) methods such as k-nearest neighbor (KNN), support vector machine (SVM), decision tree (DT), naive Bayes (NB), random forest (RF), artificial neural network (ANN), and logistic regression (LR) for both binary and multi-class classification on Bot-IoT dataset.

...read moreread less

Abstract: In recent years, there has been a massive increase in the amount of Internet of Things (IoT) devices as well as the data generated by such devices. The participating devices in IoT networks can be problematic due to their resource-constrained nature, and integrating security on these devices is often overlooked. This has resulted in attackers having an increased incentive to target IoT devices. As the number of attacks possible on a network increases, it becomes more difficult for traditional intrusion detection systems (IDS) to cope with these attacks efficiently. In this paper, we highlight several machine learning (ML) methods such as k-nearest neighbour (KNN), support vector machine (SVM), decision tree (DT), naive Bayes (NB), random forest (RF), artificial neural network (ANN), and logistic regression (LR) that can be used in IDS. In this work, ML algorithms are compared for both binary and multi-class classification on Bot-IoT dataset. Based on several parameters such as accuracy, precision, recall, F1 score, and log loss, we experimentally compared the aforementioned ML algorithms. In the case of HTTP distributed denial-of-service (DDoS) attack, the accuracy of RF is 99%. Furthermore, other simulation results-based precision, recall, F1 score, and log loss metric reveal that RF outperforms on all types of attacks in binary classification. However, in multi-class classification, KNN outperforms other ML algorithms with an accuracy of 99%, which is 4% higher than RF.

...read moreread less

Journal Article•10.1016/J.IJCCE.2021.01.001•

An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier

[...]

Saloni Kumari¹, Deepika Kumar¹, Mamta Mittal•Institutions (1)

Bharati Vidyapeeth's College of Engineering¹

1 Jun 2021

TL;DR: The proposed ensemble soft voting classifier gives binary classification and uses the ensemble of three machine learning algorithms viz. random forest, logistic regression, and Naive Bayes for the classification.

...read moreread less

Abstract: Diabetes is a dreadful disease identified by escalated levels of glucose in the blood Machine learning algorithms help in identification and prediction of diabetes at an early stage The main objective of this study is to predict diabetes mellitus with better accuracy using an ensemble of machine learning algorithms The Pima Indians Diabetes dataset has been considered for experimentation, which gathers details of patients with and without having diabetes The proposed ensemble soft voting classifier gives binary classification and uses the ensemble of three machine learning algorithms viz random forest, logistic regression, and Naive Bayes for the classification Empirical evaluation of the proposed methodology has been conducted with state-of-the-art methodologies and base classifiers such as AdaBoost, Logistic Regression,Support Vector machine, Random forest, Naive Bayes, Bagging, GradientBoost, XGBoost, CatBoost by taking accuracy, precision, recall, F1-score as the evaluation criteria The proposed ensemble approach gives the highest accuracy, precision, recall, and F1_score value with 7904%, 7348%, 7145% and 806% respectively on the PIMA diabetes dataset Further, the efficiency of the proposed methodology has also been compared and analysed with breast cancer dataset The proposed ensemble soft voting classifier has given 9702% accuracy on the breast cancer dataset

...read moreread less

Journal Article•10.1109/ACCESS.2021.3056614•

Benchmarking of Machine Learning for Anomaly Based Intrusion Detection Systems in the CICIDS2017 Dataset

[...]

Ziadoon Kamil Maseer¹, Robiah Yusof¹, Nazrulazhar Bahaman¹, Salama A. Mostafa², Cik Feresa Mohd Foozy² - Show less +1 more•Institutions (2)

Universiti Teknikal Malaysia Melaka¹, Universiti Tun Hussein Onn Malaysia²

03 Feb 2021-IEEE Access

TL;DR: 10 popular supervised and unsupervised ML algorithms for identifying effective and efficient ML–AIDS of networks and computers are applied and the true positive and negative rates, accuracy, precision, recall, and F-Score of 31 ML-AIDS models are evaluated.

...read moreread less

Abstract: An intrusion detection system (IDS) is an important protection instrument for detecting complex network attacks Various machine learning (ML) or deep learning (DL) algorithms have been proposed for implementing anomaly-based IDS (AIDS) Our review of the AIDS literature identifies some issues in related work, including the randomness of the selected algorithms, parameters, and testing criteria, the application of old datasets, or shallow analyses and validation of the results This paper comprehensively reviews previous studies on AIDS by using a set of criteria with different datasets and types of attacks to set benchmarking outcomes that can reveal the suitable AIDS algorithms, parameters, and testing criteria Specifically, this paper applies 10 popular supervised and unsupervised ML algorithms for identifying effective and efficient ML–AIDS of networks and computers These supervised ML algorithms include the artificial neural network (ANN), decision tree (DT), k-nearest neighbor (k-NN), naive Bayes (NB), random forest (RF), support vector machine (SVM), and convolutional neural network (CNN) algorithms, whereas the unsupervised ML algorithms include the expectation-maximization (EM), k-means, and self-organizing maps (SOM) algorithms Several models of these algorithms are introduced, and the turning and training parameters of each algorithm are examined to achieve an optimal classifier evaluation Unlike previous studies, this study evaluates the performance of AIDS by measuring the true positive and negative rates, accuracy, precision, recall, and F-Score of 31 ML-AIDS models The training and testing time for ML-AIDS models are also considered in measuring their performance efficiency given that time complexity is an important factor in AIDSs The ML-AIDS models are tested by using a recent and highly unbalanced multiclass CICIDS2017 dataset that involves real-world network attacks In general, the k-NN-AIDS, DT-AIDS, and NB-AIDS models obtain the best results and show a greater capability in detecting web attacks compared with other models that demonstrate irregular and inferior results

...read moreread less

Journal Article•10.1016/J.ICTE.2021.02.004•

A comparison of machine learning algorithms for diabetes prediction

[...]

Jobeda J Khanam¹, Simon Y. Foo¹•Institutions (1)

Florida A&M University – Florida State University College of Engineering¹

20 Feb 2021-ICT Express

TL;DR: Data mining, machine learning (ML) algorithms, and Neural Network (NN) methods are used in diabetes prediction in this research, which found that the model with Logistic Regression and Support Vector Machine (SVM) works well on diabetes prediction.

...read moreread less

Journal Article•10.1016/J.CONBUILDMAT.2020.120950•

Efficient machine learning models for prediction of concrete strengths

[...]

Hoang X. Nguyen¹, Thanh Vu², Thuc P. Vo³, Huu-Tai Thai⁴•Institutions (4)

Northumbria University¹, Oracle Corporation², La Trobe University³, University of Melbourne⁴

10 Jan 2021-Construction and Building Materials

TL;DR: The comparative studies reveal that, for this particular prediction problem, the trained models based on GBR and XGBoost perform better than those of SVR and MLP.

...read moreread less

Journal Article•10.1007/S42979-020-00394-7•

Supervised Machine Learning Models for Prediction of COVID-19 Infection using Epidemiology Dataset

[...]

L. J. Muhammad, Ebrahem A. Algehyne¹, Sani Sharif Usman, Abdulkadir Ahmad, Chinmay Chakraborty², Ismail A. Mohammed³ - Show less +2 more•Institutions (3)

University of Tabuk¹, Birla Institute of Technology and Science², Bukar Abba Ibrahim University³

1 Feb 2021

TL;DR: Supervised machine learning models for COVID-19 infection were developed in this work with learning algorithms which include logistic regression, decision tree, support vector machine, naive Bayes, and artificial neutral network using epidemiology labeled dataset for positive and negative CO VID-19 cases of Mexico.

...read moreread less

Abstract: COVID-19 or 2019-nCoV is no longer pandemic but rather endemic, with more than 651,247 people around world having lost their lives after contracting the disease. Currently, there is no specific treatment or cure for COVID-19, and thus living with the disease and its symptoms is inevitable. This reality has placed a massive burden on limited healthcare systems worldwide especially in the developing nations. Although neither an effective, clinically proven antiviral agents' strategy nor an approved vaccine exist to eradicate the COVID-19 pandemic, there are alternatives that may reduce the huge burden on not only limited healthcare systems but also the economic sector; the most promising include harnessing non-clinical techniques such as machine learning, data mining, deep learning and other artificial intelligence. These alternatives would facilitate diagnosis and prognosis for 2019-nCoV pandemic patients. Supervised machine learning models for COVID-19 infection were developed in this work with learning algorithms which include logistic regression, decision tree, support vector machine, naive Bayes, and artificial neutral network using epidemiology labeled dataset for positive and negative COVID-19 cases of Mexico. The correlation coefficient analysis between various dependent and independent features was carried out to determine a strength relationship between each dependent feature and independent feature of the dataset prior to developing the models. The 80% of the training dataset were used for training the models while the remaining 20% were used for testing the models. The result of the performance evaluation of the models showed that decision tree model has the highest accuracy of 94.99% while the Support Vector Machine Model has the highest sensitivity of 93.34% and Naive Bayes Model has the highest specificity of 94.30%.

...read moreread less

Journal Article•10.1016/J.COMPBIOMED.2021.104572•

COVID-19 cough classification using machine learning and global smartphone recordings.

[...]

Madhurananda Pahar¹, Marisa Klopper¹, Robin M. Warren¹, Thomas Niesler¹•Institutions (1)

Stellenbosch University¹

17 Jun 2021-Computers in Biology and Medicine

TL;DR: Although all classifiers were able to identify COVID-19 coughs, the best performance was exhibited by the Resnet50 classifier, which was best able to discriminate between the CO VID-19 positive and the healthy coughs with an area under the ROC curve (AUC) of 0.98.

...read moreread less

Journal Article•10.1109/JAS.2020.1003393•

Automatic detection of COVID-19 infection using chest X-ray images through transfer learning

[...]

Elene Firmeza Ohata¹, Gabriel M. Bezerra, João Victor Souza das Chagas, Aloisio Vieira Lira Neto², Adriano Bessa Albuquerque², Victor Hugo C. de Albuquerque², Pedro Pedrosa Rebouças Filho¹ - Show less +3 more•Institutions (2)

Federal University of Ceará¹, University of Fortaleza²

01 Jan 2021-IEEE/CAA Journal of Automatica Sinica

TL;DR: This work proposes an automatic detection method for COVID-19 infection based on chest X-ray images using different architectures of convolutional neural networks trained on ImageNet, and adapt them to behave as feature extractors for the X-Ray images.

...read moreread less

Abstract: The new coronavirus ( COVID-19 ) , declared by the World Health Organization as a pandemic, has infected more than 1 million people and killed more than 50 thousand. An infection caused by COVID-19 can develop into pneumonia, which can be detected by a chest X-ray exam and should be treated appropriately. In this work, we propose an automatic detection method for COVID-19 infection based on chest X-ray images. The datasets constructed for this study are composed of 194 X-ray images of patients diagnosed with coronavirus and 194 X-ray images of healthy patients. Since few images of patients with COVID-19 are publicly available, we apply the concept of transfer learning for this task. We use different architectures of convolutional neural networks ( CNNs ) trained on ImageNet, and adapt them to behave as feature extractors for the X-ray images. Then, the CNNs are combined with consolidated machine learning methods, such as k-Nearest Neighbor, Bayes, Random Forest, multilayer perceptron ( MLP ) , and support vector machine ( SVM ) . The results show that, for one of the datasets, the extractor-classifier pair with the best performance is the MobileNet architecture with the SVM classifier using a linear kernel, which achieves an accuracy and an F1-score of 98.5 & . For the other dataset, the best pair is DenseNet201 with MLP, achieving an accuracy and an F1-score of 95.6 & . Thus, the proposed approach demonstrates efficiency in detecting COVID-19 in X-ray images.

...read moreread less

Journal Article•10.1109/TIE.2020.2994868•

Fault Diagnosis of an Autonomous Vehicle With an Improved SVM Algorithm Subject to Unbalanced Datasets

[...]

Qian Shi¹, Hui Zhang¹•Institutions (1)

Beihang University¹

01 Jul 2021-IEEE Transactions on Industrial Electronics

TL;DR: Experimental results and comparisons of an automated vehicle illustrate the effectiveness of the proposed algorithm on the steering actuator fault diagnosis and show that the proposed algorithms has superiority on the classification over existing methods.

...read moreread less

Abstract: Safety is one of the key requirements for automated vehicles and fault diagnosis is an effective technique to enhance the vehicle safety. The model-based fault diagnosis method models the fault into the system model and estimates the faults by observer. In this article, to avoid the complexity of designing observer, we investigate the problem of steering actuator fault diagnosis for automated vehicles based on the approach of model-based support vector machine (SVM) classification. The system model is utilized to generate the residual signal as the training data and the data-based algorithm of the SVM classification is employed to diagnose the fault. Due to the phenomena of data unbalance induced poor performance of the data-driven method, an undersampling procedure with the approach of linear discriminant analysis and a threshold adjustment using the algorithm of grey wolf optimizer are proposed to modify and improve the performance of classification and fault diagnosis. Various comparisons are carried out based on widely used datasets. The comparison results show that the proposed algorithm has superiority on the classification over existing methods. Experimental results and comparisons of an automated vehicle illustrate the effectiveness of the proposed algorithm on the steering actuator fault diagnosis.

...read moreread less

Journal Article•10.32604/IASC.2021.018983•

Modelling Supply Chain Information Collaboration Empowered with Machine Learning Technique

[...]

Naeem Ali, Alia Ahmed, Leena Anum, Taher M. Ghazal, Sagheer Abbas, Muhammad Adnan Khan, Haitham M. Alzoubi, Munir Ahmad - Show less +4 more

01 Jul 2021-Intelligent Automation and Soft Computing

Journal Article•10.1016/J.COMNET.2021.107840•

Machine learning methods for cyber security intrusion detection: Datasets and comparative study

[...]

Ilhan Firat Kilincer¹, Fatih Ertam¹, Abdulkadir Sengur¹•Institutions (1)

Fırat University¹

07 Apr 2021-Computer Networks

TL;DR: In this paper, the authors reviewed the literature studies using CSE-CIC IDS-2018, UNSW-NB15, ISCX-2012, NSL-KDD and CIDDS-001 data sets, which are widely used to develop IDS systems.

...read moreread less

Journal Article•10.1016/J.ENGAPPAI.2020.104015•

Optimization of support vector machine through the use of metaheuristic algorithms in forecasting TBM advance rate

[...]

Jian Zhou¹, Yingui Qiu¹, Shuangli Zhu¹, Danial Jahed Armaghani², Chuanqi Li¹, Hoang Nguyen³, Saffet Yagiz⁴ - Show less +3 more•Institutions (4)

Central South University¹, Duy Tan University², Hanoi University of Mining and Geology³, Nazarbayev University⁴

01 Jan 2021-Engineering Applications of Artificial Intelligence

TL;DR: Modeling results revealed that the MFO algorithm can capture better hyper-parameters of the SVM model in predicting TBM AR among all three hybrid models, confirming that this hybrid S VM model is a powerful and applicable technique addressing problems related to TBM performance with a high level of accuracy.

...read moreread less

Journal Article•10.3390/APP11020796•

Impact of Dataset Size on Classification Performance: An Empirical Evaluation in the Medical Domain

[...]

Alhanoof Althnian, Duaa H. AlSaeed, Heyam H. Al-Baity, Amani Khalaf Samha, Alanoud Bin Dris, Najla Alzakari, Afnan Abou Elwafa, Heba Kurdi - Show less +4 more

15 Jan 2021-Applied Sciences

TL;DR: It is found that the most robust model for limited medical data is AB and NB, followed by SVM, and then RF and NN, while the least robust model is DT.

...read moreread less

Abstract: Dataset size is considered a major concern in the medical domain, where lack of data is a common occurrence. This study aims to investigate the impact of dataset size on the overall performance of supervised classification models. We examined the performance of six widely-used models in the medical field, including support vector machine (SVM), neural networks (NN), C4.5 decision tree (DT), random forest (RF), adaboost (AB), and naive Bayes (NB) on eighteen small medical UCI datasets. We further implemented three dataset size reduction scenarios on two large datasets and analyze the performance of the models when trained on each resulting dataset with respect to accuracy, precision, recall, f-score, specificity, and area under the ROC curve (AUC). Our results indicated that the overall performance of classifiers depend on how much a dataset represents the original distribution rather than its size. Moreover, we found that the most robust model for limited medical data is AB and NB, followed by SVM, and then RF and NN, while the least robust model is DT. Furthermore, an interesting observation is that a robust machine learning model to limited dataset does not necessary imply that it provides the best performance compared to other models.

...read moreread less

Journal Article•10.1016/J.INPA.2020.09.006•

Insect classification and detection in field crops using modern machine learning techniques

[...]

Thenmozhi Kasinathan¹, Dakshayani Singaraju¹, Srinivasulu Reddy Uyyala¹•Institutions (1)

National Institute of Technology, Tiruchirappalli¹

01 Sep 2021-Information Processing in Agriculture

TL;DR: This paper presents the insect pest detection algorithm that consists of foreground extraction and contour identification to detect the insects for Wang, Xie, Deng, and IP102 datasets in a highly complex background and exhibits considerable improvement in classification accuracy, computation time performance, and state-of-the-art classification algorithms.

...read moreread less

Journal Article•10.1109/TTE.2020.3017090•

Battery Health Prediction Using Fusion-Based Feature Selection and Machine Learning

[...]

Xiaosong Hu¹, Yunhong Che¹, Xianke Lin², Simona Onori³•Institutions (3)

Chongqing University¹, University of Ontario Institute of Technology², Stanford University³

1 Jun 2021

TL;DR: A comprehensive study of the data-driven SOH estimation methods is conducted and the combination of the fusion-based selection method and GPR has an overall superior estimation performance in terms of both accuracy and computational efficiency.

...read moreread less

Abstract: State of health (SOH) is a key parameter to assess lithium-ion battery feasibility for secondary usage applications SOH estimation based on machine learning has attracted great attention in recent years and holds potentials for battery informatization and cloud battery management techniques In this article, a comprehensive study of the data-driven SOH estimation methods is conducted A new classification for health indicators (HIs) is proposed where the HIs are divided into the measured variables and calculated variables To illustrate the significance of data preprocessing, four noise reduction methods are assessed in the HIs extraction process; different feature selection methods, including filter-based method, wrapper-based method, and fusion-based method, are applied to select HIs subsets The four widely used machine learning algorithms, including artificial neural network, support vector machine, relevance vector machine, and Gaussian process regression (GPR), are applied and compared In order to evaluate the estimation performance in potential real usages under future big data era, the three HIs selection methods and four machine learning methods are evaluated using three public data sets and two estimation strategies The results show that the combination of the fusion-based selection method and GPR has an overall superior estimation performance in terms of both accuracy and computational efficiency

...read moreread less

Journal Article•10.1016/J.GSF.2020.02.012•

Landslide identification using machine learning

[...]

Haojie Wang¹, Li Min Zhang¹, Kesheng Yin¹, Hongyu Luo¹, Jinhui Li² - Show less +1 more•Institutions (2)

Hong Kong University of Science and Technology¹, Harbin Institute of Technology²

01 Jan 2021-Geoscience frontiers

TL;DR: By using machine learning and deep learning techniques, the proposed landslide identification method shows outstanding robustness and great potential in tackling the landslide identification problem.

...read moreread less

Abstract: Landslide identification is critical for risk assessment and mitigation. This paper proposes a novel machine-learning and deep-learning method to identify natural-terrain landslides using integrated geodatabases. First, landslide-related data are compiled, including topographic data, geological data and rainfall-related data. Then, three integrated geodatabases are established; namely, Recent Landslide Database (RecLD), Relict Landslide Database (RelLD) and Joint Landslide Database (JLD). After that, five machine learning and deep learning algorithms, including logistic regression (LR), support vector machine (SVM), random forest (RF), boosting methods and convolutional neural network (CNN), are utilized and evaluated on each database. A case study in Lantau, Hong Kong, is conducted to demonstrate the application of the proposed method. From the results of the case study, CNN achieves an identification accuracy of 92.5% on RecLD, and outperforms other algorithms due to its strengths in feature extraction and multi dimensional data processing. Boosting methods come second in terms of accuracy, followed by RF, LR and SVM. By using machine learning and deep learning techniques, the proposed landslide identification method shows outstanding robustness and great potential in tackling the landslide identification problem.

...read moreread less

Book Chapter•10.1007/978-3-030-57077-4_11•

Pattern Recognition and Machine Learning

[...]

Bharadwaj, Kolla Bhanu Prakash, G. R. Kanagachidambaresan¹•Institutions (1)

Techno India¹

1 Jan 2021

TL;DR: Support vector machine (SVM) is one of the most widely used classification algorithms as discussed by the authors, it uses supervised learning method (Aizerman et al., Auto Remote Cont 25:821-837, 1964) for training.

...read moreread less

Abstract: Support vector machine (SVM) is one of the most widely used classification algorithms. It uses supervised learning method (Aizerman et al., Auto Remote Cont 25:821–837, 1964) for training. The SVM classifier is mostly used in multi-classification problems. SVM differs from the traditional classifiers as it uses “decision boundary,” which separates the classes. The decision boundary maximizes distances of data points belongs to different classes .in this; decision boundary is the optimum that is Most optimal (Baron and Ensley, Opportunity recognition as the detection of meaningful patterns: evidence from the prototypes of novice and experienced entrepreneurs. Manuscript under review, 2005) decision boundary has maximum margin. The data points which are nearer to the boundary are called support vectors. The most important thing in SVM is its hyper plane, where for a N-dimensional space it is an (N-1)-dimensional subspace. To better understand, the hyper plane is just a line in one dimension for a two-dimensional space. It is a two-dimensional plane that separates the classes for a three-dimensional space.

...read moreread less

...

Expand