Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Support vector machine
  4. 2021
  1. Home
  2. Topics
  3. Support vector machine
  4. 2021
Showing papers on "Support vector machine published in 2021"
Journal Article•10.1016/J.MEASUREMENT.2020.108288•
A hybrid deep transfer learning model with machine learning methods for face mask detection in the era of the COVID-19 pandemic

[...]

Mohamed Loey1, Gunasekaran Manogaran2, Gunasekaran Manogaran3, Mohamed Hamed N. Taha4, Nour Eldeen Khalifa4 •
Banha University1, University of California, Davis2, Asia University (Taiwan)3, Cairo University4
01 Jan 2021-Measurement
TL;DR: A hybrid model using deep and classical machine learning for face mask detection will be presented, and the SVM classifier achieved 99.64 % testing accuracy in RMFD.

807 citations

Journal Article•10.1016/J.ESWA.2020.114054•
Deep Learning Approaches for COVID-19 Detection Based on Chest X-ray Images.

[...]

Aras Masood Ismael1, Abdulkadir Sengur2•
Sulaimani Polytechnic University1, Fırat University2
01 Feb 2021-Expert Systems With Applications
TL;DR: Results showed the deep approaches to be quite efficient when compared to the local texture descriptors in the detection of COVID-19 based on chest X-ray images.
Abstract: COVID-19 is a novel virus that causes infection in both the upper respiratory tract and the lungs. The numbers of cases and deaths have increased on a daily basis on the scale of a global pandemic. Chest X-ray images have proven useful for monitoring various lung diseases and have recently been used to monitor the COVID-19 disease. In this paper, deep-learning-based approaches, namely deep feature extraction, fine-tuning of pretrained convolutional neural networks (CNN), and end-to-end training of a developed CNN model, have been used in order to classify COVID-19 and normal (healthy) chest X-ray images. For deep feature extraction, pretrained deep CNN models (ResNet18, ResNet50, ResNet101, VGG16, and VGG19) were used. For classification of the deep features, the Support Vector Machines (SVM) classifier was used with various kernel functions, namely Linear, Quadratic, Cubic, and Gaussian. The aforementioned pretrained deep CNN models were also used for the fine-tuning procedure. A new CNN model is proposed in this study with end-to-end training. A dataset containing 180 COVID-19 and 200 normal (healthy) chest X-ray images was used in the study's experimentation. Classification accuracy was used as the performance measurement of the study. The experimental works reveal that deep learning shows potential in the detection of COVID-19 based on chest X-ray images. The deep features extracted from the ResNet50 model and SVM classifier with the Linear kernel function produced a 94.7% accuracy score, which was the highest among all the obtained results. The achievement of the fine-tuned ResNet50 model was found to be 92.6%, whilst end-to-end training of the developed CNN model produced a 91.6% result. Various local texture descriptors and SVM classifications were also used for performance comparison with alternative deep approaches; the results of which showed the deep approaches to be quite efficient when compared to the local texture descriptors in the detection of COVID-19 based on chest X-ray images.

706 citations

Journal Article•10.1016/J.PATREC.2020.07.042•
Comparative analysis of image classification algorithms based on traditional machine learning and deep learning

[...]

Pin Wang1, En Fan2, Peng Wang3•
Shenzhen Polytechnic1, Shaoxing University2, Chinese Academy of Sciences3
01 Jan 2021-Pattern Recognition Letters
TL;DR: The experimental results in this paper show that traditional machine learning has a better solution effect on small sample data sets, and deep learning framework has higher recognition accuracy on large sample data set.

591 citations

Journal Article•10.1016/J.ESWA.2020.114060•
Machine learning and data mining in manufacturing

[...]

Alican Dogan1, Derya Birant1•
Dokuz Eylül University1
15 Mar 2021-Expert Systems With Applications
TL;DR: A comprehensive literature review is presented to provide an overview of how machine learning techniques can be applied to realize manufacturing mechanisms with intelligent actions and points to several significant research questions that are unanswered in the recent literature having the same target.
Abstract: Manufacturing organizations need to use different kinds of techniques and tools in order to fulfill their foundation goals. In this aspect, using machine learning (ML) and data mining (DM) techniques and tools could be very helpful for dealing with challenges in manufacturing. Therefore, in this paper, a comprehensive literature review is presented to provide an overview of how machine learning techniques can be applied to realize manufacturing mechanisms with intelligent actions. Furthermore, it points to several significant research questions that are unanswered in the recent literature having the same target. Our survey aims to provide researchers with a solid understanding of the main approaches and algorithms used to improve manufacturing processes over the past two decades. It presents the previous ML studies and recent advances in manufacturing by grouping them under four main subjects: scheduling, monitoring, quality, and failure. It comprehensively discusses existing solutions in manufacturing according to various aspects, including tasks (i.e., clustering, classification, regression), algorithms (i.e., support vector machine, neural network), learning types (i.e., ensemble learning, deep learning), and performance metrics (i.e., accuracy, mean absolute error). Furthermore, the main steps of knowledge discovery in databases (KDD) process to be followed in manufacturing applications are explained in detail. In addition, some statistics about the current state are also given from different perspectives. Besides, it explains the advantages of using machine learning techniques in manufacturing, expresses the ways to overcome certain challenges, and offers some possible further research directions.

533 citations

Journal Article•10.1109/ACCESS.2021.3053759•
Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques

[...]

Pronab Ghosh1, Sami Azam2, Mirjam Jonkman2, Asif Karim2, F. M. Javed Mehedi Shamrat, Eva Ignatious2, Shahana Shultana1, Abhijith Reddy Beeravolu2, Friso De Boer2 •
Daffodil International University1, Charles Darwin University2
22 Jan 2021-IEEE Access
TL;DR: In this article, the authors proposed a model that incorporates different methods to achieve effective prediction of heart disease, which used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model.
Abstract: Cardiovascular diseases (CVD) are among the most common serious illnesses affecting human health. CVDs may be prevented or mitigated by early diagnosis, and this may reduce mortality rates. Identifying risk factors using machine learning models is a promising approach. We would like to propose a model that incorporates different methods to achieve effective prediction of heart disease. For our proposed model to be successful, we have used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model. We have used a combined dataset (Cleveland, Long Beach VA, Switzerland, Hungarian and Stat log). Suitable features are selected by using the Relief, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques. New hybrid classifiers like Decision Tree Bagging Method (DTBM), Random Forest Bagging Method (RFBM), K-Nearest Neighbors Bagging Method (KNNBM), AdaBoost Boosting Method (ABBM), and Gradient Boosting Boosting Method (GBBM) are developed by integrating the traditional classifiers with bagging and boosting methods, which are used in the training process. We have also instrumented some machine learning algorithms to calculate the Accuracy (ACC), Sensitivity (SEN), Error Rate, Precision (PRE) and F1 Score (F1) of our model, along with the Negative Predictive Value (NPR), False Positive Rate (FPR), and False Negative Rate (FNR). The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy while using RFBM and Relief feature selection methods (99.05%).

434 citations

Journal Article•10.3390/TECHNOLOGIES9030052•
Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance

[...]

Manjurul Ahsan, M. A. Parvez Mahmud, Pritom Kumar Saha, Kishor Datta Gupta, Zahed Siddique 
1 Jan 2021
TL;DR: CART, along with RS or QT, outperforms all other ML algorithms with 100% accuracy, 100% precision, 99% recall, and 100% F1 score, and the study outcomes demonstrate that the model’s performance varies depending on the data scaling method.
Abstract: Heart disease, one of the main reasons behind the high mortality rate around the world, requires a sophisticated and expensive diagnosis process. In the recent past, much literature has demonstrated machine learning approaches as an opportunity to efficiently diagnose heart disease patients. However, challenges associated with datasets such as missing data, inconsistent data, and mixed data (containing inconsistent missing data both as numerical and categorical) are often obstacles in medical diagnosis. This inconsistency led to a higher probability of misprediction and a misled result. Data preprocessing steps like feature reduction, data conversion, and data scaling are employed to form a standard dataset—such measures play a crucial role in reducing inaccuracy in final prediction. This paper aims to evaluate eleven machine learning (ML) algorithms—Logistic Regression (LR), Linear Discriminant Analysis (LDA), K-Nearest Neighbors (KNN), Classification and Regression Trees (CART), Naive Bayes (NB), Support Vector Machine (SVM), XGBoost (XGB), Random Forest Classifier (RF), Gradient Boost (GB), AdaBoost (AB), Extra Tree Classifier (ET)—and six different data scaling methods—Normalization (NR), Standscale (SS), MinMax (MM), MaxAbs (MA), Robust Scaler (RS), and Quantile Transformer (QT) on a dataset comprising of information of patients with heart disease. The result shows that CART, along with RS or QT, outperforms all other ML algorithms with 100% accuracy, 100% precision, 99% recall, and 100% F1 score. The study outcomes demonstrate that the model’s performance varies depending on the data scaling method.

407 citations

Journal Article•10.1007/S40745-021-00344-X•
A Comprehensive Comparative Study of Artificial Neural Network (ANN) and Support Vector Machines (SVM) on Stock Forecasting

[...]

Akshit Kurani1, Pavan Doshi1, Aarya Vakharia1, Manan Shah2•
Indus University1, Pandit Deendayal Petroleum University2
02 Jun 2021-Annals of Data Science
TL;DR: Conclusively SVM and ANN played prominent roles in tackling these issue to an extent and can be enhanced with their integration with other novel techniques resulting in hybrid methodologies and will lead students, researchers and financial enthusiasts to more potent approaches for Stock forecasting.
Abstract: From exchanging budgetary instruments to tracking individual spending plans to detail a business's profit, money-related organisations utilise computational innovation day by day. Here in this paper, we focus on the significance of innovation in accounts such as financial risk management and stock prediction. We discuss two significant algorithms that have a notable role in stock forecasting. Artificial Neural Networks (ANN), as absenteeism of some data points, does not hamper the network functioning. Secondly, Support Vector Machines (SVM) has several features, and due to simple decision boundaries, it avoids over-fitting. The paper first looks at the different technologies applied in stock market prediction. It examines how sentimental analysis, decision trees, moving average algorithm, and data mining is applied in various stock prediction scenarios. The paper covers the recent past studies to explore the concepts and methodologies through which ANN's and SVM's have been used. Additionally, the paper incorporates significant aspects of novel methods and technologies in which ANN as a hybrid model like ANN-MLP, GARCH-MLP, a combination of the Backpropagation algorithm and Multilayer Feed-forward network, yields better results. Simultaneously, SVM's have been successfully applied in stock prediction, giving an accuracy of about 60%–70% for simple SVM, which is further improved by combining methods like Random Forest, Genetic Algorithm more accurate outcomes. Further, we present our thoughts on where SVM's and ANN's stand as prediction algorithms and challenges like the time constraint, current scenarios, data limitation, and cold start problems were raised. Conclusively SVM and ANN played prominent roles in tackling these issue to an extent and can further be enhanced with their integration with other novel techniques resulting in hybrid methodologies. It will lead students, researchers and financial enthusiasts to more potent approaches for Stock forecasting.

381 citations

Journal Article•10.1016/J.ASEJ.2020.11.011•
Extreme gradient boosting (Xgboost) model to predict the groundwater levels in Selangor Malaysia

[...]

Ahmedbahaaaldin Ibrahem Ahmed Osman1, Ali Najah Ahmed1, Ming Fai Chow1, Yuk Feng Huang2, Ahmed El-Shafie3, Ahmed El-Shafie4 •
Universiti Tenaga Nasional1, Universiti Tunku Abdul Rahman2, University of Malaya3, United Arab Emirates University4
22 Jan 2021-Ain Shams Engineering Journal
TL;DR: The proposed Xgboost model outperformed both the Artificial Neural Network and Support Vector Regression models for all different input combinations and serves as a great benchmark for future groundwater levels prediction using Xg Boost algorithm.

372 citations

Journal Article•10.3390/S21062222•
MRI-Based Brain Tumor Classification Using Ensemble of Deep Features and Machine Learning Classifiers

[...]

Jaeyong Kang1, Zahid Ullah1, Jeonghwan Gwak•
Korea National University of Transportation1
22 Mar 2021-Sensors
TL;DR: In this paper, the authors proposed a method for brain tumor classification using an ensemble of deep features and machine learning classifiers, where the top three deep features which perform well on several machine-learning classifiers are selected and concatenated as an ensemble-of-deep features which is then fed into several machine learning classes to predict the final output.
Abstract: Brain tumor classification plays an important role in clinical diagnosis and effective treatment. In this work, we propose a method for brain tumor classification using an ensemble of deep features and machine learning classifiers. In our proposed framework, we adopt the concept of transfer learning and uses several pre-trained deep convolutional neural networks to extract deep features from brain magnetic resonance (MR) images. The extracted deep features are then evaluated by several machine learning classifiers. The top three deep features which perform well on several machine learning classifiers are selected and concatenated as an ensemble of deep features which is then fed into several machine learning classifiers to predict the final output. To evaluate the different kinds of pre-trained models as a deep feature extractor, machine learning classifiers, and the effectiveness of an ensemble of deep feature for brain tumor classification, we use three different brain magnetic resonance imaging (MRI) datasets that are openly accessible from the web. Experimental results demonstrate that an ensemble of deep features can help improving performance significantly, and in most cases, support vector machine (SVM) with radial basis function (RBF) kernel outperforms other machine learning classifiers, especially for large datasets.

372 citations

Journal Article•10.1016/J.PETROL.2020.108182•
Application of supervised machine learning paradigms in the prediction of petroleum reservoir properties: Comparative analysis of ANN and SVM models

[...]

Daniel Asante Otchere1, Tarek Omar Arbi Ganat1, Raoof Gholami2, Syahrir Ridha1•
Universiti Teknologi Petronas1, University of Stavanger2
01 May 2021-Journal of Petroleum Science and Engineering
TL;DR: This review focuses on the most widely used machine learning algorithm employed in the petroleum industry, the Artificial Neural Network (ANN) with different shallow models used in reservoir characterisation, where in most cases based on this review it outperformed the ANN.

365 citations

Journal Article•10.1186/S13321-020-00479-8•
Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models

[...]

Dejun Jiang1, Zhenxing Wu1, Chang-Yu Hsieh2, Guangyong Chen, Ben Liao2, Zhe Wang1, Chao Shen1, Dong-Sheng Cao3, Jian Wu1, Tingjun Hou1 •
Zhejiang University1, Tencent2, Central South University3
17 Feb 2021-Journal of Cheminformatics
TL;DR: In this paper, the predictive capacity and computational efficiency of the prediction models developed by eight machine learning (ML) algorithms, including four descriptor-based models (SVM, XGBoost, RF and DNN) and four graph-based methods (GCN, GAT, MPNN and Attentive FP), were extensively tested and compared.
Abstract: Graph neural networks (GNN) has been considered as an attractive modelling method for molecular property prediction, and numerous studies have shown that GNN could yield more promising results than traditional descriptor-based methods. In this study, based on 11 public datasets covering various property endpoints, the predictive capacity and computational efficiency of the prediction models developed by eight machine learning (ML) algorithms, including four descriptor-based models (SVM, XGBoost, RF and DNN) and four graph-based models (GCN, GAT, MPNN and Attentive FP), were extensively tested and compared. The results demonstrate that on average the descriptor-based models outperform the graph-based models in terms of prediction accuracy and computational efficiency. SVM generally achieves the best predictions for the regression tasks. Both RF and XGBoost can achieve reliable predictions for the classification tasks, and some of the graph-based models, such as Attentive FP and GCN, can yield outstanding performance for a fraction of larger or multi-task datasets. In terms of computational cost, XGBoost and RF are the two most efficient algorithms and only need a few seconds to train a model even for a large dataset. The model interpretations by the SHAP method can effectively explore the established domain knowledge for the descriptor-based models. Finally, we explored use of these models for virtual screening (VS) towards HIV and demonstrated that different ML algorithms offer diverse VS profiles. All in all, we believe that the off-the-shelf descriptor-based models still can be directly employed to accurately predict various chemical endpoints with excellent computability and interpretability.
Journal Article•10.1109/ACCESS.2021.3064084•
Improving the Prediction of Heart Failure Patients’ Survival Using SMOTE and Effective Data Mining Techniques

[...]

Abid Ishaq1, Saima Sadiq1, Muhammad Umer1, Saleem Ullah1, Seyedali Mirjalili, Vaibhav Rupapara2, Michele Nappi3 •
University of Engineering and Technology, Lahore1, Florida International University2, University of Salerno3
04 Mar 2021-IEEE Access
TL;DR: In this paper, the authors analyzed the heart failure survivors from the dataset of 299 patients admitted in hospital and found significant features and effective data mining techniques that can boost the accuracy of cardiovascular patient's survivor prediction.
Abstract: Cardiovascular disease is a substantial cause of mortality and morbidity in the world. In clinical data analytics, it is a great challenge to predict heart disease survivor. Data mining transforms huge amounts of raw data generated by the health industry into useful information that can help in making informed decisions. Various studies proved that significant features play a key role in improving performance of machine learning models. This study analyzes the heart failure survivors from the dataset of 299 patients admitted in hospital. The aim is to find significant features and effective data mining techniques that can boost the accuracy of cardiovascular patient’s survivor prediction. To predict patient’s survival, this study employs nine classification models: Decision Tree (DT), Adaptive boosting classifier (AdaBoost), Logistic Regression (LR), Stochastic Gradient classifier (SGD), Random Forest (RF), Gradient Boosting classifier (GBM), Extra Tree Classifier (ETC), Gaussian Naive Bayes classifier (G-NB) and Support Vector Machine (SVM). The imbalance class problem is handled by Synthetic Minority Oversampling Technique (SMOTE). Furthermore, machine learning models are trained on the highest ranked features selected by RF. The results are compared with those provided by machine learning algorithms using full set of features. Experimental results demonstrate that ETC outperforms other models and achieves 0.9262 accuracy value with SMOTE in prediction of heart patient’s survival.
Journal Article•10.1007/S41870-017-0080-1•
Survey on SVM and their application in image classification

[...]

Mayank Arya Chandra1, S. S. Bedi1•
M. J. P. Rohilkhand University1
01 Oct 2021-International Journal of Information Technology
TL;DR: The different computational model of SVM and key process for the SVM system development are reviewed and a survey on their applications for image classification is provided.
Abstract: Life of any living being is impossible if it does not have the ability to differentiate between various things, objects, smell, taste, colors, etc. Human being is a good ability to classify the object easily such as different human face, images. This is time of the machine so we want that machine can do all the work like as a human, this is part of machine learning. Here this paper discusses the some important technique for the image classification. What are the techniques through which a machine can learn for the image classification task as well as perform the classification task with efficiently. The most known technique to learn a machine is SVM. Support Vector machine (SVM) has evolved as an efficient paradigm for classification. SVM has a strongest mathematical model for classification and regression. This powerful mathematical foundation gives a new direction for further research in the vast field of classification and regression. Over the past few decades, various improvements to SVM has appeared, such as twin SVM, Lagrangian SVM, Quantum Support vector machine, least square support vector machine, etc., which will be further discussed in the paper, led to the creation of a new approach for better classification accuracy. For improving the accuracy as well as performance of SVM, we must aware of how a kernel function should be selected and what are the different approaches for parameter selection. This paper reviews the different computational model of SVM and key process for the SVM system development. Furthermore provides survey on their applications for image classification.
Journal Article•10.3390/S21020446•
An Experimental Analysis of Attack Classification Using Machine Learning in IoT Networks.

[...]

Andrew Churcher1, Rehmat Ullah2, Jawad Ahmad1, Sadaqat Ur Rehman3, Fawad Masood4, Mandar Gogate1, Fehaid Alqahtani5, Boubakr Nour6, William J Buchanan1 •
Edinburgh Napier University1, Queen's University Belfast2, Namal College3, Yangzhou University4, United States Naval Academy5, Beijing Institute of Technology6
10 Jan 2021-Sensors
TL;DR: In this paper, the authors compared several machine learning (ML) methods such as k-nearest neighbor (KNN), support vector machine (SVM), decision tree (DT), naive Bayes (NB), random forest (RF), artificial neural network (ANN), and logistic regression (LR) for both binary and multi-class classification on Bot-IoT dataset.
Abstract: In recent years, there has been a massive increase in the amount of Internet of Things (IoT) devices as well as the data generated by such devices. The participating devices in IoT networks can be problematic due to their resource-constrained nature, and integrating security on these devices is often overlooked. This has resulted in attackers having an increased incentive to target IoT devices. As the number of attacks possible on a network increases, it becomes more difficult for traditional intrusion detection systems (IDS) to cope with these attacks efficiently. In this paper, we highlight several machine learning (ML) methods such as k-nearest neighbour (KNN), support vector machine (SVM), decision tree (DT), naive Bayes (NB), random forest (RF), artificial neural network (ANN), and logistic regression (LR) that can be used in IDS. In this work, ML algorithms are compared for both binary and multi-class classification on Bot-IoT dataset. Based on several parameters such as accuracy, precision, recall, F1 score, and log loss, we experimentally compared the aforementioned ML algorithms. In the case of HTTP distributed denial-of-service (DDoS) attack, the accuracy of RF is 99%. Furthermore, other simulation results-based precision, recall, F1 score, and log loss metric reveal that RF outperforms on all types of attacks in binary classification. However, in multi-class classification, KNN outperforms other ML algorithms with an accuracy of 99%, which is 4% higher than RF.
Journal Article•10.1016/J.IJCCE.2021.01.001•
An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier

[...]

Saloni Kumari1, Deepika Kumar1, Mamta Mittal•
Bharati Vidyapeeth's College of Engineering1
1 Jun 2021
TL;DR: The proposed ensemble soft voting classifier gives binary classification and uses the ensemble of three machine learning algorithms viz. random forest, logistic regression, and Naive Bayes for the classification.
Abstract: Diabetes is a dreadful disease identified by escalated levels of glucose in the blood Machine learning algorithms help in identification and prediction of diabetes at an early stage The main objective of this study is to predict diabetes mellitus with better accuracy using an ensemble of machine learning algorithms The Pima Indians Diabetes dataset has been considered for experimentation, which gathers details of patients with and without having diabetes The proposed ensemble soft voting classifier gives binary classification and uses the ensemble of three machine learning algorithms viz random forest, logistic regression, and Naive Bayes for the classification Empirical evaluation of the proposed methodology has been conducted with state-of-the-art methodologies and base classifiers such as AdaBoost, Logistic Regression,Support Vector machine, Random forest, Naive Bayes, Bagging, GradientBoost, XGBoost, CatBoost by taking accuracy, precision, recall, F1-score as the evaluation criteria The proposed ensemble approach gives the highest accuracy, precision, recall, and F1_score value with 7904%, 7348%, 7145% and 806% respectively on the PIMA diabetes dataset Further, the efficiency of the proposed methodology has also been compared and analysed with breast cancer dataset The proposed ensemble soft voting classifier has given 9702% accuracy on the breast cancer dataset
Journal Article•10.1109/ACCESS.2021.3056614•
Benchmarking of Machine Learning for Anomaly Based Intrusion Detection Systems in the CICIDS2017 Dataset

[...]

Ziadoon Kamil Maseer1, Robiah Yusof1, Nazrulazhar Bahaman1, Salama A. Mostafa2, Cik Feresa Mohd Foozy2 •
Universiti Teknikal Malaysia Melaka1, Universiti Tun Hussein Onn Malaysia2
03 Feb 2021-IEEE Access
TL;DR: 10 popular supervised and unsupervised ML algorithms for identifying effective and efficient ML–AIDS of networks and computers are applied and the true positive and negative rates, accuracy, precision, recall, and F-Score of 31 ML-AIDS models are evaluated.
Abstract: An intrusion detection system (IDS) is an important protection instrument for detecting complex network attacks Various machine learning (ML) or deep learning (DL) algorithms have been proposed for implementing anomaly-based IDS (AIDS) Our review of the AIDS literature identifies some issues in related work, including the randomness of the selected algorithms, parameters, and testing criteria, the application of old datasets, or shallow analyses and validation of the results This paper comprehensively reviews previous studies on AIDS by using a set of criteria with different datasets and types of attacks to set benchmarking outcomes that can reveal the suitable AIDS algorithms, parameters, and testing criteria Specifically, this paper applies 10 popular supervised and unsupervised ML algorithms for identifying effective and efficient ML–AIDS of networks and computers These supervised ML algorithms include the artificial neural network (ANN), decision tree (DT), k-nearest neighbor (k-NN), naive Bayes (NB), random forest (RF), support vector machine (SVM), and convolutional neural network (CNN) algorithms, whereas the unsupervised ML algorithms include the expectation-maximization (EM), k-means, and self-organizing maps (SOM) algorithms Several models of these algorithms are introduced, and the turning and training parameters of each algorithm are examined to achieve an optimal classifier evaluation Unlike previous studies, this study evaluates the performance of AIDS by measuring the true positive and negative rates, accuracy, precision, recall, and F-Score of 31 ML-AIDS models The training and testing time for ML-AIDS models are also considered in measuring their performance efficiency given that time complexity is an important factor in AIDSs The ML-AIDS models are tested by using a recent and highly unbalanced multiclass CICIDS2017 dataset that involves real-world network attacks In general, the k-NN-AIDS, DT-AIDS, and NB-AIDS models obtain the best results and show a greater capability in detecting web attacks compared with other models that demonstrate irregular and inferior results
Journal Article•10.1016/J.ICTE.2021.02.004•
A comparison of machine learning algorithms for diabetes prediction

[...]

Jobeda J Khanam1, Simon Y. Foo1•
Florida A&M University – Florida State University College of Engineering1
20 Feb 2021-ICT Express
TL;DR: Data mining, machine learning (ML) algorithms, and Neural Network (NN) methods are used in diabetes prediction in this research, which found that the model with Logistic Regression and Support Vector Machine (SVM) works well on diabetes prediction.
Journal Article•10.1016/J.CONBUILDMAT.2020.120950•
Efficient machine learning models for prediction of concrete strengths

[...]

Hoang X. Nguyen1, Thanh Vu2, Thuc P. Vo3, Huu-Tai Thai4•
Northumbria University1, Oracle Corporation2, La Trobe University3, University of Melbourne4
10 Jan 2021-Construction and Building Materials
TL;DR: The comparative studies reveal that, for this particular prediction problem, the trained models based on GBR and XGBoost perform better than those of SVR and MLP.
Journal Article•10.1007/S42979-020-00394-7•
Supervised Machine Learning Models for Prediction of COVID-19 Infection using Epidemiology Dataset

[...]

L. J. Muhammad, Ebrahem A. Algehyne1, Sani Sharif Usman, Abdulkadir Ahmad, Chinmay Chakraborty2, Ismail A. Mohammed3 •
University of Tabuk1, Birla Institute of Technology and Science2, Bukar Abba Ibrahim University3
1 Feb 2021
TL;DR: Supervised machine learning models for COVID-19 infection were developed in this work with learning algorithms which include logistic regression, decision tree, support vector machine, naive Bayes, and artificial neutral network using epidemiology labeled dataset for positive and negative CO VID-19 cases of Mexico.
Abstract: COVID-19 or 2019-nCoV is no longer pandemic but rather endemic, with more than 651,247 people around world having lost their lives after contracting the disease. Currently, there is no specific treatment or cure for COVID-19, and thus living with the disease and its symptoms is inevitable. This reality has placed a massive burden on limited healthcare systems worldwide especially in the developing nations. Although neither an effective, clinically proven antiviral agents' strategy nor an approved vaccine exist to eradicate the COVID-19 pandemic, there are alternatives that may reduce the huge burden on not only limited healthcare systems but also the economic sector; the most promising include harnessing non-clinical techniques such as machine learning, data mining, deep learning and other artificial intelligence. These alternatives would facilitate diagnosis and prognosis for 2019-nCoV pandemic patients. Supervised machine learning models for COVID-19 infection were developed in this work with learning algorithms which include logistic regression, decision tree, support vector machine, naive Bayes, and artificial neutral network using epidemiology labeled dataset for positive and negative COVID-19 cases of Mexico. The correlation coefficient analysis between various dependent and independent features was carried out to determine a strength relationship between each dependent feature and independent feature of the dataset prior to developing the models. The 80% of the training dataset were used for training the models while the remaining 20% were used for testing the models. The result of the performance evaluation of the models showed that decision tree model has the highest accuracy of 94.99% while the Support Vector Machine Model has the highest sensitivity of 93.34% and Naive Bayes Model has the highest specificity of 94.30%.
Journal Article•10.1016/J.COMPBIOMED.2021.104572•
COVID-19 cough classification using machine learning and global smartphone recordings.

[...]

Madhurananda Pahar1, Marisa Klopper1, Robin M. Warren1, Thomas Niesler1•
Stellenbosch University1
17 Jun 2021-Computers in Biology and Medicine
TL;DR: Although all classifiers were able to identify COVID-19 coughs, the best performance was exhibited by the Resnet50 classifier, which was best able to discriminate between the CO VID-19 positive and the healthy coughs with an area under the ROC curve (AUC) of 0.98.
Journal Article•10.1109/JAS.2020.1003393•
Automatic detection of COVID-19 infection using chest X-ray images through transfer learning

[...]

Elene Firmeza Ohata1, Gabriel M. Bezerra, João Victor Souza das Chagas, Aloisio Vieira Lira Neto2, Adriano Bessa Albuquerque2, Victor Hugo C. de Albuquerque2, Pedro Pedrosa Rebouças Filho1 •
Federal University of Ceará1, University of Fortaleza2
01 Jan 2021-IEEE/CAA Journal of Automatica Sinica
TL;DR: This work proposes an automatic detection method for COVID-19 infection based on chest X-ray images using different architectures of convolutional neural networks trained on ImageNet, and adapt them to behave as feature extractors for the X-Ray images.
Abstract: The new coronavirus ( COVID-19 ) , declared by the World Health Organization as a pandemic, has infected more than 1 million people and killed more than 50 thousand. An infection caused by COVID-19 can develop into pneumonia, which can be detected by a chest X-ray exam and should be treated appropriately. In this work, we propose an automatic detection method for COVID-19 infection based on chest X-ray images. The datasets constructed for this study are composed of 194 X-ray images of patients diagnosed with coronavirus and 194 X-ray images of healthy patients. Since few images of patients with COVID-19 are publicly available, we apply the concept of transfer learning for this task. We use different architectures of convolutional neural networks ( CNNs ) trained on ImageNet, and adapt them to behave as feature extractors for the X-ray images. Then, the CNNs are combined with consolidated machine learning methods, such as k-Nearest Neighbor, Bayes, Random Forest, multilayer perceptron ( MLP ) , and support vector machine ( SVM ) . The results show that, for one of the datasets, the extractor-classifier pair with the best performance is the MobileNet architecture with the SVM classifier using a linear kernel, which achieves an accuracy and an F1-score of 98.5 & . For the other dataset, the best pair is DenseNet201 with MLP, achieving an accuracy and an F1-score of 95.6 & . Thus, the proposed approach demonstrates efficiency in detecting COVID-19 in X-ray images.
Journal Article•10.1109/TIE.2020.2994868•
Fault Diagnosis of an Autonomous Vehicle With an Improved SVM Algorithm Subject to Unbalanced Datasets

[...]

Qian Shi1, Hui Zhang1•
Beihang University1
01 Jul 2021-IEEE Transactions on Industrial Electronics
TL;DR: Experimental results and comparisons of an automated vehicle illustrate the effectiveness of the proposed algorithm on the steering actuator fault diagnosis and show that the proposed algorithms has superiority on the classification over existing methods.
Abstract: Safety is one of the key requirements for automated vehicles and fault diagnosis is an effective technique to enhance the vehicle safety. The model-based fault diagnosis method models the fault into the system model and estimates the faults by observer. In this article, to avoid the complexity of designing observer, we investigate the problem of steering actuator fault diagnosis for automated vehicles based on the approach of model-based support vector machine (SVM) classification. The system model is utilized to generate the residual signal as the training data and the data-based algorithm of the SVM classification is employed to diagnose the fault. Due to the phenomena of data unbalance induced poor performance of the data-driven method, an undersampling procedure with the approach of linear discriminant analysis and a threshold adjustment using the algorithm of grey wolf optimizer are proposed to modify and improve the performance of classification and fault diagnosis. Various comparisons are carried out based on widely used datasets. The comparison results show that the proposed algorithm has superiority on the classification over existing methods. Experimental results and comparisons of an automated vehicle illustrate the effectiveness of the proposed algorithm on the steering actuator fault diagnosis.
Journal Article•10.32604/IASC.2021.018983•
Modelling Supply Chain Information Collaboration Empowered with Machine Learning Technique

[...]

Naeem Ali, Alia Ahmed, Leena Anum, Taher M. Ghazal, Sagheer Abbas, Muhammad Adnan Khan, Haitham M. Alzoubi, Munir Ahmad 
01 Jul 2021-Intelligent Automation and Soft Computing
Journal Article•10.1016/J.COMNET.2021.107840•
Machine learning methods for cyber security intrusion detection: Datasets and comparative study

[...]

Ilhan Firat Kilincer1, Fatih Ertam1, Abdulkadir Sengur1•
Fırat University1
07 Apr 2021-Computer Networks
TL;DR: In this paper, the authors reviewed the literature studies using CSE-CIC IDS-2018, UNSW-NB15, ISCX-2012, NSL-KDD and CIDDS-001 data sets, which are widely used to develop IDS systems.
Journal Article•10.1016/J.ENGAPPAI.2020.104015•
Optimization of support vector machine through the use of metaheuristic algorithms in forecasting TBM advance rate

[...]

Jian Zhou1, Yingui Qiu1, Shuangli Zhu1, Danial Jahed Armaghani2, Chuanqi Li1, Hoang Nguyen3, Saffet Yagiz4 •
Central South University1, Duy Tan University2, Hanoi University of Mining and Geology3, Nazarbayev University4
01 Jan 2021-Engineering Applications of Artificial Intelligence
TL;DR: Modeling results revealed that the MFO algorithm can capture better hyper-parameters of the SVM model in predicting TBM AR among all three hybrid models, confirming that this hybrid S VM model is a powerful and applicable technique addressing problems related to TBM performance with a high level of accuracy.
Journal Article•10.3390/APP11020796•
Impact of Dataset Size on Classification Performance: An Empirical Evaluation in the Medical Domain

[...]

Alhanoof Althnian, Duaa H. AlSaeed, Heyam H. Al-Baity, Amani Khalaf Samha, Alanoud Bin Dris, Najla Alzakari, Afnan Abou Elwafa, Heba Kurdi 
15 Jan 2021-Applied Sciences
TL;DR: It is found that the most robust model for limited medical data is AB and NB, followed by SVM, and then RF and NN, while the least robust model is DT.
Abstract: Dataset size is considered a major concern in the medical domain, where lack of data is a common occurrence. This study aims to investigate the impact of dataset size on the overall performance of supervised classification models. We examined the performance of six widely-used models in the medical field, including support vector machine (SVM), neural networks (NN), C4.5 decision tree (DT), random forest (RF), adaboost (AB), and naive Bayes (NB) on eighteen small medical UCI datasets. We further implemented three dataset size reduction scenarios on two large datasets and analyze the performance of the models when trained on each resulting dataset with respect to accuracy, precision, recall, f-score, specificity, and area under the ROC curve (AUC). Our results indicated that the overall performance of classifiers depend on how much a dataset represents the original distribution rather than its size. Moreover, we found that the most robust model for limited medical data is AB and NB, followed by SVM, and then RF and NN, while the least robust model is DT. Furthermore, an interesting observation is that a robust machine learning model to limited dataset does not necessary imply that it provides the best performance compared to other models.
Journal Article•10.1016/J.INPA.2020.09.006•
Insect classification and detection in field crops using modern machine learning techniques

[...]

Thenmozhi Kasinathan1, Dakshayani Singaraju1, Srinivasulu Reddy Uyyala1•
National Institute of Technology, Tiruchirappalli1
01 Sep 2021-Information Processing in Agriculture
TL;DR: This paper presents the insect pest detection algorithm that consists of foreground extraction and contour identification to detect the insects for Wang, Xie, Deng, and IP102 datasets in a highly complex background and exhibits considerable improvement in classification accuracy, computation time performance, and state-of-the-art classification algorithms.
Journal Article•10.1109/TTE.2020.3017090•
Battery Health Prediction Using Fusion-Based Feature Selection and Machine Learning

[...]

Xiaosong Hu1, Yunhong Che1, Xianke Lin2, Simona Onori3•
Chongqing University1, University of Ontario Institute of Technology2, Stanford University3
1 Jun 2021
TL;DR: A comprehensive study of the data-driven SOH estimation methods is conducted and the combination of the fusion-based selection method and GPR has an overall superior estimation performance in terms of both accuracy and computational efficiency.
Abstract: State of health (SOH) is a key parameter to assess lithium-ion battery feasibility for secondary usage applications SOH estimation based on machine learning has attracted great attention in recent years and holds potentials for battery informatization and cloud battery management techniques In this article, a comprehensive study of the data-driven SOH estimation methods is conducted A new classification for health indicators (HIs) is proposed where the HIs are divided into the measured variables and calculated variables To illustrate the significance of data preprocessing, four noise reduction methods are assessed in the HIs extraction process; different feature selection methods, including filter-based method, wrapper-based method, and fusion-based method, are applied to select HIs subsets The four widely used machine learning algorithms, including artificial neural network, support vector machine, relevance vector machine, and Gaussian process regression (GPR), are applied and compared In order to evaluate the estimation performance in potential real usages under future big data era, the three HIs selection methods and four machine learning methods are evaluated using three public data sets and two estimation strategies The results show that the combination of the fusion-based selection method and GPR has an overall superior estimation performance in terms of both accuracy and computational efficiency
Journal Article•10.1016/J.GSF.2020.02.012•
Landslide identification using machine learning

[...]

Haojie Wang1, Li Min Zhang1, Kesheng Yin1, Hongyu Luo1, Jinhui Li2 •
Hong Kong University of Science and Technology1, Harbin Institute of Technology2
01 Jan 2021-Geoscience frontiers
TL;DR: By using machine learning and deep learning techniques, the proposed landslide identification method shows outstanding robustness and great potential in tackling the landslide identification problem.
Abstract: Landslide identification is critical for risk assessment and mitigation. This paper proposes a novel machine-learning and deep-learning method to identify natural-terrain landslides using integrated geodatabases. First, landslide-related data are compiled, including topographic data, geological data and rainfall-related data. Then, three integrated geodatabases are established; namely, Recent Landslide Database (RecLD), Relict Landslide Database (RelLD) and Joint Landslide Database (JLD). After that, five machine learning and deep learning algorithms, including logistic regression (LR), support vector machine (SVM), random forest (RF), boosting methods and convolutional neural network (CNN), are utilized and evaluated on each database. A case study in Lantau, Hong Kong, is conducted to demonstrate the application of the proposed method. From the results of the case study, CNN achieves an identification accuracy of 92.5% on RecLD, and outperforms other algorithms due to its strengths in feature extraction and multi dimensional data processing. Boosting methods come second in terms of accuracy, followed by RF, LR and SVM. By using machine learning and deep learning techniques, the proposed landslide identification method shows outstanding robustness and great potential in tackling the landslide identification problem.
Book Chapter•10.1007/978-3-030-57077-4_11•
Pattern Recognition and Machine Learning

[...]

Bharadwaj, Kolla Bhanu Prakash, G. R. Kanagachidambaresan1•
Techno India1
1 Jan 2021
TL;DR: Support vector machine (SVM) is one of the most widely used classification algorithms as discussed by the authors, it uses supervised learning method (Aizerman et al., Auto Remote Cont 25:821-837, 1964) for training.
Abstract: Support vector machine (SVM) is one of the most widely used classification algorithms. It uses supervised learning method (Aizerman et al., Auto Remote Cont 25:821–837, 1964) for training. The SVM classifier is mostly used in multi-classification problems. SVM differs from the traditional classifiers as it uses “decision boundary,” which separates the classes. The decision boundary maximizes distances of data points belongs to different classes .in this; decision boundary is the optimum that is Most optimal (Baron and Ensley, Opportunity recognition as the detection of meaningful patterns: evidence from the prototypes of novice and experienced entrepreneurs. Manuscript under review, 2005) decision boundary has maximum margin. The data points which are nearer to the boundary are called support vectors. The most important thing in SVM is its hyper plane, where for a N-dimensional space it is an (N-1)-dimensional subspace. To better understand, the hyper plane is just a line in one dimension for a two-dimensional space. It is a two-dimensional plane that separates the classes for a three-dimensional space.
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve