Top 6535 papers published in the topic of Support vector machine in 2020

Showing papers on "Support vector machine published in 2020"

Journal Article•10.1016/J.NEUCOM.2019.10.118•

A comprehensive survey on support vector machine classification: Applications, challenges and trends

[...]

Jair Cervantes¹, Farid García-Lamont¹, Lisbeth Rodríguez-Mazahua, Asdrúbal López¹•Institutions (1)

National Autonomous University of Mexico¹

30 Sep 2020-Neurocomputing

TL;DR: A brief introduction of SVMs is provided, many applications are described and challenges and trends are summarized, especially in the some fields.

...read moreread less

1,566 citations

Journal Article•10.3390/RS12071135•

Land-Use Land-Cover Classification by Machine Learning Classifiers for Satellite Observations—A Review

[...]

Swapan Talukdar¹, Pankaj Singha¹, Susanta Mahato¹, Shahfahad², Swades Pal¹, Yuei-An Liou, Atiqur Rahman² - Show less +3 more•Institutions (2)

University of Gour Banga¹, Jamia Millia Islamia²

01 Apr 2020-Remote Sensing

TL;DR: The RF algorithm is the best machine-learning LULC classifier, among the six examined algorithms although it is necessary to further test the RF algorithm in different morphoclimatic conditions in the future.

...read moreread less

Abstract: Rapid and uncontrolled population growth along with economic and industrial development, especially in developing countries during the late twentieth and early twenty-first centuries, have increased the rate of land-use/land-cover (LULC) change many times. Since quantitative assessment of changes in LULC is one of the most efficient means to understand and manage the land transformation, there is a need to examine the accuracy of different algorithms for LULC mapping in order to identify the best classifier for further applications of earth observations. In this article, six machine-learning algorithms, namely random forest (RF), support vector machine (SVM), artificial neural network (ANN), fuzzy adaptive resonance theory-supervised predictive mapping (Fuzzy ARTMAP), spectral angle mapper (SAM) and Mahalanobis distance (MD) were examined. Accuracy assessment was performed by using Kappa coefficient, receiver operational curve (RoC), index-based validation and root mean square error (RMSE). Results of Kappa coefficient show that all the classifiers have a similar accuracy level with minor variation, but the RF algorithm has the highest accuracy of 0.89 and the MD algorithm (parametric classifier) has the least accuracy of 0.82. In addition, the index-based LULC and visual cross-validation show that the RF algorithm (correlations between RF and normalised differentiation water index, normalised differentiation vegetation index and normalised differentiation built-up index are 0.96, 0.99 and 1, respectively, at 0.05 level of significance) has the highest accuracy level in comparison to the other classifiers adopted. Findings from the literature also proved that ANN and RF algorithms are the best LULC classifiers, although a non-parametric classifier like SAM (Kappa coefficient 0.84; area under curve (AUC) 0.85) has a better and consistent accuracy level than the other machine-learning algorithms. Finally, this review concludes that the RF algorithm is the best machine-learning LULC classifier, among the six examined algorithms although it is necessary to further test the RF algorithm in different morphoclimatic conditions in the future.

...read moreread less

858 citations

Journal Article•10.1109/ACCESS.2020.2980942•

Analysis of Dimensionality Reduction Techniques on Big Data

[...]

G. Thippa Reddy¹, M. Praveen Kumar Reddy¹, Kuruva Lakshmanna¹, Rajesh Kaluri¹, Dharmendra Singh Rajput¹, Gautam Srivastava², Thar Baker³ - Show less +3 more•Institutions (3)

VIT University¹, Brandon University², Liverpool John Moores University³

16 Mar 2020-IEEE Access

TL;DR: Two of the prominent dimensionality reduction techniques, Linear Discriminant Analysis (LDA) and Principal Component Analysis (PCA) are investigated on four popular Machine Learning (ML) algorithms using publicly available Cardiotocography dataset from University of California and Irvine Machine Learning Repository to prove that PCA outperforms LDA in all the measures.

...read moreread less

Abstract: Due to digitization, a huge volume of data is being generated across several sectors such as healthcare, production, sales, IoT devices, Web, organizations. Machine learning algorithms are used to uncover patterns among the attributes of this data. Hence, they can be used to make predictions that can be used by medical practitioners and people at managerial level to make executive decisions. Not all the attributes in the datasets generated are important for training the machine learning algorithms. Some attributes might be irrelevant and some might not affect the outcome of the prediction. Ignoring or removing these irrelevant or less important attributes reduces the burden on machine learning algorithms. In this work two of the prominent dimensionality reduction techniques, Linear Discriminant Analysis (LDA) and Principal Component Analysis (PCA) are investigated on four popular Machine Learning (ML) algorithms, Decision Tree Induction, Support Vector Machine (SVM), Naive Bayes Classifier and Random Forest Classifier using publicly available Cardiotocography (CTG) dataset from University of California and Irvine Machine Learning Repository. The experimentation results prove that PCA outperforms LDA in all the measures. Also, the performance of the classifiers, Decision Tree, Random Forest examined is not affected much by using PCA and LDA.To further analyze the performance of PCA and LDA the eperimentation is carried out on Diabetic Retinopathy (DR) and Intrusion Detection System (IDS) datasets. Experimentation results prove that ML algorithms with PCA produce better results when dimensionality of the datasets is high. When dimensionality of datasets is low it is observed that the ML algorithms without dimensionality reduction yields better results.

...read moreread less

741 citations

Journal Article•10.1109/JSTARS.2020.3026724•

Support Vector Machine Versus Random Forest for Remote Sensing Image Classification: A Meta-Analysis and Systematic Review

[...]

Mohammadreza Sheykhmousa, Masoud Mahdianpari¹, Hamid Ghanbari², Fariba Mohammadimanesh¹, Pedram Ghamisi³, Saeid Homayouni⁴ - Show less +2 more•Institutions (4)

St. John's University¹, Laval University², Helmholtz-Zentrum Dresden-Rossendorf³, Institut national de la recherche scientifique⁴

25 Sep 2020-IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

TL;DR: A meta-analysis of 251 peer-reviewed journal papers relevant to remote sensing image classification and a comparative analysis regarding the performances of RF and SVM classification against various parameters is applied.

...read moreread less

Abstract: Several machine-learning algorithms have been proposed for remote sensing image classification during the past two decades. Among these machine learning algorithms, Random Forest (RF) and Support Vector Machines (SVM) have drawn attention to image classification in several remote sensing applications. This article reviews RF and SVM concepts relevant to remote sensing image classification and applies a meta-analysis of 251 peer-reviewed journal papers. A database with more than 40 quantitative and qualitative fields was constructed from these reviewed papers. The meta-analysis mainly focuses on 1) the analysis regarding the general characteristics of the studies, such as geographical distribution, frequency of the papers considering time, journals, application domains, and remote sensing software packages used in the case studies, and 2) a comparative analysis regarding the performances of RF and SVM classification against various parameters, such as data type, RS applications, spatial resolution, and the number of extracted features in the feature engineering step. The challenges, recommendations, and potential directions for future research are also discussed in detail. Moreover, a summary of the results is provided to aid researchers to customize their efforts in order to achieve the most accurate results based on their thematic applications.

...read moreread less

714 citations

Journal Article•10.1186/S40537-020-00327-4•

Selecting critical features for data classification based on machine learning methods

[...]

Rung-Ching Chen¹, Christine Dewi², Christine Dewi¹, Su-Wen Huang¹, Rezzy Eko Caraka¹ - Show less +1 more•Institutions (2)

Chaoyang University of Technology¹, Satya Wacana Christian University²

23 Jul 2020-Journal of Big Data

TL;DR: This paper adopts Random Forest to select the important feature in classification and compares the result of the dataset with and without essential features selection by RF methods varImp(), Boruta, and Recursive Feature Elimination to get the best percentage accuracy and kappa.

...read moreread less

Abstract: Feature selection becomes prominent, especially in the data sets with many variables and features. It will eliminate unimportant variables and improve the accuracy as well as the performance of classification. Random Forest has emerged as a quite useful algorithm that can handle the feature selection issue even with a higher number of variables. In this paper, we use three popular datasets with a higher number of variables (Bank Marketing, Car Evaluation Database, Human Activity Recognition Using Smartphones) to conduct the experiment. There are four main reasons why feature selection is essential. First, to simplify the model by reducing the number of parameters, next to decrease the training time, to reduce overfilling by enhancing generalization, and to avoid the curse of dimensionality. Besides, we evaluate and compare each accuracy and performance of the classification model, such as Random Forest (RF), Support Vector Machines (SVM), K-Nearest Neighbors (KNN), and Linear Discriminant Analysis (LDA). The highest accuracy of the model is the best classifier. Practically, this paper adopts Random Forest to select the important feature in classification. Our experiments clearly show the comparative study of the RF algorithm from different perspectives. Furthermore, we compare the result of the dataset with and without essential features selection by RF methods varImp(), Boruta, and Recursive Feature Elimination (RFE) to get the best percentage accuracy and kappa. Experimental results demonstrate that Random Forest achieves a better performance in all experiment groups.

...read moreread less

683 citations

Journal Article•10.1016/J.CHAOS.2020.110212•

Predictions for COVID-19 with Deep Learning Models of LSTM, GRU and Bi-LSTM

[...]

Farah Shahid¹, Aneela Zameer¹, Muhammad Muneeb¹•Institutions (1)

Pakistan Institute of Engineering and Applied Sciences¹

19 Aug 2020-Chaos Solitons & Fractals

TL;DR: Proposed forecast models comprising autoregressive integrated moving average (ARIMA), support vector regression (SVR), long shot term memory (LSTM), bidirectional long shortterm memory (Bi-L STM), and ARIMA are assessed for time series prediction of confirmed cases, deaths and recoveries in ten major countries affected due to COVID-19.

...read moreread less

Abstract: COVID-19, responsible of infecting billions of people and economy across the globe, requires detailed study of the trend it follows to develop adequate short-term prediction models for forecasting the number of future cases. In this perspective, it is possible to develop strategic planning in the public health system to avoid deaths as well as managing patients. In this paper, proposed forecast models comprising autoregressive integrated moving average (ARIMA), support vector regression (SVR), long shot term memory (LSTM), bidirectional long short term memory (Bi-LSTM) are assessed for time series prediction of confirmed cases, deaths and recoveries in ten major countries affected due to COVID-19. The performance of models is measured by mean absolute error, root mean square error and r2_score indices. In the majority of cases, Bi-LSTM model outperforms in terms of endorsed indices. Models ranking from good performance to the lowest in entire scenarios is Bi-LSTM, LSTM, GRU, SVR and ARIMA. Bi-LSTM generates lowest MAE and RMSE values of 0.0070 and 0.0077, respectively, for deaths in China. The best r2_score value is 0.9997 for recovered cases in China. On the basis of demonstrated robustness and enhanced prediction accuracy, Bi-LSTM can be exploited for pandemic prediction for better planning and management.

...read moreread less

649 citations

Journal Article•10.1016/J.COMPBIOMED.2020.103805•

COVID-19 detection using deep learning models to exploit Social Mimic Optimization and structured chest X-ray images using fuzzy color and stacking approaches.

[...]

Mesut Toğaçar¹, Burhan Ergen¹, Zafer Cömert•Institutions (1)

Fırat University¹

01 Jun 2020-Computers in Biology and Medicine

TL;DR: With the proposed approach in this study, it is evident that the model can efficiently contribute to the detection of COVID-19 disease.

...read moreread less

597 citations

Journal Article•10.1016/J.CONBUILDMAT.2019.117000•

Machine learning-based compressive strength prediction for concrete: An adaptive boosting approach

[...]

De-Cheng Feng¹, Liu Zhentao¹, Wang Xiaodan¹, Chen Yin¹, Chang Jiaqi¹, Wei Dongfang¹, Zhong-Ming Jiang² - Show less +3 more•Institutions (2)

Southeast University¹, Southwest University²

10 Jan 2020-Construction and Building Materials

TL;DR: An intelligent approach based on the machine learning technique is proposed for predicting the compressive strength of concrete by employing the adaptive boosting algorithm to construct a strong learner by integrating several weak learners, which can find the mapping between the input data and output data.

...read moreread less

585 citations

Book Chapter•10.1016/B978-0-12-815739-8.00006-7•

Support vector machine

[...]

Derek Pisner¹, David M. Schnyer¹•Institutions (1)

University of Texas at Austin¹

1 Jan 2020

TL;DR: This chapter explores Support Vector Machine (SVM)—a machine learning method that has become exceedingly popular for neuroimaging analysis in recent years and is reviewed for applications that involve predicting diagnosis and prognosis of brain diseases such as Alzheimer's disease, schizophrenia, and depression.

...read moreread less

Abstract: In this chapter, we explore Support Vector Machine (SVM)—a machine learning method that has become exceedingly popular for neuroimaging analysis in recent years. Because of their relative simplicity and flexibility for addressing a range of classification problems, SVMs distinctively afford balanced predictive performance, even in studies where sample sizes may be limited. In brain disorders research, SVMs are typically employed using multivoxel pattern analysis (MVPA) because their relative simplicity carries a lower risk of overfitting even using high-dimensional imaging data. More recently, SVMs have been used in the context of precision psychiatry, particularly for applications that involve predicting diagnosis and prognosis of brain diseases such as Alzheimer's disease, schizophrenia, and depression. In the last section of this chapter, we review a number of recent studies that use SVM for such applications.

...read moreread less

560 citations

Journal Article•10.1109/ACCESS.2020.3001149•

Heart Disease Identification Method Using Machine Learning Classification in E-Healthcare

[...]

Jianping Li¹, Amin Ul Haq¹, Salah Ud Din¹, Jalaluddin Khan¹, Asif Khan¹, Abdus Saboor¹ - Show less +2 more•Institutions (1)

University of Electronic Science and Technology of China¹

09 Jun 2020-IEEE Access

TL;DR: The experimental results show that the proposed feature selection algorithm (FCMIM) is feasible with classifier support vector machine for designing a high-level intelligent system to identify heart disease and it achieved good accuracy as compared to previously proposed methods.

...read moreread less

Abstract: Heart disease is one of the complex diseases and globally many people suffered from this disease. On time and efficient identification of heart disease plays a key role in healthcare, particularly in the field of cardiology. In this article, we proposed an efficient and accurate system to diagnosis heart disease and the system is based on machine learning techniques. The system is developed based on classification algorithms includes Support vector machine, Logistic regression, Artificial neural network, K-nearest neighbor, Naive bays, and Decision tree while standard features selection algorithms have been used such as Relief, Minimal redundancy maximal relevance, Least absolute shrinkage selection operator and Local learning for removing irrelevant and redundant features. We also proposed novel fast conditional mutual information feature selection algorithm to solve feature selection problem. The features selection algorithms are used for features selection to increase the classification accuracy and reduce the execution time of classification system. Furthermore, the leave one subject out cross-validation method has been used for learning the best practices of model assessment and for hyperparameter tuning. The performance measuring metrics are used for assessment of the performances of the classifiers. The performances of the classifiers have been checked on the selected features as selected by features selection algorithms. The experimental results show that the proposed feature selection algorithm (FCMIM) is feasible with classifier support vector machine for designing a high-level intelligent system to identify heart disease. The suggested diagnosis system (FCMIM-SVM) achieved good accuracy as compared to previously proposed methods. Additionally, the proposed system can easily be implemented in healthcare for the identification of heart disease.

...read moreread less

550 citations

Journal Article•10.1109/ACCESS.2020.2997311•

COVID-19 Future Forecasting Using Supervised Machine Learning Models

[...]

Furqan Rustam, Aijaz Ahmad Reshi¹, Arif Mehmood², Saleem Ullah, Byung-Won On³, Waqar Aslam², Gyu Sang Choi⁴ - Show less +3 more•Institutions (4)

Taibah University¹, Islamia University², Kunsan National University³, Yeungnam University⁴

25 May 2020-IEEE Access

TL;DR: The results prove that the ES performs best among all the used models followed by LR and LASSO which performs well in forecasting the new confirmed cases, death rate as well as recovery rate, while SVM performs poorly in all the prediction scenarios given the available dataset.

...read moreread less

Abstract: Machine learning (ML) based forecasting mechanisms have proved their significance to anticipate in perioperative outcomes to improve the decision making on the future course of actions. The ML models have long been used in many application domains which needed the identification and prioritization of adverse factors for a threat. Several prediction methods are being popularly used to handle forecasting problems. This study demonstrates the capability of ML models to forecast the number of upcoming patients affected by COVID-19 which is presently considered as a potential threat to mankind. In particular, four standard forecasting models, such as linear regression (LR), least absolute shrinkage and selection operator (LASSO), support vector machine (SVM), and exponential smoothing (ES) have been used in this study to forecast the threatening factors of COVID-19. Three types of predictions are made by each of the models, such as the number of newly infected cases, the number of deaths, and the number of recoveries in the next 10 days. The results produced by the study proves it a promising mechanism to use these methods for the current scenario of the COVID-19 pandemic. The results prove that the ES performs best among all the used models followed by LR and LASSO which performs well in forecasting the new confirmed cases, death rate as well as recovery rate, while SVM performs poorly in all the prediction scenarios given the available dataset.

...read moreread less

Proceedings Article•10.1109/ICICS49469.2020.239556•

Machine Learning with Oversampling and Undersampling Techniques: Overview Study and Experimental Results

[...]

Roweida Mohammed¹, Jumanah Rawashdeh¹, Malak Abdullah¹•Institutions (1)

Jordan University of Science and Technology¹

7 Apr 2020

TL;DR: One of the key findings of this paper is noticing that oversampling performs better than undersampling for different classifiers and obtains higher scores in different evaluation metrics.

...read moreread less

Abstract: Data imbalance in Machine Learning refers to an unequal distribution of classes within a dataset. This issue is encountered mostly in classification tasks in which the distribution of classes or labels in a given dataset is not uniform. The straightforward method to solve this problem is the resampling method by adding records to the minority class or deleting ones from the majority class. In this paper, we have experimented with the two resampling widely adopted techniques: oversampling and undersampling. In order to explore both techniques, we have chosen a public imbalanced dataset from kaggle website Santander Customer Transaction Prediction and have applied a group of well-known machine learning algorithms with different hyperparamters that give best results for both resampling techniques. One of the key findings of this paper is noticing that oversampling performs better than undersampling for different classifiers and obtains higher scores in different evaluation metrics.

...read moreread less

Book Chapter•10.1007/978-3-030-22475-2_1•

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

[...]

Mohamed Alloghani¹, Dhiya Al-Jumeily¹, Jamila Mustafina², Abir Hussain¹, Ahmed J. Aljaaf¹ - Show less +1 more•Institutions (2)

Liverpool John Moores University¹, Kazan Federal University²

1 Jan 2020

TL;DR: A systematic review of scholarly articles published between 2015 and 2018 addressing or implementing supervised and unsupervised machine learning techniques in different problem-solving paradigms revealed decision tree, support vector machine, and Naive Bayes algorithms appeared to be the most cited, discussed, and implemented supervised learners.

...read moreread less

Abstract: Machine learning is as growing as fast as concepts such as Big data and the field of data science in general. The purpose of the systematic review was to analyze scholarly articles that were published between 2015 and 2018 addressing or implementing supervised and unsupervised machine learning techniques in different problem-solving paradigms. Using the elements of PRISMA, the review process identified 84 scholarly articles that had been published in different journals. Of the 84 articles, 6 were published before 2015 despite their metadata indicating that they were published in 2015. The existence of the six articles in the final papers was attributed to errors in indexing. Nonetheless, from the reviewed papers, decision tree, support vector machine, and Naive Bayes algorithms appeared to be the most cited, discussed, and implemented supervised learners. Conversely, k-means, hierarchical clustering, and principal component analysis also emerged as the commonly used unsupervised learners. The review also revealed other commonly used algorithms that include ensembles and reinforce learners, and future systematic reviews can focus on them because of the developments that machine learning and data science is undergoing at the moment.

...read moreread less

Book Chapter•10.1016/B978-0-12-815739-8.00007-9•

Support vector regression

[...]

Fan Zhang¹, Lauren J. O'Donnell¹•Institutions (1)

Brigham and Women's Hospital¹

1 Jan 2020

TL;DR: A number of studies that have applied SVR to magnetic resonance imaging data to performance multivariate pattern regression analysis of brain disorders have been successful in revealing spatially distributed patterns across multiple brain regions in several brain disorders including schizophrenia, autism, and attention-deficit/hyperactivity disorder.

...read moreread less

Abstract: This chapter provides an overview of the support vector regression (SVR), an analytical technique to investigate the relationship between one or more predictor variables and a real-valued (continuous) dependent variable. In the first part of the chapter, we provide a description of the SVR algorithm. Unlike traditional regression methods that depend on assumptions of the model that might not be accurate (e.g., linear data distribution), SVR is a machine learning technique in which a model learns a variable's importance for characterizing the relationship between input and output. In the second part of the chapter, we review a number of studies that have applied SVR to magnetic resonance imaging data to performance multivariate pattern regression analysis of brain disorders. These studies have been successful in revealing spatially distributed patterns across multiple brain regions in several brain disorders including schizophrenia, autism, and attention-deficit/hyperactivity disorder.

...read moreread less

Journal Article•10.1016/J.ASOC.2019.105946•

Chaotic multi-swarm whale optimizer boosted support vector machine for medical diagnosis

[...]

Mingjing Wang¹, Huiling Chen²•Institutions (2)

Duy Tan University¹, Wenzhou University²

01 Mar 2020-Applied Soft Computing

TL;DR: To perform parameter optimization and feature selection simultaneously for SVM, an improved whale optimization algorithm (CMWOA), which combines chaotic and multi-swarm strategies is proposed, which significantly outperformed all the other competitors in terms of classification performance and feature subset size.

...read moreread less

Book•

Machine Learning Approach

[...]

Namita Srivastava, C. K. Verma, Rabia Aziz Musheer

28 Feb 2020

TL;DR: This book applied different combinations of feature selection / extraction methods, as a novel hybrid dimension reduction method for SVM, ANN and NB classifiers, and the obtained results are compared with other popular published dimension reduction methods for S VM, NB and ANN classifiers.

...read moreread less

Abstract: For past several years, microarray technology has attracted tremendous interest for both scientific community and industry. Recently, the applications of microarrays include gene discovery, disease diagnosis and prognosis, drug discovery, etc. High dimensional data with small sample size is the main problem that generate the application of dimension reduction in microarray data analysis. It is seen that SVM, ANN and NB have recently gained wide popularity for cancer classification problems. An efficient and reliable method of dimension reduction plays an important role to improve the performance of SVM, ANN and NB, when applied for classification of high dimensional microarray data. In this book, we applied different combinations of feature selection / extraction methods, as a novel hybrid dimension reduction method for SVM, ANN and NB classifiers. The obtained results are compared with other popular published dimension reduction methods for SVM, NB and ANN classifiers.

...read moreread less

Journal Article•10.1016/J.COMPAG.2020.105527•

Deep feature based rice leaf disease identification using support vector machine

[...]

Prabira Kumar Sethy¹, Nalini Kanta Barpanda¹, Amiya Kumar Rath², Santi Kumari Behera²•Institutions (2)

Sambalpur University¹, Veer Surendra Sai University of Technology²

01 Aug 2020-Computers and Electronics in Agriculture

TL;DR: The simulation results show the deep feature plus SVM perform better classification compared to transfer learning counterpart, and the F1 score of CNN classification models was compared with other traditional image classification models.

...read moreread less

Journal Article•10.1016/J.CHB.2019.106189•

Predicting Academic Performance of Students from VLE Big Data using Deep Learning Models

[...]

Hajra Waheed¹, Saeed-Ul Hassan¹, Naif Radi Aljohani², Julie Hardman³, Salem Alelyani⁴, Raheel Nawaz³ - Show less +2 more•Institutions (4)

Information Technology University¹, King Abdulaziz University², Manchester Metropolitan University³, King Khalid University⁴

01 Mar 2020-Computers in Human Behavior

TL;DR: A deep artificial neural network is deployed on a set of unique handcrafted features, extracted from the virtual learning environments clickstream data, to predict at-risk students providing measures for early intervention of such cases, to assist institutes in formulating a necessary framework for pedagogical support.

...read moreread less

Monograph•10.1017/9781108679930•

Mathematics for Machine Learning

[...]

Marc Peter Deisenroth¹, A. Aldo Faisal², Cheng Soon Ong³•Institutions (3)

University College London¹, Imperial College London², Commonwealth Scientific and Industrial Research Organisation³

23 Apr 2020

TL;DR: This self-contained textbook bridges the gap between mathematical and machine learning texts, introducing the mathematical concepts with a minimum of prerequisites to derive four central machine learning methods.

...read moreread less

Abstract: The fundamental mathematical tools needed to understand machine learning include linear algebra, analytic geometry, matrix decompositions, vector calculus, optimization, probability and statistics. These topics are traditionally taught in disparate courses, making it hard for data science or computer science students, or professionals, to efficiently learn the mathematics. This self-contained textbook bridges the gap between mathematical and machine learning texts, introducing the mathematical concepts with a minimum of prerequisites. It uses these concepts to derive four central machine learning methods: linear regression, principal component analysis, Gaussian mixture models and support vector machines. For students and others with a mathematical background, these derivations provide a starting point to machine learning texts. For those learning the mathematics for the first time, the methods help build intuition and practical experience with applying mathematical concepts. Every chapter includes worked examples and exercises to test understanding. Programming tutorials are offered on the book's web site.

...read moreread less

Journal Article•10.1109/ACCESS.2020.3009537•

A Novel CNN-GRU-Based Hybrid Approach for Short-Term Residential Load Forecasting

[...]

Muhammad Sajjad¹, Zulfiqar Ahmad Khan², Amin Ullah², Tanveer Hussain², Waseem Ullah², Mi Young Lee², Sung Wook Baik² - Show less +3 more•Institutions (2)

Islamia College University¹, Sejong University²

15 Jul 2020-IEEE Access

TL;DR: The proposed model is an effective alternative to the previous hybrid models in terms of computational complexity as well prediction accuracy, due to the representative features’ extraction potentials of CNNs and effectual gated structure of multi-layered GRU.

...read moreread less

Abstract: Electric energy forecasting domain attracts researchers due to its key role in saving energy resources, where mainstream existing models are based on Gradient Boosting Regression (GBR), Artificial Neural Networks (ANNs), Extreme Learning Machine (ELM) and Support Vector Machine (SVM). These models encounter high-level of non-linearity between input data and output predictions and limited adoptability in real-world scenarios. Meanwhile, energy forecasting domain demands more robustness, higher prediction accuracy and generalization ability for real-world implementation. In this paper, we achieve the mentioned tasks by developing a hybrid sequential learning-based energy forecasting model that employs Convolution Neural Network (CNN) and Gated Recurrent Units (GRU) into a unified framework for accurate energy consumption prediction. The proposed framework has two major phases: (1) data refinement and (2) training, where the data refinement phase applies preprocessing strategies over raw data. In the training phase, CNN features are extracted from input dataset and fed in to GRU, that is selected as optimal and observed to have enhanced sequence learning abilities after extensive experiments. The proposed model is an effective alternative to the previous hybrid models in terms of computational complexity as well prediction accuracy, due to the representative features' extraction potentials of CNNs and effectual gated structure of multi-layered GRU. The experimental evaluation over existing energy forecasting datasets reveal the better performance of our method in terms of preciseness and efficiency. The proposed method achieved the smallest error rate on Appliances Energy Prediction (AEP) and Individual Household Electric Power Consumption (IHEPC) datasets, when compared to other baseline models.

...read moreread less

Posted Content•

Coronavirus (COVID-19) Classification using CT Images by Machine Learning Methods

[...]

Mucahid Barstugan, Umut Özkaya, Şaban Öztürk¹•Institutions (1)

Amasya University¹

20 Mar 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: Early phase detection of Coronavirus (COVID-19), which is named by World Health Organization (WHO), by machine learning methods was implemented on abdominal Computed Tomography (CT) images to increase the classification performance.

...read moreread less

Abstract: This study presents early phase detection of Coronavirus (COVID-19), which is named by World Health Organization (WHO), by machine learning methods. The detection process was implemented on abdominal Computed Tomography (CT) images. The expert radiologists detected from CT images that COVID-19 shows different behaviours from other viral pneumonia. Therefore, the clinical experts specify that COVID-19 virus needs to be diagnosed in early phase. For detection of the COVID-19, four different datasets were formed by taking patches sized as 16x16, 32x32, 48x48, 64x64 from 150 CT images. The feature extraction process was applied to patches to increase the classification performance. Grey Level Co-occurrence Matrix (GLCM), Local Directional Pattern (LDP), Grey Level Run Length Matrix (GLRLM), Grey-Level Size Zone Matrix (GLSZM), and Discrete Wavelet Transform (DWT) algorithms were used as feature extraction methods. Support Vector Machines (SVM) classified the extracted features. 2-fold, 5-fold and 10-fold cross-validations were implemented during the classification process. Sensitivity, specificity, accuracy, precision, and F-score metrics were used to evaluate the classification performance. The best classification accuracy was obtained as 99.68% with 10-fold cross-validation and GLSZM feature extraction method.

...read moreread less

Journal Article•10.33889/IJMEMS.2020.5.4.052•

Detection of Coronavirus Disease (COVID-19) based on Deep Features and Support Vector Machine

[...]

Prabira Kumar Sethy¹, Santi Kumari Behera², Pradyumna Kumar Ratha¹, Preesat Biswas³•Institutions (3)

Sambalpur University¹, Osmania University², Dr. C. V. Raman University³

1 Jan 2020

TL;DR: The deep feature plus support vector machine (SVM) based methodology is suggested for detection of coronavirus infected patient using X-ray images and the method is beneficial for the medical practitioner to classify among the COVID-19 patient, pneumonia patient and healthy people.

...read moreread less

Abstract: The detection of coronavirus (COVID-19) is now a critical task for the medical practitioner The coronavirus spread so quickly between people and approaches 100,000 people worldwide In this consequence, it is very much essential to identify the infected people so that prevention of spread can be taken In this paper, the deep feature plus support vector machine (SVM) based methodology is suggested for detection of coronavirus infected patient using X-ray images For classification, SVM is used instead of deep learning based classifier, as the later one need a large dataset for training and validation The deep features from the fully connected layer of CNN model are extracted and fed to SVM for classification purpose The SVM classifies the corona affected X-ray images from others The methodology consists of three categories of Xray images, i e , COVID-19, pneumonia and normal The method is beneficial for the medical practitioner to classify among the COVID-19 patient, pneumonia patient and healthy people SVM is evaluated for detection of COVID-19 using the deep features of different 13 number of CNN models The SVM produced the best results using the deep feature of ResNet50 The classification model, i e ResNet50 plus SVM achieved accuracy, sensitivity, FPR and F1 score of 95 33%,95 33%,2 33% and 95 34% respectively for detection of COVID-19 (ignoring SARS, MERS and ARDS) Again, the highest accuracy achieved by ResNet50 plus SVM is 98 66% The result is based on the Xray images available in the repository of GitHub and Kaggle As the data set is in hundreds, the classification based on SVM is more robust compared to the transfer learning approach Also, a comparison analysis of other traditional classification method is carried out The traditional methods are local binary patterns (LBP) plus SVM, histogram of oriented gradients (HOG) plus SVM and Gray Level Co-occurrence Matrix (GLCM) plus SVM In traditional image classification method, LBP plus SVM achieved 93 4% of accuracy

...read moreread less

Journal Article•10.1186/S40537-020-00379-6•

Performance Analysis of Intrusion Detection Systems Using a Feature Selection Method on the UNSW-NB15 Dataset

[...]

Sydney Mambwe Kasongo¹, Yanxia Sun¹•Institutions (1)

University of Johannesburg¹

01 Dec 2020-Journal of Big Data

TL;DR: An analysis of the UNSW-NB15 intrusion detection dataset is presented and a filter-based feature reduction technique using the XGBoost algorithm is applied that allows for methods such as the DT to increase its test accuracy from 88.13 to 90.85% for the binary classification scheme.

...read moreread less

Abstract: Computer networks intrusion detection systems (IDSs) and intrusion prevention systems (IPSs) are critical aspects that contribute to the success of an organization. Over the past years, IDSs and IPSs using different approaches have been developed and implemented to ensure that computer networks within enterprises are secure, reliable and available. In this paper, we focus on IDSs that are built using machine learning (ML) techniques. IDSs based on ML methods are effective and accurate in detecting networks attacks. However, the performance of these systems decreases for high dimensional data spaces. Therefore, it is crucial to implement an appropriate feature extraction method that can prune some of the features that do not possess a great impact in the classification process. Moreover, many of the ML based IDSs suffer from an increase in false positive rate and a low detection accuracy when the models are trained on highly imbalanced datasets. In this paper, we present an analysis the UNSW-NB15 intrusion detection dataset that will be used for training and testing our models. Moreover, we apply a filter-based feature reduction technique using the XGBoost algorithm. We then implement the following ML approaches using the reduced feature space: Support Vector Machine (SVM), k-Nearest-Neighbour (kNN), Logistic Regression (LR), Artificial Neural Network (ANN) and Decision Tree (DT). In our experiments, we considered both the binary and multiclass classification configurations. The results demonstrated that the XGBoost-based feature selection method allows for methods such as the DT to increase its test accuracy from 88.13 to 90.85% for the binary classification scheme.

...read moreread less

Journal Article•10.1007/S10346-019-01274-9•

A deep learning algorithm using a fully connected sparse autoencoder neural network for landslide susceptibility prediction

[...]

Faming Huang¹, Jing Zhang¹, Chuangbing Zhou¹, Yuhao Wang¹, Jinsong Huang², Li Zhu¹ - Show less +2 more•Institutions (2)

Nanchang University¹, University of Newcastle²

01 Jan 2020-Landslides

TL;DR: The asymmetric and unsupervised FC-SAE can extract optimal non-linear features from environmental factors successfully, outperforms some conventional machine learning methods, and is promising for LSP.

...read moreread less

Abstract: The environmental factors of landslide susceptibility are generally uncorrelated or non-linearly correlated, resulting in the limited prediction performances of conventional machine learning methods for landslide susceptibility prediction (LSP). Deep learning methods can exploit low-level features and high-level representations of information from environmental factors. In this paper, a novel deep learning–based algorithm, the fully connected spare autoencoder (FC-SAE), is proposed for LSP. The FC-SAE consists of four steps: raw feature dropout in input layers, a sparse feature encoder in hidden layers, sparse feature extraction in output layers, and classification and prediction. The Sinan County of Guizhou Province in China, with a total of 23,195 landslide grid cells (306 recorded landslides) and 23,195 randomly selected non-landslide grid cells, was used as study case. The frequency ratio values of 27 environmental factors were taken as the input variables of FC-SAE. All 46,390 landslide and non-landslide grid cells were randomly divided into a training dataset (70%) and a test dataset (30%). By analyzing real landslide/non-landslide data, the performances of the FC-SAE and two other conventional machine learning methods, support vector machine (SVM) and back-propagation neural network (BPNN), were compared. The results show that the prediction rate and total accuracies of the FC-SAE are 0.854 and 85.2% which are higher than those of the SVM-only (0.827 and 81.56%) and BPNN (0.819 and 80.86%), respectively. In conclusion, the asymmetric and unsupervised FC-SAE can extract optimal non-linear features from environmental factors successfully, outperforms some conventional machine learning methods, and is promising for LSP.

...read moreread less

Journal Article•10.1109/ACCESS.2020.2980961•

Machine Learning Applied to Electrified Vehicle Battery State of Charge and State of Health Estimation: State-of-the-Art

[...]

Carlos Vidal¹, Pawel Malysz¹, Phillip J. Kollmeyer¹, Ali Emadi¹•Institutions (1)

McMaster University¹

16 Mar 2020-IEEE Access

TL;DR: A survey of battery state estimation methods based on ML approaches such as feedforward neural networks, recurrent neural networks (RNNs), support vector machines (SVM), radial basis functions (RBF), and Hamming networks is provided.

...read moreread less

Abstract: The growing interest and recent breakthroughs in artificial intelligence and machine learning (ML) have actively contributed to an increase in research and development of new methods to estimate the states of electrified vehicle batteries. Data-driven approaches, such as ML, are becoming more popular for estimating the state of charge (SOC) and state of health (SOH) due to greater availability of battery data and improved computing power capabilities. This paper provides a survey of battery state estimation methods based on ML approaches such as feedforward neural networks (FNNs), recurrent neural networks (RNNs), support vector machines (SVM), radial basis functions (RBF), and Hamming networks. Comparisons between methods are shown in terms of data quality, inputs and outputs, test conditions, battery types, and stated accuracy to give readers a bigger picture view of the ML landscape for SOC and SOH estimation. Additionally, to provide insight into how to best approach with the comparison of different neural network structures, an FNN and long short-term memory (LSTM) RNN are trained fifty times each for 3000 epochs. The error is somewhat different for each training repetition due to the random initial values of the trainable parameters, demonstrating that it is important to train networks multiple times to achieve the best result. Furthermore, it is recommended that when performing a comparison among estimation techniques such as those presented in this review paper, the compared networks should have a similar number of learnable parameters and be trained and tested with identical data. Otherwise, it is difficult to make a general conclusion regarding the quality of a given estimation technique.

...read moreread less

Journal Article•10.1016/J.CATENA.2020.104580•

Comparisons of heuristic, general statistical and machine learning models for landslide susceptibility prediction and mapping

[...]

Faming Huang¹, Zhongshan Cao¹, Jianfei Guo¹, Shui-Hua Jiang¹, Shu Li, Zizheng Guo² - Show less +2 more•Institutions (2)

Nanchang University¹, China University of Geosciences (Wuhan)²

01 Aug 2020-Catena

TL;DR: It can be inferred that the machine learning models have higher LSP performance than general statistical and heuristic models due to its high AUC accuracy and reasonable LSIs distribution features, while general statistical model is limited by its linear analysis and heuristics limited by subjective weighting process.

...read moreread less

Abstract: Commonly used data-driven models for landslide susceptibility prediction (LSP) can be mainly classified as heuristic, general statistical or machine learning models. This study plans to compare the prediction performance of these data-driven models on the landslide susceptibility mapping, thus further to explore the inherently features of these data-driven models. As a result, a more accurate and reliable LSP can be realized through choosing an optimal data-based model. A heuristic model represented by the analytic hierarchy process (AHP), a general statistical model represented by the general linear model (GLM) and information value (IV) model, and machine learning models represented by binary logistic regression (BLR), Multilayer Perceptron (MLP), back-propagation neural network (BPNN), support vector machine (SVM) and C5.0 decision tree (C5.0 DT) are adopted in this study. Shicheng County in China is used as the study area. In total, 369 landslides identified through field investigation are classified as training (70%) and testing datasets (30%). Next, 13 landslide conditioning factors (elevation, slope, aspect, plan curvature, profile curvature, relief amplitude, total surface radiation, population density, Normalized difference vegetation index, distance to river, topographic wetness index and rock types) are acquired from data sources of the free remote sensing images, Digital Elevation Model, field investigation and government reports. The correlations between these conditioning factors and the landslide locations are determined by frequency ratio analysis. Then, the landslide susceptibility indexes (LSIs) calculated by the eight trained models are imported into GIS software to produce landslide susceptibility maps of Shicheng County. Finally, the area under receiver operating characteristic curve (AUC), the calculated LSIs are applied to assess the LSP performance of the present eight models. The testing results show that these eight models generate reasonable LSP results as a whole, further showing that the C5.0 DT is of the highest prediction accuracy with an AUC value of 0.868, followed by the SVM (0.813), BPNN (0.803), MLP (0.792), BLR (0.784), GLM (0.779), IV (0.774) and AHP (0.773). It can be inferred that the machine learning models have higher LSP performance than general statistical and heuristic models due to its high AUC accuracy and reasonable LSIs distribution features, while general statistical model is limited by its linear analysis and heuristic model is limited by subjective weighting process.

...read moreread less

Journal Article•10.1038/S41567-021-01287-Z•

A rigorous and robust quantum speed-up in supervised machine learning

[...]

Yunchao Liu¹, Yunchao Liu², Srinivasan Arunachalam¹, Kristan Temme¹•Institutions (2)

IBM¹, University of California, Berkeley²

05 Oct 2020-arXiv: Quantum Physics

TL;DR: A rigorous quantum speed-up for supervised classification using a quantum learning algorithm that only requires classical access to data and achieves high accuracy, robust against additive errors in the kernel entries that arise from finite sampling statistics.

...read moreread less

Abstract: Over the past few years several quantum machine learning algorithms were proposed that promise quantum speed-ups over their classical counterparts. Most of these learning algorithms either assume quantum access to data -- making it unclear if quantum speed-ups still exist without making these strong assumptions, or are heuristic in nature with no provable advantage over classical algorithms. In this paper, we establish a rigorous quantum speed-up for supervised classification using a general-purpose quantum learning algorithm that only requires classical access to data. Our quantum classifier is a conventional support vector machine that uses a fault-tolerant quantum computer to estimate a kernel function. Data samples are mapped to a quantum feature space and the kernel entries can be estimated as the transition amplitude of a quantum circuit. We construct a family of datasets and show that no classical learner can classify the data inverse-polynomially better than random guessing, assuming the widely-believed hardness of the discrete logarithm problem. Meanwhile, the quantum classifier achieves high accuracy and is robust against additive errors in the kernel entries that arise from finite sampling statistics.

...read moreread less

Journal Article•10.1109/ACCESS.2020.2972627•

BAT: Deep Learning Methods on Network Intrusion Detection Using NSL-KDD Dataset

[...]

Tongtong Su¹, Huazhi Sun¹, Jinqi Zhu¹, Sheng Wang¹, Yabo Li¹ - Show less +1 more•Institutions (1)

Tianjin Normal University¹

10 Feb 2020-IEEE Access

TL;DR: The proposed end-to-end model does not use any feature engineering skills and can automatically learn the key features of the hierarchy and can well describe the network traffic behavior and improve the ability of anomaly detection effectively.

...read moreread less

Abstract: Intrusion detection can identify unknown attacks from network traffics and has been an effective means of network security. Nowadays, existing methods for network anomaly detection are usually based on traditional machine learning models, such as KNN, SVM, etc. Although these methods can obtain some outstanding features, they get a relatively low accuracy and rely heavily on manual design of traffic features, which has been obsolete in the age of big data. To solve the problems of low accuracy and feature engineering in intrusion detection, a traffic anomaly detection model BAT is proposed. The BAT model combines BLSTM (Bidirectional Long Short-term memory) and attention mechanism. Attention mechanism is used to screen the network flow vector composed of packet vectors generated by the BLSTM model, which can obtain the key features for network traffic classification. In addition, we adopt multiple convolutional layers to capture the local features of traffic data. As multiple convolutional layers are used to process data samples, we refer BAT model as BAT-MC. The softmax classifier is used for network traffic classification. The proposed end-to-end model does not use any feature engineering skills and can automatically learn the key features of the hierarchy. It can well describe the network traffic behavior and improve the ability of anomaly detection effectively. We test our model on a public benchmark dataset, and the experimental results demonstrate our model has better performance than other comparison methods.

...read moreread less

Journal Article•10.1016/J.APM.2019.12.016•

Selecting appropriate machine learning methods for digital soil mapping

[...]

Yones Khaledian¹, Bradley A. Miller¹•Institutions (1)

Iowa State University¹

01 May 2020-Applied Mathematical Modelling

TL;DR: This work compares the strengths and weaknesses of multiple linear regression (MLR), k-nearest neighbors (KNN), support vector regression (SVR), Cubist, random forest (RF), and artificial neural networks (ANN) for DSM.

...read moreread less

Journal Article•10.1007/S00366-020-01081-0•

A novel hybrid extreme learning machine–grey wolf optimizer (ELM-GWO) model to predict compressive strength of concrete with partial replacements for cement

[...]

Mahdi Shariati¹, Mohammad Saeed Mafipour², Behzad Ghahremani², Fazel Azarhomayun², Masoud Ahmadi, Nguyen Thoi Trung³, Ali Shariati³ - Show less +3 more•Institutions (3)

Duy Tan University¹, University of Tehran², Ton Duc Thang University³

20 Jun 2020-Engineering With Computers

TL;DR: The results of the paper show that combining the ELM model with GWO can efficiently improve the performance of this model, and it is deducted that the ELm-GWO model is capable of reaching superior performance indices in comparison with those of the other models.

...read moreread less

Abstract: Compressive strength of concrete is one of the most determinant parameters in the design of engineering structures. This parameter is generally determined by conducting several tests at different ages of concrete in spite of the fact that such tests are not only costly but also time-consuming. As an alternative to these tests, machine learning (ML) techniques can be used to estimate experimental results. However, the dependence of compressive strength on different parameters in the fabrication of concrete makes the prediction problem challenging, especially in the case of concrete with partial replacements for cement. In this investigation, an extreme learning machine (ELM) is combined with a metaheuristic algorithm known as grey wolf optimizer (GWO) and a novel hybrid ELM-GWO model is proposed to predict the compressive strength of concrete with partial replacements for cement. To evaluate the performance of the ELM-GWO model, five of the most well-known ML models including an artificial neural network (ANN), an adaptive neuro-fuzzy inference system (ANFIS), an extreme learning machine, a support vector regression with radial basis function (RBF) kernel (SVR-RBF), and another SVR with a polynomial function (Poly) kernel (SVR-Poly) are developed. Finally, the performance of the models is compared with each other. The results of the paper show that combining the ELM model with GWO can efficiently improve the performance of this model. Also, it is deducted that the ELM-GWO model is capable of reaching superior performance indices in comparison with those of the other models.

...read moreread less

...

Expand