Top 19165 papers published in the topic of Support vector machine in 2022

Showing papers on "Support vector machine published in 2022"

Proceedings Article•10.1109/cvpr52688.2022.01166•

Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs

[...]

1 Jun 2022

TL;DR: RepLKNet as discussed by the authors proposes to use a few large convolutional kernels instead of a stack of small kernels to close the performance gap between CNNs and ViTs, achieving comparable or superior results than Swin Transformer on ImageNet.

...read moreread less

Abstract: We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few large convolutional kernels instead of a stack of small kernels could be a more powerful paradigm. We suggested five guidelines, e.g., applying re-parameterized large depthwise convolutions, to design efficient high-performance large-kernel CNNs. Following the guidelines, we propose RepLKNet, a pure CNN architecture whose kernel size is as large as 31×31, in contrast to commonly used 3×3. RepLKNet greatly closes the performance gap between CNNs and ViTs, e.g., achieving comparable or superior results than Swin Transformer on ImageNet and a few typical downstream tasks, with lower latency. RepLKNet also shows nice scalability to big data and large models, obtaining 87.8% top-1 accuracy on ImageNet and 56.0% mIoU on ADE20K, which is very competitive among the state-of-the-arts with similar model sizes. Our study further reveals that, in contrast to small-kernel CNNs, large-kernel CNNs have much larger effective receptive fields and higher shape bias rather than texture bias. Code & models at https://github.com/megvii-research/RepLKNet.

...read moreread less

640 citations

Journal Article•10.1016/j.dajour.2022.100071•

A comparative analysis of K-Nearest Neighbour, Genetic, Support Vector Machine, Decision Tree, and Long Short Term Memory algorithms in machine learning

[...]

Malti Bansal, Apoorva Goyal, Apoorva Choudhary

01 May 2022-Decision Analytics Journal

TL;DR: In this article , the authors reviewed and compared five machine learning algorithms, namely, KNN, GA, SVM, Decision Tree (DT), Long Short Term Memory (LSTM) and Support Vector Machine (SVM).

...read moreread less

373 citations

Journal Article•10.1016/j.eswa.2022.116659•

Machine learning techniques and data for stock market forecasting: A literature review

[...]

Mahinda Mailagaha Kumbure, Christoph Lohrmann, Pasi Luukka, Jari Porras

01 Feb 2022-Expert systems with applications

TL;DR: In this paper , a review of machine learning techniques applied for stock market prediction is presented, focusing on the stock markets investigated in the literature as well as the types of variables used as input in the machine learning methods used for predicting these markets.

...read moreread less

Abstract: In this literature review, we investigate machine learning techniques that are applied for stock market prediction. A focus area in this literature review is the stock markets investigated in the literature as well as the types of variables used as input in the machine learning techniques used for predicting these markets. We examined 138 journal articles published between 2000 and 2019. The main contributions of this review are: (1) an extensive examination of the data, in particular, the markets and stock indices covered in the predictions, as well as the 2173 unique variables used for stock market predictions, including technical indicators, macro-economic variables, and fundamental indicators, and (2) an in-depth review of the machine learning techniques and their variants deployed for the predictions. In addition, we provide a bibliometric analysis of these journal articles, highlighting the most influential works and articles.

...read moreread less

310 citations

Journal Article•10.1016/j.irbm.2021.06.003•

A Hybrid CNN-SVM Threshold Segmentation Approach for Tumor Detection and Classification of MRI Brain Images

[...]

1 Aug 2022

TL;DR: In this paper , a hybrid model combined CNN and support vector machine (SVM) in terms of classification and with threshold-based segmentation for detection of brain tumor in MRI images.

...read moreread less

Abstract: In this research paper, the brain MRI images are going to classify by considering the excellence of CNN on a public dataset to classify Benign and Malignant tumors. Deep learning (DL) methods due to good performance in the last few years have become more popular for Image classification. Convolution Neural Network (CNN), with several methods, can extract features without using handcrafted models, and eventually, show better accuracy of classification. The proposed hybrid model combined CNN and support vector machine (SVM) in terms of classification and with threshold-based segmentation in terms of detection. The findings of previous studies are based on different models with their accuracy as Rough Extreme Learning Machine (RELM)-94.233%, Deep CNN (DCNN)-95%, Deep Neural Network (DNN) and Discrete Wavelet Autoencoder (DWA)-96%, k-nearest neighbors (kNN)-96.6%, CNN-97.5%. The overall accuracy of the hybrid CNN-SVM is obtained as 98.4959%. In today's world, brain cancer is one of the most dangerous diseases with the highest death rate, detection and classification of brain tumors due to abnormal growth of cells, shapes, orientation, and the location is a challengeable task in medical imaging. Magnetic resonance imaging (MRI) is a typical method of medical imaging for brain tumor analysis. Conventional machine learning (ML) techniques categorize brain cancer based on some handicraft property with the radiologist specialist choice. That can lead to failure in the execution and also decrease the effectiveness of an Algorithm. With a brief look came to know that the proposed hybrid model provides more effective and improvement techniques for classification.

...read moreread less

293 citations

Journal Article•10.1186/s40561-022-00192-z•

Educational data mining: prediction of students' academic performance using machine learning algorithms

[...]

Mustafa Yağcı

03 Mar 2022-Smart Learning Environments

TL;DR: In this article , a new model based on machine learning algorithms to predict the final exam grades of undergraduate students, taking their midterm exam grades as the source data, was proposed and the results showed that the proposed model achieved a classification accuracy of 70-75%.

...read moreread less

Abstract: Abstract Educational data mining has become an effective tool for exploring the hidden relationships in educational data and predicting students' academic achievements. This study proposes a new model based on machine learning algorithms to predict the final exam grades of undergraduate students, taking their midterm exam grades as the source data. The performances of the random forests, nearest neighbour, support vector machines, logistic regression, Naïve Bayes, and k-nearest neighbour algorithms, which are among the machine learning algorithms, were calculated and compared to predict the final exam grades of the students. The dataset consisted of the academic achievement grades of 1854 students who took the Turkish Language-I course in a state University in Turkey during the fall semester of 2019–2020. The results show that the proposed model achieved a classification accuracy of 70–75%. The predictions were made using only three types of parameters; midterm exam grades, Department data and Faculty data. Such data-driven studies are very important in terms of establishing a learning analysis framework in higher education and contributing to the decision-making processes. Finally, this study presents a contribution to the early prediction of students at high risk of failure and determines the most effective machine learning methods.

...read moreread less

289 citations

Journal Article•10.1016/j.jobe.2021.103406•

Building energy consumption prediction for residential buildings using deep learning and other machine learning techniques

[...]

01 Jan 2022-Journal of building engineering

TL;DR: In this article , the authors presented the utilization of several machine learning techniques such as Artificial Neural Network (ANN), Gradient Boosting (GB), Deep Neural Networks (DNN), Random Forest (RF), Stacking, K Nearest Neighbour (KNN), Support Vector Machine (SVM), Decision tree (DT) and Linear Regression (LR) for predicting annual building energy consumption using a large dataset of residential buildings.

...read moreread less

Abstract: The high proportion of energy consumed in buildings has engendered the manifestation of many environmental problems which deploy adverse impacts on the existence of mankind. The prediction of building energy use is essentially proclaimed to be a method for energy conservation and improved decision-making towards decreasing energy usage. Also, the construction of energy efficient buildings will aid the reduction of total energy consumed in newly constructed buildings. Machine Learning (ML) method is recognised as the best suited approach for producing desired outcomes in prediction task. Hence, in several studies, ML has been applied in the field of energy consumption of operational building. However, there are not many studies investigating the suitability of ML methods for forecasting the potential building energy consumption at the early design phase to reduce the construction of more energy inefficient buildings. To address this gap, this paper presents the utilization of several machine learning techniques namely Artificial Neural Network (ANN), Gradient Boosting (GB), Deep Neural Network (DNN), Random Forest (RF), Stacking, K Nearest Neighbour (KNN), Support Vector Machine (SVM), Decision tree (DT) and Linear Regression (LR) for predicting annual building energy consumption using a large dataset of residential buildings. This study also examines the effect of the building clusters on the model performance. The novelty of this paper is to develop a model that enables designers input key features of a building design and forecast the annual average energy consumption at the early stages of development. This result reveals DNN as the most efficient predictive model for energy use at the early design phase and this presents a motivation for building designers to utilize it before construction to make informed decision, manage and optimize design.

...read moreread less

288 citations

Journal Article•10.1016/j.gltp.2022.03.016•

Plant Leaf Disease Detection using Computer Vision and Machine Learning Algorithms

[...]

Sunil S. Harakannanavar, J M Rudagi, Veena I Puranikmath, Ayesha Siddiqua, R. Pramodhini - Show less +1 more

01 Apr 2022-Global transitions proceedings

TL;DR: In this article , the authors proposed a method to detect the leaf diseases in the tomato plant using support vector machine (SVM), convolutional neural network (CNN), and K-Nearest Neighbor (K-NN).

...read moreread less

Abstract: Agriculture provides food to all the human beings even in case of rapid increase in the population. It is recommended to predict the plant diseases at their early stage in the field of agriculture is essential to cater the food to the overall population. But it unfortunate to predict the diseases at the early stage of the crops. The idea behind the paper is to bring awareness amongst the farmers about the cutting-edge technologies to reduces diseases in plant leaf. Since tomato is merely available vegetable, the approaches of machine learning and image processing with an accurate algorithm is identified to detect the leaf diseases in the tomato plant. In this investigation, the samples of tomato leaves having disorders are considered. With these disorder samples of tomato leaves, the farmers will easily find the diseases based on the early symptoms. Firstly, the samples of tomato leaves are resized to 256 × 256 pixels and then Histogram Equalization is used to improve the quality of tomato samples. The K-means clustering is introduced for partitioning of dataspace into Voronoi cells. The boundary of leaf samples is extracted using contour tracing. The multiple descriptors viz., Discrete Wavelet Transform, Principal Component Analysis and Grey Level Co-occurrence Matrix are used to extract the informative features of the leaf samples. Finally, the extracted features are classified using machine learning approaches such as Support Vector Machine (SVM), Convolutional Neural Network (CNN) and K-Nearest Neighbor (K-NN). The accuracy of the proposed model is tested using SVM (88%), K-NN (97%) and CNN (99.6%) on tomato disordered samples.

...read moreread less

284 citations

Journal Article•10.22452/mjcs.sp2022no1.10•

Predictive analysis of heart diseases with machine learning approaches

[...]

Ramesh Pandian Tr, Umesh Kumar Lilhore, P. M, Sarita Simaiya, Amandeep Kaur, Mounir Hamdi - Show less +2 more

31 Mar 2022-Malaysian Journal of Computer Science

TL;DR: The experimental results demonstrate the strength of KNN with eight neighbours order to test the effectiveness, effectiveness, precision, and accuracy, F1-score; as compared to other methods, i.e., Naive Bayes, SVM, Logistic regression, Decision Tree Classifier, Random Forest, and K- Nearest Neighbor.

...read moreread less

Abstract: Machine Learning (ML) is used in healthcare sectors worldwide. ML methods help in the protection of heart diseases, locomotor disorders in the medical data set. The discovery of such essential data helps researchers gain valuable insight into how to utilize their diagnosis and treatment for a particular patient. Researchers use various Machine Learning methods to examine massive amounts of complex healthcare data, which aids healthcare professionals in predicting diseases. In this research, we are using an online UCI dataset with 303 rows and 76 properties. Approximately 14 of these 76 properties are selected for testing, which is necessary to validate the performances of different methods. The isolation forest approach uses the data set’s most essential qualities and metrics to standardize the information for better precision. This analysis is based on supervised learning methods, i.e., Naive Bayes, SVM, Logistic regression, Decision Tree Classifier, Random Forest, and K- Nearest Neighbor. The experimental results demonstrate the strength of KNN with eight neighbours order to test the effectiveness, sensitivity, precision, and accuracy, F1-score; as compared to other methods, i.e., Naive Bayes, SVM (Linear Kernel), Decision Tree Classifier with 4 or 18 features, and Random Forest classifiers.

...read moreread less

268 citations

Journal Article•10.2174/1574893617666220404145517•

Distance-based support vector machine to predict DNA N6-methyladenine modification

[...]

Haoyu Zhang, Quan Zou, Ying Ju, Chenggang Song, Dong Chen - Show less +1 more

04 Apr 2022-Current Bioinformatics

TL;DR: The outcomes show that the DB-SVM method outperforms the iIM-CNN and csDMA in the prediction of DNA 6mA modification, which are the lastest research onDNA 6mA.

...read moreread less

Abstract: DNA N6-methyladenine plays an important role in the restriction-modification system to isolate invasion from adventive DNA. The shortcomings of the high time-consumption and high costs of experimental methods have been exposed, and some computational methods have emerged. The support vector machine theory has received extensive attention in the bioinformatics field due to its solid theoretical foundation and many good characteristics. General machine learning methods include an important step of extracting features. The research has omitted this step and replaced with easy-to-obtain sequence distances matrix to obtain better results First sequence alignment technology was used to achieve the similarity matrix. Then a novel transformation turned the similarity matrix into a distance matrix. Next, the similarity-distance matrix is made positive semi-definite so that it can be used in the kernel matrix. Finally, the LIBSVM software was applied to solve the support vector machine. The five-fold cross-validation of this model on rice and mouse data has achieved excellent accuracy rates of 92.04% and 96.51%, respectively. This shows that the DB-SVM method has obvious advantages compared with traditional machine learning methods. Meanwhile this model achieved 0.943,0.982 and 0.818 accuracy,0.944, 0.982, and 0.838 Matthews correlation coefficient and 0.942, 0.982 and 0.840 F1 scores for the rice, M. musculus and cross-species genome datasets, respectively. These outcomes show that this model outperforms the iIM-CNN and csDMA in the prediction of DNA 6mA modification, which are the lastest research on DNA 6mA.

...read moreread less

264 citations

Journal Article•10.1016/j.est.2022.104215•

State of health estimation of lithium-ion battery based on improved ant lion optimization and support vector regression

[...]

Qiang Li, Dezhi Li, Kunliang Zhao, Licheng Wang, Kai Wang - Show less +1 more

01 Jun 2022-Journal of energy storage

TL;DR: Zhang et al. as discussed by the authors proposed a SOH estimation method based on improved ant lion optimization algorithm and support vector regression (IALO-SVR), which can achieve accurate estimation of SOH with high estimation accuracy and robustness, and the estimation error is stable within 2%.

...read moreread less

Abstract: The state of health (SOH) estimation plays an important role in keeping the safe and stable operation of lithium-ion battery management system (BMS). To solve the problem of low estimation accuracy of traditional estimation methods, this paper proposes a SOH estimation method based on improved ant lion optimization algorithm and support vector regression (IALO-SVR). Firstly, the data of battery charge and discharge are analyzed geometrically, and four health features highly correlated with SOH decline are selected as the input of SVR model. Pearson correlation coefficient is used to quantitatively analyze the correlation between features and SOH. On the other hand, the IALO algorithm is used to optimize the kernel parameters of SVR, and the SOH estimation model is obtained after training with battery training set. To verify this method, batteries in different working conditions are verified on NASA battery data set, and compared with ALO-SVR and SVR. The experimental results show that this method can achieve accurate estimation of SOH, with high estimation accuracy and robustness, and the estimation error is stable within 2%.

...read moreread less

224 citations

Journal Article•10.1109/access.2022.3142097•

Prediction of diabetes empowered with fused machine learning

[...]

Usama Ali Ahmed, Ghassan F. Issa, Muhammad Adnan Khan, Shabib Aftab, Muhammad Farhan Khan, Raed A. Said, Taher M. Ghazal, Munir Uddin Ahmad - Show less +4 more

01 Jan 2022-IEEE Access

TL;DR: A model using a fused machine learning approach for diabetes prediction based on the patient’s real-time medical record has a prediction accuracy of 94.87, which is higher than the previously published methods.

...read moreread less

Abstract: In the medical field, it is essential to predict diseases early to prevent them. Diabetes is one of the most dangerous diseases all over the world. In modern lifestyles, sugar and fat are typically present in our dietary habits, which have increased the risk of diabetes. To predict the disease, it is extremely important to understand its symptoms. Currently, machine-learning (ML) algorithms are valuable for disease detection. This article presents a model using a fused machine learning approach for diabetes prediction. The conceptual framework consists of two types of models: Support Vector Machine (SVM) and Artificial Neural Network (ANN) models. These models analyze the dataset to determine whether a diabetes diagnosis is positive or negative. The dataset used in this research is divided into training data and testing data with a ratio of 70:30 respectively. The output of these models becomes the input membership function for the fuzzy model, whereas the fuzzy logic finally determines whether a diabetes diagnosis is positive or negative. A cloud storage system stores the fused models for future use. Based on the patient’s real-time medical record, the fused model predicts whether the patient is diabetic or not. The proposed fused ML model has a prediction accuracy of 94.87, which is higher than the previously published methods.

...read moreread less

Journal Article•10.1007/s00521-022-07292-4•

Human emotion recognition from EEG-based brain–computer interface using machine learning: a comprehensive review

[...]

Essam H. Houssein, A. Hammad, Abdelmgeid A. Ali

07 May 2022-Neural Computing and Applications

TL;DR: In this article , a review of the EEG-based emotion recognition methods is presented, including feature extraction, feature selection/reduction, machine learning methods (e.g., k-nearest neighbor), support vector machine, decision tree, artificial neural network, random forest, and naive Bayes) and deep learning methods.

...read moreread less

Abstract: Abstract Affective computing, a subcategory of artificial intelligence, detects, processes, interprets, and mimics human emotions. Thanks to the continued advancement of portable non-invasive human sensor technologies, like brain–computer interfaces (BCI), emotion recognition has piqued the interest of academics from a variety of domains. Facial expressions, speech, behavior (gesture/posture), and physiological signals can all be used to identify human emotions. However, the first three may be ineffectual because people may hide their true emotions consciously or unconsciously (so-called social masking). Physiological signals can provide more accurate and objective emotion recognition. Electroencephalogram (EEG) signals respond in real time and are more sensitive to changes in affective states than peripheral neurophysiological signals. Thus, EEG signals can reveal important features of emotional states. Recently, several EEG-based BCI emotion recognition techniques have been developed. In addition, rapid advances in machine and deep learning have enabled machines or computers to understand, recognize, and analyze emotions. This study reviews emotion recognition methods that rely on multi-channel EEG signal-based BCIs and provides an overview of what has been accomplished in this area. It also provides an overview of the datasets and methods used to elicit emotional states. According to the usual emotional recognition pathway, we review various EEG feature extraction, feature selection/reduction, machine learning methods (e.g., k-nearest neighbor), support vector machine, decision tree, artificial neural network, random forest, and naive Bayes) and deep learning methods (e.g., convolutional and recurrent neural networks with long short term memory). In addition, EEG rhythms that are strongly linked to emotions as well as the relationship between distinct brain areas and emotions are discussed. We also discuss several human emotion recognition studies, published between 2015 and 2021, that use EEG data and compare different machine and deep learning algorithms. Finally, this review suggests several challenges and future research directions in the recognition and classification of human emotional states using EEG.

...read moreread less

Journal Article•10.1016/j.neuri.2022.100060•

MRI-based Brain Tumor Image Detection Using CNN based Deep Learning Method

[...]

Arkapravo Chattopadhyay, Mausumi Maitra

01 Feb 2022-Neuroscience Informatics

TL;DR: In this article , a convolutional neural network (CNN) was used to segment brain tumours from 2D Magnetic Resonance brain Images (MRI) followed by traditional classifiers and deep learning methods.

...read moreread less

Journal Article•10.1186/s40561-022-00192-z•

Educational data mining: prediction of students' academic performance using machine learning algorithms

[...]

Mustafa Yağcı

03 Mar 2022-Smart Learning Environments

...read moreread less

Journal Article•10.1155/2022/7799812•

Improved Artificial Neural Network with State Order Dataset Estimation for Brain Cancer Cell Diagnosis

[...]

Dnvsls Indira, Rajendra Kumar Ganiya, P. Ashok babu, A. Xavier, L. Kavisankar, S. Hemalatha, V. Senthilkumar, Thummuluru Kavitha, A. Rajaram, Karthik Annam, A. Yeshitla - Show less +7 more

16 Apr 2022-BioMed Research International

TL;DR: In this research, the improved gene expression programming (IGEP) effectual features are selected by using remainder performance to improve the prognosis efficiency.

...read moreread less

Abstract: Brain cancer is one of the cell synthesis diseases. Brain cancer cells are analyzed for patient diagnosis. Due to this composite cell, the conceptual classifications differ from each and every brain cancer investigation. In the gene test, patient prognosis is identified based on individual biocell appearance. Classification of advanced artificial neural network subtypes attains improved performance compared to previous enhanced artificial neural network (EANN) biocell subtype investigation. In this research, the proposed features are selected based on improved gene expression programming (IGEP) with modified brute force algorithm. Then, the maximum and minimum term survivals are classified by using PCA with enhanced artificial neural network (EANN). In this, the improved gene expression programming (IGEP) effectual features are selected by using remainder performance to improve the prognosis efficiency. This system is estimated by using the Cancer Genome Atlas (CGA) dataset. Simulation outputs present improved gene expression programming (IGEP) with modified brute force algorithm which achieves accurate efficiency of 96.37%, specificity of 96.37%, sensitivity of 98.37%, precision of 78.78%, F-measure of 80.22%, and recall of 64.29% when compared to generalized regression neural network (GRNN), improved extreme learning machine (IELM) with minimum redundancy maximum relevance (MRMR) method, and support vector machine (SVM).

...read moreread less

Journal Article•10.1016/j.ress.2021.108223•

Machine learning-based methods in structural reliability analysis: A review

[...]

01 Mar 2022-Reliability Engineering & System Safety

TL;DR: A review of the machine learning-based structural reliability analysis methods is presented in this article , where the authors focus on the different models' structures and diverse applications of each ML method in different aspects of SRA.

...read moreread less

Journal Article•10.1155/2022/4688327•

Hybrid Model for Detection of Cervical Cancer Using Causal Analysis and Machine Learning Techniques

[...]

Umesh Kumar Lilhore, M. Poongodi, Amandeep Kaur, Sarita Simaiya, Abeer D. Algarni, Hela Elmannai, V. Vijayakumar, Godwin Brown Tunze, Mounir Hamdi - Show less +5 more

04 May 2022-Computational and Mathematical Methods in Medicine

TL;DR: The proposed model's primary aim is to determine the importance of cervical cancer screening factors for classifying high-risk patients depending on the findings, and the proposed Boruta analysis with SVM performs outstanding over existing methods.

...read moreread less

Abstract: Cervical cancer has become the third most common form of cancer in the in-universe, after the widespread breast cancer. Human papillomavirus risk of infection is linked to the majority of cancer cases. Preventive care, the most expensive way of fighting cancer, can protect about 37% of cancer cases. The Pap smear examination is a standard screening procedure for the initial screening of cervical cancer. However, this manual test procedure generates many false-positive outcomes due to individual errors. Various researchers have extensively investigated machine learning (ML) methods for classifying cervical Pap cells to enhance manual testing. The random forest method is the most popular method for anticipating features from a high-dimensional cancer image dataset. However, the random forest method can get too slow and inefficient for real-time forecasts when too many decision trees are used. This research proposed an efficient feature selection and prediction model for cervical cancer datasets using Boruta analysis and SVM method to deal with this challenge. A Boruta analysis method is used. It is improved from of random forest method and mainly discovers feature subsets from the data source that are significant to assigned classification activity. The proposed model's primary aim is to determine the importance of cervical cancer screening factors for classifying high-risk patients depending on the findings. This research work analyses cervical cancer and various risk factors to help detect cervical cancer. The proposed model Boruta with SVM and various popular ML models are implemented using Python and various performance measuring parameters, i.e., accuracy, precision, F1–Score, and recall. However, the proposed Boruta analysis with SVM performs outstanding over existing methods.

...read moreread less

Journal Article•10.3390/s22165986•

A Hybrid Intrusion Detection Model Using EGA-PSO and Improved Random Forest Method

[...]

Amit Kumar Balyan, Sachin Ahuja, Umesh Kumar Lilhore, Sanjeev Kumar Sharma, Poongodi Manoharan, Abeer D. Algarni, Hela Elmannai, Kaamran Raahemifar - Show less +4 more

01 Aug 2022-Sensors

TL;DR: This research develops an efficient hybrid network-based IDS model (HNIDS), which is utilized using the enhanced genetic algorithm and particle swarm optimization (EGA-PSO) and improved random forest (IRF) methods to deal with the data imbalance issue.

...read moreread less

Abstract: Due to the rapid growth in IT technology, digital data have increased availability, creating novel security threats that need immediate attention. An intrusion detection system (IDS) is the most promising solution for preventing malicious intrusions and tracing suspicious network behavioral patterns. Machine learning (ML) methods are widely used in IDS. Due to a limited training dataset, an ML-based IDS generates a higher false detection ratio and encounters data imbalance issues. To deal with the data-imbalance issue, this research develops an efficient hybrid network-based IDS model (HNIDS), which is utilized using the enhanced genetic algorithm and particle swarm optimization(EGA-PSO) and improved random forest (IRF) methods. In the initial phase, the proposed HNIDS utilizes hybrid EGA-PSO methods to enhance the minor data samples and thus produce a balanced data set to learn the sample attributes of small samples more accurately. In the proposed HNIDS, a PSO method improves the vector. GA is enhanced by adding a multi-objective function, which selects the best features and achieves improved fitness outcomes to explore the essential features and helps minimize dimensions, enhance the true positive rate (TPR), and lower the false positive rate (FPR). In the next phase, an IRF eliminates the less significant attributes, incorporates a list of decision trees across each iterative process, supervises the classifier’s performance, and prevents overfitting issues. The performance of the proposed method and existing ML methods are tested using the benchmark datasets NSL-KDD. The experimental findings demonstrated that the proposed HNIDS method achieves an accuracy of 98.979% on BCC and 88.149% on MCC for the NSL-KDD dataset, which is far better than the other ML methods i.e., SVM, RF, LR, NB, LDA, and CART.

...read moreread less

Journal Article•10.1155/2022/1684017•

A Novel Diabetes Healthcare Disease Prediction Framework Using Machine Learning Techniques

[...]

R. Krishnamoorthi, Shubham Joshi, Hatim Almarzouki, Piyush Kumar Shukla, Ali Rizwan, C. Kalpana, Basant Tiwari - Show less +3 more

11 Jan 2022-Journal of Healthcare Engineering

TL;DR: It is proposed in this study that a unique intelligent diabetes mellitus prediction framework (IDMPF) is developed using machine learning after conducting a rigorous review of existing prediction models in the literature and examining their applicability to diabetes.

...read moreread less

Abstract: Diabetes is a chronic disease that continues to be a significant and global concern since it affects the entire population's health. It is a metabolic disorder that leads to high blood sugar levels and many other problems such as stroke, kidney failure, and heart and nerve problems. Several researchers have attempted to construct an accurate diabetes prediction model over the years. However, this subject still faces significant open research issues due to a lack of appropriate data sets and prediction approaches, which pushes researchers to use big data analytics and machine learning (ML)-based methods. Applying four different machine learning methods, the research tries to overcome the problems and investigate healthcare predictive analytics. The study's primary goal was to see how big data analytics and machine learning-based techniques may be used in diabetes. The examination of the results shows that the suggested ML-based framework may achieve a score of 86. Health experts and other stakeholders are working to develop categorization models that will aid in the prediction of diabetes and the formulation of preventative initiatives. The authors perform a review of the literature on machine models and suggest an intelligent framework for diabetes prediction based on their findings. Machine learning models are critically examined, and an intelligent machine learning-based architecture for diabetes prediction is proposed and evaluated by the authors. In this study, the authors utilize our framework to develop and assess decision tree (DT)-based random forest (RF) and support vector machine (SVM) learning models for diabetes prediction, which are the most widely used techniques in the literature at the time of writing. It is proposed in this study that a unique intelligent diabetes mellitus prediction framework (IDMPF) is developed using machine learning. According to the framework, it was developed after conducting a rigorous review of existing prediction models in the literature and examining their applicability to diabetes. Using the framework, the authors describe the training procedures, model assessment strategies, and issues associated with diabetes prediction, as well as solutions they provide. The findings of this study may be utilized by health professionals, stakeholders, students, and researchers who are involved in diabetes prediction research and development. The proposed work gives 83% accuracy with the minimum error rate.

...read moreread less

Journal Article•10.3390/biology11121732•

Artificial Intelligence-Based Robust Hybrid Algorithm Design and Implementation for Real-Time Detection of Plant Diseases in Agricultural Environments

[...]

İlayda Yağ, Aytac Altan

29 Nov 2022-Biology

TL;DR: In this article , a robust hybrid classification model based on swarm optimization-supported feature selection, including machine learning and deep learning algorithms, was developed for real-time classification of diseases in apple, grape, and tomato plants.

...read moreread less

Abstract: Simple Summary Plant disease, defined as an abnormal condition that disrupts the normal growth of the plant, is one of the main causes of economic losses in the agricultural industry. Early diagnosis of plant disease is critical to increasing agricultural crop productivity. In this paper, a new robust hybrid classification model based on swarm optimization-supported feature selection, including machine learning and deep learning algorithms, that allows real-time classification of diseases in apple, grape, and tomato plants has been developed. In this way, it will be possible to diagnose the plant disease at an early phase and apply the appropriate treatment. Abstract The early detection and prevention of plant diseases that are an important cause of famine and food insecurity worldwide are very important for increasing agricultural product productivity. Not only the early detection of the plant disease but also the determination of its type play a critical role in determining the appropriate treatment. The fact that visual inspection, which is frequently used in determining plant disease and types, is tiring and prone to human error, necessitated the development of algorithms that can automatically classify plant disease with high accuracy and low computational cost. In this study, a new hybrid plant leaf disease classification model with high accuracy and low computational complexity, consisting of the wrapper approach, including the flower pollination algorithm (FPA) and support vector machine (SVM), and a convolutional neural network (CNN) classifier, is developed with a wrapper-based feature selection approach using metaheuristic optimization techniques. The features of the image dataset consisting of apple, grape, and tomato plants have been extracted by a two-dimensional discrete wavelet transform (2D-DWT) using wavelet families such as biorthogonal, Coiflets, Daubechies, Fejer–Korovkin, and symlets. Features that keep classifier performance high for each family are selected by the wrapper approach, consisting of the population-based metaheuristics FPA and SVM. The performance of the proposed optimization algorithm is compared with the particle swarm optimization (PSO) algorithm. Afterwards, the classification performance is obtained by using the lowest number of features that can keep the classification performance high for the CNN classifier. The CNN classifier with a single layer of classification without a feature extraction layer is used to minimize the complexity of the model and to deal with the model hyperparameter problem. The obtained model is embedded in the NVIDIA Jetson Nano developer kit on the unmanned aerial vehicle (UAV), and real-time classification tests are performed on apple, grape, and tomato plants. The experimental results obtained show that the proposed model classifies the specified plant leaf diseases in real time with high accuracy. Moreover, it is concluded that the robust hybrid classification model, which is created by selecting the lowest number of features with the optimization algorithm with low computational complexity, can classify plant leaf diseases in real time with precision.

...read moreread less

Journal Article•10.32604/CMC.2022.019625•

Data Analytics for the Identification of Fake Reviews Using Supervised Learning

[...]

Saleh Nagi Alsubari, Sachin N. Deshmukh, Ahmed Abdullah Alqarni, Nizar Alsharif, Theyazn H. H. Aldhyani, Fawaz Waselallah Alsaade, Osamah Ibrahim Khalaf - Show less +3 more

01 Jan 2022-Cmc-computers Materials & Continua

Journal Article•10.1016/j.jwpe.2022.102920•

Water quality classification using machine learning algorithms

[...]

N. Nasir, Afreen Kansal, Omar Alshaltone, F. Barneih, Mustafa Sameer, Abdallah Shanableh, Ahmed Al-Shamma'a - Show less +3 more

01 Aug 2022-Journal of water process engineering

TL;DR: In this paper , various machine learning classifiers and their stacking ensemble models were used to classify the WQ data via the Water Quality Index (WQI), and the results revealed that the CATBoost model offered the most accurate classifier with a percentage of 94.51.

...read moreread less

Abstract: Monitoring water quality is essential for protecting human health and the environment and controlling water quality. Artificial Intelligence (AI) offers significant opportunities to help improve the classification and prediction of water quality (WQ). In this study, various AI algorithms are assessed to handle WQ data collected over an extended period and develop a dependable approach for forecasting water quality as accurately as possible. Specifically, various machine learning classifiers and their stacking ensemble models were used to classify the WQ data via the Water Quality Index (WQI). The studied classifiers included Support Vector Machine (SVM), Random Forest (RF), Logistic Regression (LR), Decision Tree (DT), CATBoost, XGBoost, and Multilayer Perceptron (MLP). The dataset used in the study included 1679 samples and their meta-data collected over nine years. In addition, precision-recall curves and Receiver Operating Characteristic curves (ROC) were used to assess the performance of the various classifiers. The findings revealed that the CATBoost model offered the most accurate classifier with a percentage of 94.51. Moreover, after applying stacking ensemble models with all classifiers, accuracy reached 100% in various Meta-classifiers. Furthermore, the CATBoost achieved the highest accuracy as a primary gradient boosting algorithm and a meta classifier. Therefore, the boosting algorithm is proposed as a reliable approach for the WQ classification. The analysis presented in this article presents a framework that can support the efforts of researchers working toward water quality improvement using artificial intelligence.

...read moreread less

Journal Article•10.3390/atmos13020180•

Machine Learning in Weather Prediction and Climate Analyses—Applications and Perspectives

[...]

Bogdan Bochenek, Zbigniew Ustrnul

23 Jan 2022-Atmosphere

TL;DR: In this paper, the authors performed an analysis of the 500 most relevant scientific articles published since 2018, concerning machine learning methods in the field of climate and numerical weather prediction using the Google Scholar search engine.

...read moreread less

Abstract: In this paper, we performed an analysis of the 500 most relevant scientific articles published since 2018, concerning machine learning methods in the field of climate and numerical weather prediction using the Google Scholar search engine. The most common topics of interest in the abstracts were identified, and some of them examined in detail: in numerical weather prediction research—photovoltaic and wind energy, atmospheric physics and processes; in climate research—parametrizations, extreme events, and climate change. With the created database, it was also possible to extract the most commonly examined meteorological fields (wind, precipitation, temperature, pressure, and radiation), methods (Deep Learning, Random Forest, Artificial Neural Networks, Support Vector Machine, and XGBoost), and countries (China, USA, Australia, India, and Germany) in these topics. Performing critical reviews of the literature, authors are trying to predict the future research direction of these fields, with the main conclusion being that machine learning methods will be a key feature in future weather forecasting.

...read moreread less

Journal Article•10.3390/medicina58081090•

Multi-Modal Brain Tumor Detection Using Deep Neural Network and Multiclass SVM

[...]

Sarmad Maqsood, Robertas Damaševičius, Rytis Maskeliūnas

01 Aug 2022-Medicina-lithuania

TL;DR: The proposed brain tumor detection and classification method outperforms other methods both visually and quantitatively, obtaining an accuracy of 97.47% and 98.92%, respectively.

...read moreread less

Abstract: Background and Objectives: Clinical diagnosis has become very significant in today’s health system. The most serious disease and the leading cause of mortality globally is brain cancer which is a key research topic in the field of medical imaging. The examination and prognosis of brain tumors can be improved by an early and precise diagnosis based on magnetic resonance imaging. For computer-aided diagnosis methods to assist radiologists in the proper detection of brain tumors, medical imagery must be detected, segmented, and classified. Manual brain tumor detection is a monotonous and error-prone procedure for radiologists; hence, it is very important to implement an automated method. As a result, the precise brain tumor detection and classification method is presented. Materials and Methods: The proposed method has five steps. In the first step, a linear contrast stretching is used to determine the edges in the source image. In the second step, a custom 17-layered deep neural network architecture is developed for the segmentation of brain tumors. In the third step, a modified MobileNetV2 architecture is used for feature extraction and is trained using transfer learning. In the fourth step, an entropy-based controlled method was used along with a multiclass support vector machine (M-SVM) for the best features selection. In the final step, M-SVM is used for brain tumor classification, which identifies the meningioma, glioma and pituitary images. Results: The proposed method was demonstrated on BraTS 2018 and Figshare datasets. Experimental study shows that the proposed brain tumor detection and classification method outperforms other methods both visually and quantitatively, obtaining an accuracy of 97.47% and 98.92%, respectively. Finally, we adopt the eXplainable Artificial Intelligence (XAI) method to explain the result. Conclusions: Our proposed approach for brain tumor detection and classification has outperformed prior methods. These findings demonstrate that the proposed approach obtained higher performance in terms of both visually and enhanced quantitative evaluation with improved accuracy.

...read moreread less

Journal Article•10.1007/s13369-022-06560-8•

Predictive Performances of Ensemble Machine Learning Algorithms in Landslide Susceptibility Mapping Using Random Forest, Extreme Gradient Boosting (XGBoost) and Natural Gradient Boosting (NGBoost)

[...]

Taskin Kavzoglu, Alihan Teke

17 Jan 2022-Arabian journal for science and engineering

TL;DR: This work intended to propose natural gradient boosting (NGBoost), a novel member of the ensemble learning family, for modeling landslide susceptibility for Macka County of Trabzon province, Turkey, and indicated that the NGBeost method utilized for landslide susceptibility mapping problem for the first time had the greatest predictive ability.

...read moreread less

Journal Article•10.1016/j.neuri.2021.100034•

Multiclass skin cancer classification using EfficientNets – a first step towards preventing skin cancer

[...]

Coyne Deirdre Joan¹•Institutions (1)

Shenyang Normal University¹

01 Dec 2022-Neuroscience Informatics

TL;DR: In this article , the performance of all EfficientNet variants on this imbalanced multiclass classification task using metrics such as Precision, Recall, Accuracy, F1 Score, and Confusion Matrices was evaluated.

...read moreread less

Journal Article•10.1016/j.jrmge.2021.12.011•

Slope stability prediction using ensemble learning techniques: A case study in Yunyang County, Chongqing, China

[...]

Wengang Zhang, Hongrui Li, Liang Han, Longlong Chen, Lina Wang - Show less +1 more

01 Jan 2022-Journal of rock mechanics and geotechnical engineering

TL;DR: Wang et al. as discussed by the authors developed an ensemble learning-based method to predict the slope stability by introducing the random forest (RF) and extreme gradient boosting (XGBoost), which is applied to the stability prediction of 786 landslide cases in Yunyang County, Chongqing, China.

...read moreread less

Abstract: Slope stability prediction plays a significant role in landslide disaster prevention and mitigation. This study develops an ensemble learning-based method to predict the slope stability by introducing the random forest (RF) and extreme gradient boosting (XGBoost). As an illustration, the proposed approach is applied to the stability prediction of 786 landslide cases in Yunyang County, Chongqing, China. For comparison, the predictive performance of RF, XGBoost, support vector machine (SVM), and logistic regression (LR) is systematically investigated based on the well-established confusion matrix, which contains the known indices of recall rate, precision, and accuracy. Furthermore, the feature importance of the 12 influencing variables is also explored. Results show that the accuracy of the XGBoost and RF for both the training and testing data is superior to that of SVM and LR, revealing the superiority of the ensemble learning models (i.e. XGBoost and RF) in the slope stability prediction of Yunyang County. Among the 12 influencing factors, the profile shape is the most important one. The proposed ensemble learning-based method offers a promising way to rationally capture the slope status. It can be extended to the prediction of slope stability of other landslide-prone areas of interest.

...read moreread less

Journal Article•10.1007/s13762-022-04241-5•

Air pollution prediction with machine learning: a case study of Indian cities

[...]

K. Saravana Kumar, B. P. Pande

15 May 2022-International Journal of Environmental Science and Technology

TL;DR: In this article , the authors investigated six years of air pollution data from 23 Indian cities for air quality analysis and prediction, and five machine learning models were employed to predict air quality.

...read moreread less

Abstract: The survival of mankind cannot be imagined without air. Consistent developments in almost all realms of modern human society affected the health of the air adversely. Daily industrial, transport, and domestic activities are stirring hazardous pollutants in our environment. Monitoring and predicting air quality have become essentially important in this era, especially in developing countries like India. In contrast to the traditional methods, the prediction technologies based on machine learning techniques are proved to be the most efficient tools to study such modern hazards. The present work investigates six years of air pollution data from 23 Indian cities for air quality analysis and prediction. The dataset is well preprocessed and key features are selected through the correlation analysis. An exploratory data analysis is exercised to develop insights into various hidden patterns in the dataset and pollutants directly affecting the air quality index are identified. A significant fall in almost all pollutants is observed in the pandemic year, 2020. The data imbalance problem is solved with a resampling technique and five machine learning models are employed to predict air quality. The results of these models are compared with the standard metrics. The Gaussian Naive Bayes model achieves the highest accuracy while the Support Vector Machine model exhibits the lowest accuracy. The performances of these models are evaluated and compared through established performance parameters. The XGBoost model performed the best among the other models and gets the highest linearity between the predicted and actual data.

...read moreread less

Journal Article•10.1007/s10479-022-04575-w•

Comprehensive review on twin support vector machines

[...]

08 Mar 2022-Annals of Operations Research

TL;DR: Twin support vector machine (TWSVM) and twin support vector regression (TSVR) are newly emerging efficient machine learning techniques which offer promising solutions for classification and regression challenges respectively as mentioned in this paper .

...read moreread less

Abstract: Twin support vector machine (TWSVM) and twin support vector regression (TSVR) are newly emerging efficient machine learning techniques which offer promising solutions for classification and regression challenges respectively. TWSVM is based upon the idea to identify two nonparallel hyperplanes which classify the data points to their respective classes. It requires to solve two small sized quadratic programming problems (QPPs) in lieu of solving single large size QPP in support vector machine (SVM) while TSVR is formulated on the lines of TWSVM and requires to solve two SVM kind problems. Although there has been good research progress on these techniques; there is limited literature on the comparison of different variants of TSVR. Thus, this review presents a rigorous analysis of recent research in TWSVM and TSVR simultaneously mentioning their limitations and advantages. To begin with we first introduce the basic theory of support vector machine, TWSVM and then focus on the various improvements and applications of TWSVM, and then we introduce TSVR and its various enhancements. Finally, we suggest future research and development prospects.

...read moreread less

Journal Article•10.1109/access.2022.3166891•

Credit Card Fraud Detection Using State-of-the-Art Machine Learning and Deep Learning Algorithms

[...]

01 Jan 2022-IEEE Access

TL;DR: In this paper , three architectures based on a convolutional neural network are applied to improve fraud detection performance, including Extreme Learning Method, Decision Tree, Random Forest, Support Vector Machine, Logistic Regression and XG Boost.

...read moreread less

Abstract: People can use credit cards for online transactions as it provides an efficient and easy-to-use facility. With the increase in usage of credit cards, the capacity of credit card misuse has also enhanced. Credit card frauds cause significant financial losses for both credit card holders and financial companies. In this research study, the main aim is to detect such frauds, including the accessibility of public data, high-class imbalance data, the changes in fraud nature, and high rates of false alarm. The relevant literature presents many machines learning based approaches for credit card detection, such as Extreme Learning Method, Decision Tree, Random Forest, Support Vector Machine, Logistic Regression and XG Boost. However, due to low accuracy, there is still a need to apply state of the art deep learning algorithms to reduce fraud losses. The main focus has been to apply the recent development of deep learning algorithms for this purpose. Comparative analysis of both machine learning and deep learning algorithms was performed to find efficient outcomes. The detailed empirical analysis is carried out using the European card benchmark dataset for fraud detection. A machine learning algorithm was first applied to the dataset, which improved the accuracy of detection of the frauds to some extent. Later, three architectures based on a convolutional neural network are applied to improve fraud detection performance. Further addition of layers further increased the accuracy of detection. A comprehensive empirical analysis has been carried out by applying variations in the number of hidden layers, epochs and applying the latest models. The evaluation of research work shows the improved results achieved, such as accuracy, f1-score, precision and AUC Curves having optimized values of 99.9%,85.71%,93%, and 98%, respectively. The proposed model outperforms the state-of-the-art machine learning and deep learning algorithms for credit card detection problems. In addition, we have performed experiments by balancing the data and applying deep learning algorithms to minimize the false negative rate. The proposed approaches can be implemented effectively for the real-world detection of credit card fraud.

...read moreread less

...

Expand