Top 5525 papers published in the topic of Support vector machine in 2017

Showing papers on "Support vector machine published in 2017"

Book•

Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems

[...]

13 Mar 2017

TL;DR: This practical book shows you how to implement programs capable of learning from data by using concrete examples, minimal theory, and two production-ready Python frameworks-scikit-learn and TensorFlow-author Aurelien Geron helps you gain an intuitive understanding of the concepts and tools for building intelligent systems.

...read moreread less

Abstract: Through a series of recent breakthroughs, deep learning has boosted the entire field of machine learning. Now, even programmers who know close to nothing about this technology can use simple, efficient tools to implement programs capable of learning from data. This practical book shows you how. By using concrete examples, minimal theory, and two production-ready Python frameworks-scikit-learn and TensorFlow-author Aurelien Geron helps you gain an intuitive understanding of the concepts and tools for building intelligent systems. You'll learn a range of techniques, starting with simple linear regression and progressing to deep neural networks. With exercises in each chapter to help you apply what you've learned, all you need is programming experience to get started. Explore the machine learning landscape, particularly neural nets Use scikit-learn to track an example machine-learning project end-to-end Explore several training models, including support vector machines, decision trees, random forests, and ensemble methods Use the TensorFlow library to build and train neural nets Dive into neural net architectures, including convolutional nets, recurrent nets, and deep reinforcement learning Learn techniques for training and scaling deep neural nets Apply practical code examples without acquiring excessive machine learning theory or algorithm details

...read moreread less

2,758 citations

Journal Article•10.1109/ACCESS.2017.2762418•

A Deep Learning Approach for Intrusion Detection Using Recurrent Neural Networks

[...]

Chuanlong Yin, Yuefei Zhu, Jinlong Fei, Xinzheng He

12 Oct 2017-IEEE Access

TL;DR: The experimental results show that RNN-IDS is very suitable for modeling a classification model with high accuracy and that its performance is superior to that of traditional machine learning classification methods in both binary and multiclass classification.

...read moreread less

Abstract: Intrusion detection plays an important role in ensuring information security, and the key technology is to accurately identify various attacks in the network. In this paper, we explore how to model an intrusion detection system based on deep learning, and we propose a deep learning approach for intrusion detection using recurrent neural networks (RNN-IDS). Moreover, we study the performance of the model in binary classification and multiclass classification, and the number of neurons and different learning rate impacts on the performance of the proposed model. We compare it with those of J48, artificial neural network, random forest, support vector machine, and other machine learning methods proposed by previous researchers on the benchmark data set. The experimental results show that RNN-IDS is very suitable for modeling a classification model with high accuracy and that its performance is superior to that of traditional machine learning classification methods in both binary and multiclass classification. The RNN-IDS model improves the accuracy of the intrusion detection and provides a new research method for intrusion detection.

...read moreread less

1,722 citations

Journal Article•10.1016/J.RENENE.2016.12.095•

Machine learning methods for solar radiation forecasting: A review

[...]

Cyril Voyant¹, Gilles Notton¹, Soteris A. Kalogirou², Marie Laure Nivet¹, Christophe Paoli³, Christophe Paoli¹, Fabrice Motte¹, Alexis Fouilloy¹ - Show less +4 more•Institutions (3)

Centre national de la recherche scientifique¹, Cyprus University of Technology², Galatasaray University³

01 May 2017-Renewable Energy

TL;DR: An overview of forecasting methods of solar irradiation using machine learning approaches is given and it will be shown that other methods begin to be used in this context of prediction.

...read moreread less

1,546 citations

Journal Article•10.1038/NMETH.4236•

SC3: consensus clustering of single-cell RNA-seq data

[...]

Vladimir Yu. Kiselev¹, Kristina Kirschner², Michael T. Schaub³, Michael T. Schaub⁴, Tallulah S. Andrews¹, Andrew Yiu¹, Tamir Chandra⁵, Tamir Chandra¹, Kedar Nath Natarajan⁶, Kedar Nath Natarajan¹, Wolf Reik², Wolf Reik⁵, Wolf Reik¹, Mauricio Barahona⁷, Anthony R. Green², Martin Hemberg¹ - Show less +12 more•Institutions (7)

Wellcome Trust Sanger Institute¹, University of Cambridge², Université catholique de Louvain³, Université de Namur⁴, Babraham Institute⁵, European Bioinformatics Institute⁶, Imperial College London⁷

01 May 2017-Nature Methods

TL;DR: It is demonstrated that SC3 is capable of identifying subclones from the transcriptomes of neoplastic cells collected from patients and achieves high accuracy and robustness by combining multiple clustering solutions through a consensus approach.

...read moreread less

Abstract: Single-cell RNA-seq enables the quantitative characterization of cell types based on global transcriptome profiles. We present single-cell consensus clustering (SC3), a user-friendly tool for unsupervised clustering, which achieves high accuracy and robustness by combining multiple clustering solutions through a consensus approach (http://bioconductor.org/packages/SC3). We demonstrate that SC3 is capable of identifying subclones from the transcriptomes of neoplastic cells collected from patients.

...read moreread less

1,469 citations

Proceedings Article•

A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification

[...]

Ye Zhang¹, Byron C. Wallace²•Institutions (2)

University of Texas at Austin¹, Northeastern University²

1 Nov 2017

TL;DR: A sensitivity analysis of one-layer CNNs is conducted to explore the effect of architecture components on model performance; the aim is to distinguish between important and comparatively inconsequential design decisions for sentence classification.

...read moreread less

Abstract: Convolutional Neural Networks (CNNs) have recently achieved remarkably strong performance on the practically important task of sentence classification (Kim, 2014; Kalchbrenner et al., 2014; Johnson and Zhang, 2014; Zhang et al., 2016). However, these models require practitioners to specify an exact model architecture and set accompanying hyperparameters, including the filter region size, regularization parameters, and so on. It is currently unknown how sensitive model performance is to changes in these configurations for the task of sentence classification. We thus conduct a sensitivity analysis of one-layer CNNs to explore the effect of architecture components on model performance; our aim is to distinguish between important and comparatively inconsequential design decisions for sentence classification. We focus on one-layer CNNs (to the exclusion of more complex models) due to their comparative simplicity and strong empirical performance, which makes it a modern standard baseline method akin to Support Vector Machine (SVMs) and logistic regression. We derive practical advice from our extensive empirical results for those interested in getting the most out of CNNs for sentence classification in real world settings.

...read moreread less

1,325 citations

Journal Article•10.3390/S18010018•

Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery

[...]

Phan Thanh Noi¹, Martin Kappas¹•Institutions (1)

University of Göttingen¹

22 Dec 2017-Sensors

TL;DR: This study examined and compared the performances of the RF, kNN, and SVM classifiers for land use/cover classification using Sentinel-2 image data and found that SVM produced the highest OA with the least sensitivity to the training sample sizes.

...read moreread less

Abstract: In previous classification studies, three non-parametric classifiers, Random Forest (RF), k-Nearest Neighbor (kNN), and Support Vector Machine (SVM), were reported as the foremost classifiers at producing high accuracies. However, only a few studies have compared the performances of these classifiers with different training sample sizes for the same remote sensing images, particularly the Sentinel-2 Multispectral Imager (MSI). In this study, we examined and compared the performances of the RF, kNN, and SVM classifiers for land use/cover classification using Sentinel-2 image data. An area of 30 × 30 km2 within the Red River Delta of Vietnam with six land use/cover types was classified using 14 different training sample sizes, including balanced and imbalanced, from 50 to over 1250 pixels/class. All classification results showed a high overall accuracy (OA) ranging from 90% to 95%. Among the three classifiers and 14 sub-datasets, SVM produced the highest OA with the least sensitivity to the training sample sizes, followed consecutively by RF and kNN. In relation to the sample size, all three classifiers showed a similar and high OA (over 93.85%) when the training sample size was large enough, i.e., greater than 750 pixels/class or representing an area of approximately 0.25% of the total study area. The high accuracy was achieved with both imbalanced and balanced datasets.

...read moreread less

1,206 citations

Journal Article•10.1371/JOURNAL.PONE.0177678•

Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric.

[...]

Sabri Boughorbel¹, Fethi Jarray², Mohammed Elanbari¹•Institutions (2)

Qatar Airways¹, Conservatoire national des arts et métiers²

02 Jun 2017-PLOS ONE

TL;DR: The proposed MCC-classifier has a close performance to SVM-imba while being simpler and more efficient and an optimal Bayes classifier for the MCC metric using an approach based on Frechet derivative.

...read moreread less

Abstract: Data imbalance is frequently encountered in biomedical applications Resampling techniques can be used in binary classification to tackle this issue However such solutions are not desired when the number of samples in the small class is limited Moreover the use of inadequate performance metrics, such as accuracy, lead to poor generalization results because the classifiers tend to predict the largest size class One of the good approaches to deal with this issue is to optimize performance metrics that are designed to handle data imbalance Matthews Correlation Coefficient (MCC) is widely used in Bioinformatics as a performance metric We are interested in developing a new classifier based on the MCC metric to handle imbalanced data We derive an optimal Bayes classifier for the MCC metric using an approach based on Frechet derivative We show that the proposed algorithm has the nice theoretical property of consistency Using simulated data, we verify the correctness of our optimality result by searching in the space of all possible binary classifiers The proposed classifier is evaluated on 64 datasets from a wide range data imbalance We compare both classification performance and CPU efficiency for three classifiers: 1) the proposed algorithm (MCC-classifier), the Bayes classifier with a default threshold (MCC-base) and imbalanced SVM (SVM-imba) The experimental evaluation shows that MCC-classifier has a close performance to SVM-imba while being simpler and more efficient

...read moreread less

1,171 citations

Journal Article•10.1016/J.ESWA.2017.04.006•

Machine learning models and bankruptcy prediction

[...]

Flavio Barboza, Herbert Kimura, Edward I. Altman¹•Institutions (1)

New York University¹

15 Oct 2017-Expert Systems With Applications

TL;DR: This study tests machine learning models to predict bankruptcy one year prior to the event, and finds that bagging, boosting, and random forest models outperform the others techniques, and that all prediction accuracy in the testing sample improves when the additional variables are included.

...read moreread less

Abstract: Machine learning models show improved bankruptcy prediction accuracy over traditional models.Various models were tested using different accuracy metrics.Boosting, bagging, and random forest models provide better results. There has been intensive research from academics and practitioners regarding models for predicting bankruptcy and default events, for credit risk management. Seminal academic research has evaluated bankruptcy using traditional statistics techniques (e.g. discriminant analysis and logistic regression) and early artificial intelligence models (e.g. artificial neural networks). In this study, we test machine learning models (support vector machines, bagging, boosting, and random forest) to predict bankruptcy one year prior to the event, and compare their performance with results from discriminant analysis, logistic regression, and neural networks. We use data from 1985 to 2013 on North American firms, integrating information from the Salomon Center database and Compustat, analysing more than 10,000 firm-year observations. The key insight of the study is a substantial improvement in prediction accuracy using machine learning techniques especially when, in addition to the original Altmans Z-score variables, we include six complementary financial indicators. Based on Carton and Hofer (2006), we use new variables, such as the operating margin, change in return-on-equity, change in price-to-book, and growth measures related to assets, sales, and number of employees, as predictive variables. Machine learning models show, on average, approximately 10% more accuracy in relation to traditional models. Comparing the best models, with all predictive variables, the machine learning technique related to random forest led to 87% accuracy, whereas logistic regression and linear discriminant analysis led to 69% and 50% accuracy, respectively, in the testing sample. We find that bagging, boosting, and random forest models outperform the others techniques, and that all prediction accuracy in the testing sample improves when the additional variables are included. Our research adds to the discussion of the continuing debate about superiority of computational methods over statistical techniques such as in Tsai, Hsu, and Yen (2014) and Yeh, Chi, and Lin (2014). In particular, for machine learning mechanisms, we do not find SVM to lead to higher accuracy rates than other models. This result contradicts outcomes from Danenas and Garsva (2015) and Cleofas-Sanchez, Garcia, Marques, and Senchez (2016), but corroborates, for instance, Wang, Ma, and Yang (2014), Liang, Lu, Tsai, and Shih (2016), and Cano etal. (2017). Our study supports the applicability of the expert systems by practitioners as in Heo and Yang (2014), Kim, Kang, and Kim (2015) and Xiao, Xiao, and Wang (2016).

...read moreread less

732 citations

Proceedings Article•10.1109/CVPR.2017.510•

Large Margin Object Tracking with Circulant Feature Maps

[...]

Mengmeng Wang¹, Yong Liu¹, Zeyi Huang•Institutions (1)

Zhejiang University¹

1 Jul 2017

TL;DR: Wang et al. as discussed by the authors proposed a large margin object tracking method, which absorbs the strong discriminative ability from structured output SVM and speeds up by the correlation filter algorithm significantly.

...read moreread less

Abstract: Structured output support vector machine (SVM) based tracking algorithms have shown favorable performance recently. Nonetheless, the time-consuming candidate sampling and complex optimization limit their real-time applications. In this paper, we propose a novel large margin object tracking method which absorbs the strong discriminative ability from structured output SVM and speeds up by the correlation filter algorithm significantly. Secondly, a multimodal target detection technique is proposed to improve the target localization precision and prevent model drift introduced by similar objects or background noise. Thirdly, we exploit the feedback from high-confidence tracking results to avoid the model corruption problem. We implement two versions of the proposed tracker with the representations from both conventional hand-crafted and deep convolution neural networks (CNNs) based features to validate the strong compatibility of the algorithm. The experimental results demonstrate that the proposed tracker performs superiorly against several state-of-the-art algorithms on the challenging benchmark sequences while runs at speed in excess of 80 frames per second.

...read moreread less

682 citations

Book Chapter•10.1007/978-3-319-69155-8_9•

Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques

[...]

Hadeer Ahmed¹, Issa Traore¹, Sherif Saad²•Institutions (2)

University of Victoria¹, University of Windsor²

25 Oct 2017

TL;DR: A fake news detection model that use n-gram analysis and machine learning techniques is proposed, which investigates and compares two different features extraction techniques and six different machine classification techniques.

...read moreread less

Abstract: Fake news is a phenomenon which is having a significant impact on our social life, in particular in the political world. Fake news detection is an emerging research area which is gaining interest but involved some challenges due to the limited amount of resources (i.e., datasets, published literature) available. We propose in this paper, a fake news detection model that use n-gram analysis and machine learning techniques. We investigate and compare two different features extraction techniques and six different machine classification techniques. Experimental evaluation yields the best performance using Term Frequency-Inverted Document Frequency (TF-IDF) as feature extraction technique, and Linear Support Vector Machine (LSVM) as a classifier, with an accuracy of 92%.

...read moreread less

662 citations

Journal Article•10.1109/TGRS.2019.2899129•

Cascaded Recurrent Neural Networks for Hyperspectral Image Classification

[...]

Renlong Hang¹, Qingshan Liu¹, Danfeng Hong², Pedram Ghamisi³•Institutions (3)

Nanjing University of Information Science and Technology¹, German Aerospace Center², Helmholtz-Zentrum Dresden-Rossendorf³

28 Apr 2017-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: Wang et al. as discussed by the authors proposed a sequence-based recurrent neural network (RNN) for hyperspectral image classification, which makes use of a newly proposed activation function, parametric rectified tanh (PRetanh), instead of the popular tanh or rectified linear unit.

...read moreread less

Abstract: In recent years, vector-based machine learning algorithms, such as random forests, support vector machines, and 1-D convolutional neural networks, have shown promising results in hyperspectral image classification. Such methodologies, nevertheless, can lead to information loss in representing hyperspectral pixels, which intrinsically have a sequence-based data structure. A recurrent neural network (RNN), an important branch of the deep learning family, is mainly designed to handle sequential data. Can sequence-based RNN be an effective method of hyperspectral image classification? In this paper, we propose a novel RNN model that can effectively analyze hyperspectral pixels as sequential data and then determine information categories via network reasoning. As far as we know, this is the first time that an RNN framework has been proposed for hyperspectral image classification. Specifically, our RNN makes use of a newly proposed activation function, parametric rectified tanh (PRetanh), for hyperspectral sequential data analysis instead of the popular tanh or rectified linear unit. The proposed activation function makes it possible to use fairly high learning rates without the risk of divergence during the training procedure. Moreover, a modified gated recurrent unit, which uses PRetanh for hidden representation, is adopted to construct the recurrent layer in our network to efficiently process hyperspectral data and reduce the total number of parameters. Experimental results on three airborne hyperspectral images suggest competitive performance in the proposed mode. In addition, the proposed network architecture opens a new window for future research, showcasing the huge potential of deep recurrent networks for hyperspectral data analysis.

...read moreread less

Journal Article•10.1016/J.INS.2016.04.019•

Fuzziness based semi-supervised learning approach for intrusion detection system

[...]

Rana Aamir Raza Ashfaq¹, Xizhao Wang¹, Joshua Zhexue Huang¹, Haider Abbas², Yulin He¹ - Show less +1 more•Institutions (2)

Shenzhen University¹, King Saud University²

01 Feb 2017-Information Sciences

TL;DR: A novel fuzziness based semi-supervised learning approach by utilizing unlabeled samples assisted with supervised learning algorithm to improve the classifier's performance for the IDSs is proposed.

...read moreread less

Journal Article•10.1109/TBME.2016.2535311•

A Discriminatively Trained Fully Connected Conditional Random Field Model for Blood Vessel Segmentation in Fundus Images

[...]

José Ignacio Orlando¹, Elena Prokofyeva², Matthew B. Blaschko³•Institutions (3)

National Scientific and Technical Research Council¹, French Institute of Health and Medical Research², Katholieke Universiteit Leuven³

01 Jan 2017-IEEE Transactions on Biomedical Engineering

TL;DR: Results suggest that this method for blood vessel segmentation in fundus images based on a discriminatively trained fully connected conditional random field model is suitable for the task of segmenting elongated structures, a feature that can be exploited to contribute with other medical and biological applications.

...read moreread less

Abstract: Goal: In this work, we present an extensive description and evaluation of our method for blood vessel segmentation in fundus images based on a discriminatively trained fully connected conditional random field model. Methods: Standard segmentation priors such as a Potts model or total variation usually fail when dealing with thin and elongated structures. We overcome this difficulty by using a conditional random field model with more expressive potentials, taking advantage of recent results enabling inference of fully connected models almost in real time. Parameters of the method are learned automatically using a structured output support vector machine, a supervised technique widely used for structured prediction in a number of machine learning applications. Results: Our method, trained with state of the art features, is evaluated both quantitatively and qualitatively on four publicly available datasets: DRIVE, STARE, CHASEDB1, and HRF. Additionally, a quantitative comparison with respect to other strategies is included. Conclusion: The experimental results show that this approach outperforms other techniques when evaluated in terms of sensitivity, F1-score, G-mean, and Matthews correlation coefficient. Additionally, it was observed that the fully connected model is able to better distinguish the desired structures than the local neighborhood-based approach. Significance: Results suggest that this method is suitable for the task of segmenting elongated structures, a feature that can be exploited to contribute with other medical and biological applications.

...read moreread less

Journal Article•10.1016/J.PATCOG.2017.05.025•

Handcrafted vs. non-handcrafted features for computer vision classification

[...]

Loris Nanni¹, Stefano Ghidoni¹, Sheryl Brahnam²•Institutions (2)

University of Padua¹, Missouri State University²

01 Nov 2017-Pattern Recognition

TL;DR: A generic computer vision system designed for exploiting trained deep Convolutional Neural Networks as a generic feature extractor and mixing these features with more traditional hand-crafted features is presented, demonstrating the generalizability of the proposed approach.

...read moreread less

Proceedings Article•10.1109/ICCNI.2017.8123782•

Credit card fraud detection using machine learning techniques: A comparative analysis

[...]

John O. Awoyemi¹, Adebayo Olusola Adetunmbi¹, Samuel A. Oluwadare¹•Institutions (1)

Federal University of Technology Akure¹

1 Oct 2017

TL;DR: Investigation of the performance of naïve bayes, k-nearest neighbor and logistic regression on highly skewed credit card fraud data shows that k-NEarest neighbour performs better than naive bayes and logistics regression techniques.

...read moreread less

Abstract: Financial fraud is an ever growing menace with far consequences in the financial industry. Data mining had played an imperative role in the detection of credit card fraud in online transactions. Credit card fraud detection, which is a data mining problem, becomes challenging due to two major reasons — first, the profiles of normal and fraudulent behaviours change constantly and secondly, credit card fraud data sets are highly skewed. The performance of fraud detection in credit card transactions is greatly affected by the sampling approach on dataset, selection of variables and detection technique(s) used. This paper investigates the performance of naive bayes, k-nearest neighbor and logistic regression on highly skewed credit card fraud data. Dataset of credit card transactions is sourced from European cardholders containing 284,807 transactions. A hybrid technique of under-sampling and oversampling is carried out on the skewed data. The three techniques are applied on the raw and preprocessed data. The work is implemented in Python. The performance of the techniques is evaluated based on accuracy, sensitivity, specificity, precision, Matthews correlation coefficient and balanced classification rate. The results shows of optimal accuracy for naive bayes, k-nearest neighbor and logistic regression classifiers are 97.92%, 97.69% and 54.86% respectively. The comparative results show that k-nearest neighbour performs better than naive bayes and logistic regression techniques.

...read moreread less

Journal Article•10.1109/JSYST.2014.2341597•

Detecting Stealthy False Data Injection Using Machine Learning in Smart Grid

[...]

Mohammad Esmalifalak¹, Lanchao Liu¹, Nam Tuan Nguyen, Rong Zheng², Zhu Han¹ - Show less +1 more•Institutions (2)

University of Houston¹, McMaster University²

01 Sep 2017-IEEE Systems Journal

TL;DR: It is shown how normal operations of power networks can be statistically distinguished from the case under stealthy attacks, and two machine-learning-based techniques for stealthy attack detection are proposed.

...read moreread less

Abstract: Aging power industries, together with the increase in demand from industrial and residential customers, are the main incentive for policy makers to define a road map to the next-generation power system called the smart grid. In the smart grid, the overall monitoring costs will be decreased, but at the same time, the risk of cyber attacks might be increased. Recently, a new type of attacks (called the stealth attack) has been introduced, which cannot be detected by the traditional bad data detection using state estimation. In this paper, we show how normal operations of power networks can be statistically distinguished from the case under stealthy attacks. We propose two machine-learning-based techniques for stealthy attack detection. The first method utilizes supervised learning over labeled data and trains a distributed support vector machine (SVM). The design of the distributed SVM is based on the alternating direction method of multipliers, which offers provable optimality and convergence rate. The second method requires no training data and detects the deviation in measurements. In both methods, principal component analysis is used to reduce the dimensionality of the data to be processed, which leads to lower computation complexities. The results of the proposed detection methods on IEEE standard test systems demonstrate the effectiveness of both schemes.

...read moreread less

Proceedings Article•10.1109/ICMLA.2017.0-134•

HDLTex: Hierarchical Deep Learning for Text Classification

[...]

Kamran Kowsari¹, Donald E. Brown¹, Mojtaba Heidarysafa¹, Kiana Jafari Meimandi¹, Matthew S. Gerber¹, Laura E. Barnes¹ - Show less +2 more•Institutions (1)

University of Virginia¹

24 Sep 2017

TL;DR: Hierarchical Deep Learning for Text classification employs stacks of deep learning architectures to provide specialized understanding at each level of the document hierarchy.

...read moreread less

Abstract: Increasingly large document collections require improved information processing methods for searching, retrieving, and organizing text. Central to these information processing methods is document classification, which has become an important application for supervised learning. Recently the performance of traditional supervised classifiers has degraded as the number of documents has increased. This is because along with growth in the number of documents has come an increase in the number of categories. This paper approaches this problem differently from current document classification methods that view the problem as multi-class classification. Instead we perform hierarchical classification using an approach we call Hierarchical Deep Learning for Text classification (HDLTex). HDLTex employs stacks of deep learning architectures to provide specialized understanding at each level of the document hierarchy.

...read moreread less

Journal Article•10.1109/TIP.2019.2891888•

Deep Representation Learning with Part Loss for Person Re-Identification

[...]

Hantao Yao, Shiliang Zhang, Yongdong Zhang, Jintao Li, Qi Tian - Show less +1 more

04 Jul 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: Wang et al. as mentioned in this paper proposed part loss, which automatically generates several parts for an image, and computes the person classification loss on each part separately, which enforces the deep network to focus on the entire human body and learn discriminative representations for different parts.

...read moreread less

Abstract: Learning discriminative representations for unseen person images is critical for person Re-Identification (ReID). Most of current approaches learn deep representations in classification tasks, which essentially minimize the empirical classification risk on the training set. As shown in our experiments, such representations commonly focus on several body parts discriminative to the training set, rather than the entire human body. Inspired by the structural risk minimization principle in SVM, we revise the traditional deep representation learning procedure to minimize both the empirical classification risk and the representation learning risk. The representation learning risk is evaluated by the proposed part loss, which automatically generates several parts for an image, and computes the person classification loss on each part separately. Compared with traditional global classification loss, simultaneously considering multiple part loss enforces the deep network to focus on the entire human body and learn discriminative representations for different parts. Experimental results on three datasets, i.e., Market1501, CUHK03, VIPeR, show that our representation outperforms the existing deep representations.

...read moreread less

Journal Article•10.3390/S17112556•

Deep Recurrent Neural Networks for Human Activity Recognition.

[...]

Abdulmajid Murad¹, Jae-Young Pyun¹•Institutions (1)

Chosun University¹

06 Nov 2017-Sensors

TL;DR: Experimental results show that the proposed deep recurrent neural networks (DRNNs) used for building recognition models that are capable of capturing long-range dependencies in variable-length input sequences yield better performance than other deep learning techniques, such as deep believe networks (DBNs) and CNNs.

...read moreread less

Abstract: Adopting deep learning methods for human activity recognition has been effective in extracting discriminative features from raw input sequences acquired from body-worn sensors. Although human movements are encoded in a sequence of successive samples in time, typical machine learning methods perform recognition tasks without exploiting the temporal correlations between input data samples. Convolutional neural networks (CNNs) address this issue by using convolutions across a one-dimensional temporal sequence to capture dependencies among input data. However, the size of convolutional kernels restricts the captured range of dependencies between data samples. As a result, typical models are unadaptable to a wide range of activity-recognition configurations and require fixed-length input windows. In this paper, we propose the use of deep recurrent neural networks (DRNNs) for building recognition models that are capable of capturing long-range dependencies in variable-length input sequences. We present unidirectional, bidirectional, and cascaded architectures based on long short-term memory (LSTM) DRNNs and evaluate their effectiveness on miscellaneous benchmark datasets. Experimental results show that our proposed models outperform methods employing conventional machine learning, such as support vector machine (SVM) and k-nearest neighbors (KNN). Additionally, the proposed models yield better performance than other deep learning techniques, such as deep believe networks (DBNs) and CNNs.

...read moreread less

Journal Article•10.1109/TPAMI.2016.2601099•

Object Detection Networks on Convolutional Feature Maps

[...]

Shaoqing Ren¹, Kaiming He², Ross Girshick³, Xiangyu Zhang⁴, Jian Sun² - Show less +1 more•Institutions (4)

University of Science and Technology of China¹, Microsoft², Facebook³, Xi'an Jiaotong University⁴

01 Jul 2017-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: In this article, a network on convolutional feature maps (NoC) is proposed for object detection, which uses shared, region-independent CNN features to improve the performance of object detection.

...read moreread less

Abstract: Most object detectors contain two important components: a feature extractor and an object classifier. The feature extractor has rapidly evolved with significant research efforts leading to better deep convolutional architectures. The object classifier, however, has not received much attention and many recent systems (like SPPnet and Fast/Faster R-CNN) use simple multi-layer perceptrons. This paper demonstrates that carefully designing deep networks for object classification is just as important. We experiment with region-wise classifier networks that use shared, region-independent convolutional features. We call them “Networks on Convolutional feature maps” (NoCs). We discover that aside from deep feature maps, a deep and convolutional per-region classifier is of particular importance for object detection, whereas latest superior image classification models (such as ResNets and GoogLeNets) do not directly lead to good detection accuracy without using such a per-region classifier. We show by experiments that despite the effective ResNets and Faster R-CNN systems, the design of NoCs is an essential element for the 1st-place winning entries in ImageNet and MS COCO challenges 2015.

...read moreread less

Journal Article•10.1016/J.JKSUCI.2015.12.004•

Intrusion detection model using fusion of chi-square feature selection and multi class SVM

[...]

Ikram Sumaiya Thaseen¹, Cherukuri Aswani Kumar¹•Institutions (1)

VIT University¹

01 Oct 2017-Journal of King Saud University - Computer and Information Sciences

TL;DR: The main idea behind this model is to construct a multi class SVM which has not been adopted for IDS so far to decrease the training and testing time and increase the individual classification accuracy of the network attacks.

...read moreread less

Journal Article•10.1016/J.COMPAG.2017.03.016•

Vision-based pest detection based on SVM classification method

[...]

M.A. Ebrahimi¹, Mohammad Hadi Khoshtaghaza¹, Saeid Minaei¹, Bahareh Jamshidi•Institutions (1)

Tarbiat Modares University¹

01 May 2017-Computers and Electronics in Agriculture

TL;DR: Results show that using SVM method with region index and intensify as color index make the best classification with mean percent error of less than 2.25%.

...read moreread less

Journal Article•

Persistence images: a stable vector representation of persistent homology

[...]

Henry Adams¹, Tegan Emerson¹, Michael Kirby¹, Rachel Neville¹, Chris Peterson¹, Patrick D. Shipman¹, Sofya Chepushtanova², Eric Hanson³, Francis C. Motta⁴, Lori Ziegelmeier⁵ - Show less +6 more•Institutions (5)

Colorado State University¹, Wilkes University², Texas Christian University³, Duke University⁴, Macalester College⁵

01 Jan 2017-Journal of Machine Learning Research

TL;DR: In this article, a persistence diagram (PD) is converted to a finite-dimensional vector representation which is called a persistence image (PI) and proved the stability of this transformation with respect to small perturbations in the inputs.

...read moreread less

Abstract: Many data sets can be viewed as a noisy sampling of an underlying space, and tools from topological data analysis can characterize this structure for the purpose of knowledge discovery. One such tool is persistent homology, which provides a multiscale description of the homological features within a data set. A useful representation of this homological information is a persistence diagram (PD). Efforts have been made to map PDs into spaces with additional structure valuable to machine learning tasks. We convert a PD to a finite-dimensional vector representation which we call a persistence image (PI), and prove the stability of this transformation with respect to small perturbations in the inputs. The discriminatory power of PIs is compared against existing methods, showing significant performance gains. We explore the use of PIs with vector-based machine learning tools, such as linear sparse support vector machines, which identify features containing discriminating topological information. Finally, high accuracy inference of parameter values from the dynamic output of a discrete dynamical system (the linked twist map) and a partial differential equation (the anisotropic Kuramoto-Sivashinsky equation) provide a novel application of the discriminatory power of PIs.

...read moreread less

Journal Article•10.1016/J.ESWA.2016.09.041•

Multi-level hybrid support vector machine and extreme learning machine based on modified K-means for intrusion detection system

[...]

Wathiq Laftah Al-Yaseen, Zulaiha Ali Othman, Mohd Zakree Ahmad Nazri

01 Jan 2017-Expert Systems With Applications

TL;DR: A multi-level hybrid intrusion detection model that uses support vector machine and extreme learning machine to improve the efficiency of detecting known and unknown attacks and a modified K-means algorithm is proposed to build a high-quality training dataset that contributes significantly to improving the performance of classifiers.

...read moreread less

Abstract: Reduction the 10%KDD training dataset up to 99.8% by using modified K-means.New high quality training datasets are constructed for training SVM and ELM.Multi-level model is proposed to improve the performance of detection accuracy.Improve the detection rate of DoS, U2R and R2L attacks.Overall accuracy of 95.75% is achieved with whole Corrected KDD dataset. Intrusion detection has become essential to network security because of the increasing connectivity between computers. Several intrusion detection systems have been developed to protect networks using different statistical methods and machine learning techniques. This study aims to design a model that deals with real intrusion detection problems in data analysis and classify network data into normal and abnormal behaviors. This study proposes a multi-level hybrid intrusion detection model that uses support vector machine and extreme learning machine to improve the efficiency of detecting known and unknown attacks. A modified K-means algorithm is also proposed to build a high-quality training dataset that contributes significantly to improving the performance of classifiers. The modified K-means is used to build new small training datasets representing the entire original training dataset, significantly reduce the training time of classifiers, and improve the performance of intrusion detection system. The popular KDD Cup 1999 dataset is used to evaluate the proposed model. Compared with other methods based on the same dataset, the proposed model shows high efficiency in attack detection, and its accuracy (95.75%) is the best performance thus far.

...read moreread less

Journal Article•10.1016/J.ASOC.2017.01.015•

Empirical Mode Decomposition based ensemble deep learning for load demand time series forecasting

[...]

Xueheng Qiu¹, Ye Ren¹, Ponnuthurai Nagaratnam Suganthan¹, Gehan A. J. Amaratunga²•Institutions (2)

Nanyang Technological University¹, University of Cambridge²

1 May 2017

TL;DR: An ensemble deep learning method has been proposed for load demand forecasting that composes of Empirical Mode Decomposition and Deep Belief Network and results demonstrated attractiveness of the proposed method compared with nine forecasting methods.

...read moreread less

Abstract: Graphical abstractDisplay Omitted HighlightsAn ensemble deep learning method has been proposed for load demand forecasting.The hybrid method composes of Empirical Mode Decomposition and Deep Belief Network.Empirical Mode Decomposition based methods outperform the single structure models.Deep learning shows more advantages when the forecasting horizon increases. Load demand forecasting is a critical process in the planning of electric utilities. An ensemble method composed of Empirical Mode Decomposition (EMD) algorithm and deep learning approach is presented in this work. For this purpose, the load demand series were first decomposed into several intrinsic mode functions (IMFs). Then a Deep Belief Network (DBN) including two restricted Boltzmann machines (RBMs) was used to model each of the extracted IMFs, so that the tendencies of these IMFs can be accurately predicted. Finally, the prediction results of all IMFs can be combined by either unbiased or weighted summation to obtain an aggregated output for load demand. The electricity load demand data sets from Australian Energy Market Operator (AEMO) are used to test the effectiveness of the proposed EMD-based DBN approach. Simulation results demonstrated attractiveness of the proposed method compared with nine forecasting methods.

...read moreread less

Journal Article•10.1016/J.EJOR.2017.12.001•

A support vector machine-based ensemble algorithm for breast cancer diagnosis

[...]

Haifeng Wang¹, Bichen Zheng¹, Sang Won Yoon¹, Hoo Sang Ko²•Institutions (2)

Binghamton University¹, Southern Illinois University Edwardsville²

01 Dec 2017-European Journal of Operational Research

TL;DR: The proposed WAUCE model achieves a higher accuracy with a significantly lower variance for breast cancer diagnosis compared to five other ensemble mechanisms and two common ensemble models, i.e., adaptive boosting and bagging classification tree.

...read moreread less

Journal Article•10.1016/J.ESWA.2017.04.003•

An up-to-date comparison of state-of-the-art classification algorithms

[...]

Chongsheng Zhang¹, Changchang Liu¹, Xiangliang Zhang², George Almpanidis¹•Institutions (2)

Henan University¹, King Abdullah University of Science and Technology²

01 Oct 2017-Expert Systems With Applications

TL;DR: It is found that Stochastic Gradient Boosting Trees (GBDT) matches or exceeds the prediction performance of Support Vector Machines and Random Forests, while being the fastest algorithm in terms of prediction efficiency.

...read moreread less

Abstract: Up-to-date report on the accuracy and efficiency of state-of-the-art classifiers.We compare the accuracy of 11 classification algorithms pairwise and groupwise.We examine separately the training, parameter-tuning, and testing time.GBDT and Random Forests yield highest accuracy, outperforming SVM.GBDT is the fastest in testing, Naive Bayes the fastest in training. Current benchmark reports of classification algorithms generally concern common classifiers and their variants but do not include many algorithms that have been introduced in recent years. Moreover, important properties such as the dependency on number of classes and features and CPU running time are typically not examined. In this paper, we carry out a comparative empirical study on both established classifiers and more recently proposed ones on 71 data sets originating from different domains, publicly available at UCI and KEEL repositories. The list of 11 algorithms studied includes Extreme Learning Machine (ELM), Sparse Representation based Classification (SRC), and Deep Learning (DL), which have not been thoroughly investigated in existing comparative studies. It is found that Stochastic Gradient Boosting Trees (GBDT) matches or exceeds the prediction performance of Support Vector Machines (SVM) and Random Forests (RF), while being the fastest algorithm in terms of prediction efficiency. ELM also yields good accuracy results, ranking in the top-5, alongside GBDT, RF, SVM, and C4.5 but this performance varies widely across all data sets. Unsurprisingly, top accuracy performers have average or slow training time efficiency. DL is the worst performer in terms of accuracy but second fastest in prediction efficiency. SRC shows good accuracy performance but it is the slowest classifier in both training and testing.

...read moreread less

Journal Article•10.1016/J.CHB.2017.01.047•

Evaluating the effectiveness of educational data mining techniques for early prediction of students' academic failure in introductory programming courses

[...]

Evandro Costa¹, Baldoino Fonseca¹, Marcelo A. Santana¹, Fabrsia Ferreira de Arajo², Joilson B. A. Rego³ - Show less +1 more•Institutions (3)

Federal University of Alagoas¹, Federal University of Campina Grande², Federal University of Rio Grande do Norte³

01 Aug 2017-Computers in Human Behavior

TL;DR: The results showed that the techniques analyzed are able to early identify students likely to fail, the effectiveness of some of these techniques is improved after applying the data preprocessing and/or algorithms fine-tuning, and the support vector machine technique outperforms the other ones in a statistically significant way.

...read moreread less

Book•

Kernel Mean Embedding of Distributions: A Review and Beyond

[...]

Krikamol Muandet¹, Kenji Fukumizu, Bharath K. Sriperumbudur², Bernhard Schölkopf¹•Institutions (2)

Max Planck Society¹, Pennsylvania State University²

28 Jun 2017

TL;DR: The kernel mean embedding (KME) as discussed by the authors is a generalization of the original feature map of support vector machines (SVMs) and other kernel methods, and it can be viewed as a generalisation of the SVM feature map.

...read moreread less

Abstract: A Hilbert space embedding of a distribution—in short, a kernel mean embedding—has recently emerged as a powerful tool for machine learning and statistical inference. The basic idea behind this framework is to map distributions into a reproducing kernel Hilbert space (RKHS) in which the whole arsenal of kernel methods can be extended to probability measures. It can be viewed as a generalization of the original “feature map” common to support vector machines (SVMs) and other kernel methods. In addition to the classical applications of kernel methods, the kernel mean embedding has found novel applications in ﬁelds ranging from probabilistic modeling to statistical inference, causal discovery, and deep learning. Kernel Mean Embedding of Distributions: A Review and Beyond provides a comprehensive review of existing work and recent advances in this research area, and to discuss some of the most challenging issues and open problems that could potentially lead to new research directions. The targeted audience includes graduate students and researchers in machine learning and statistics who are interested in the theory and applications of kernel mean embeddings.

...read moreread less

Journal Article•10.1109/TNNLS.2017.2682102•

A New Neural Dynamic Classification Algorithm

[...]

Mohammad Hossein Rafiei¹, Hojjat Adeli¹•Institutions (1)

Ohio State University¹

25 Jul 2017-IEEE Transactions on Neural Networks

TL;DR: A new supervised classification algorithm, called neural dynamic classification (NDC), is presented with the goal of finding the optimum number of features required for accurate classification using the patented robust neural dynamic optimization model of Adeli and Park.

...read moreread less

Abstract: The keys for the development of an effective classification algorithm are: 1) discovering feature spaces with large margins between clusters and close proximity of the classmates and 2) discovering the smallest number of the features to perform accurate classification. In this paper, a new supervised classification algorithm, called neural dynamic classification (NDC), is presented with the goal of: 1) discovering the most effective feature spaces and 2) finding the optimum number of features required for accurate classification using the patented robust neural dynamic optimization model of Adeli and Park. The new classification algorithm is compared with the probabilistic neural network (PNN), enhanced PNN (EPNN), and support vector machine using two sets of classification problems. The first set consists of five standard benchmark problems. The second set is a large benchmark problem called Mixed National Institute of Standards and Technology database of handwritten digits. In general, NDC yields the most accurate classification results followed by EPNN. A beauty of the new algorithm is the smoothness of convergence curves which is an indication of robustness and good performance of the algorithm. The main aim is to maximize the prediction accuracy.

...read moreread less

...

Expand