Top 2808 papers published in the topic of Support vector machine in 2007

Showing papers on "Support vector machine published in 2007"

Proceedings Article•10.1145/1273496.1273592•

Self-taught learning: transfer learning from unlabeled data

[...]

Rajat Raina¹, Alexis Battle¹, Honglak Lee¹, Benjamin Packer¹, Andrew Y. Ng¹ - Show less +1 more•Institutions (1)

20 Jun 2007

TL;DR: An approach to self-taught learning that uses sparse coding to construct higher-level features using the unlabeled data to form a succinct input representation and significantly improve classification performance.

...read moreread less

Abstract: We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabeled data follows the same class labels or generative distribution as the labeled data. Thus, we would like to use a large number of unlabeled images (or audio samples, or text documents) randomly downloaded from the Internet to improve performance on a given image (or audio, or text) classification task. Such unlabeled data is significantly easier to obtain than in typical semi-supervised or transfer learning settings, making self-taught learning widely applicable to many practical learning problems. We describe an approach to self-taught learning that uses sparse coding to construct higher-level features using the unlabeled data. These features form a succinct input representation and significantly improve classification performance. When using an SVM for classification, we further show how a Fisher kernel can be learned for this representation.

...read moreread less

1,970 citations

Support Vector Regression

[...]

Debasish Basak, Srimanta Pal¹, Dipak Chandra Patranabis•Institutions (1)

Indian Statistical Institute¹

1 Jan 2007

TL;DR: An attempt has been made to review the existing theory, methods, recent developments and scopes of Support Vector Regression.

...read moreread less

Abstract: Instead of minimizing the observed training error, Support Vector Regression (SVR) attempts to minimize the generalization error bound so as to achieve generalized performance. The idea of SVR is based on the computation of a linear regression function in a high dimensional feature space where the input data are mapped via a nonlinear function. SVR has been applied in various fields - time series and financial (noisy and risky) prediction, approximation of complex engineering analyses, convex quadratic programming and choices of loss functions, etc. In this paper, an attempt has been made to review the existing theory, methods, recent developments and scopes of SVR.

...read moreread less

1,899 citations

Journal Article•10.1109/TPAMI.2007.1068•

Twin Support Vector Machines for Pattern Classification

[...]

Jayadeva¹, Reshma Khemchandani, Suresh Chandra¹•Institutions (1)

Indian Institutes of Technology¹

01 May 2007-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A binary SVM classifier that determines two nonparallel planes by solving two related SVM-type problems, each of which is smaller than in a conventional SVM, which shows good generalization on several benchmark data sets.

...read moreread less

Abstract: We propose twin SVM, a binary SVM classifier that determines two nonparallel planes by solving two related SVM-type problems, each of which is smaller than in a conventional SVM. The twin SVM formulation is in the spirit of proximal SVMs via generalized eigenvalues. On several benchmark data sets, Twin SVM is not only fast, but shows good generalization. Twin SVM is also useful for automatically discovering two-dimensional projections of the data

...read moreread less

1,784 citations

Proceedings Article•10.1109/ICCV.2007.4409066•

Image Classification using Random Forests and Ferns

[...]

Anna Bosch¹, Andrew Zisserman², X. Muoz¹•Institutions (2)

University of Girona¹, University of Oxford²

26 Dec 2007

TL;DR: It is shown that selecting the ROI adds about 5% to the performance and, together with the other improvements, the result is about a 10% improvement over the state of the art for Caltech-256.

...read moreread less

Abstract: We explore the problem of classifying images by the object categories they contain in the case of a large number of object categories. To this end we combine three ingredients: (i) shape and appearance representations that support spatial pyramid matching over a region of interest. This generalizes the representation of Lazebnik et al., (2006) from an image to a region of interest (ROI), and from appearance (visual words) alone to appearance and local shape (edge distributions); (ii) automatic selection of the regions of interest in training. This provides a method of inhibiting background clutter and adding invariance to the object instance 's position; and (iii) the use of random forests (and random ferns) as a multi-way classifier. The advantage of such classifiers (over multi-way SVM for example) is the ease of training and testing. Results are reported for classification of the Caltech-101 and Caltech-256 data sets. We compare the performance of the random forest/ferns classifier with a benchmark multi-way SVM classifier. It is shown that selecting the ROI adds about 5% to the performance and, together with the other improvements, the result is about a 10% improvement over the state of the art for Caltech-256.

...read moreread less

1,571 citations

Journal Article•10.1016/J.YMSSP.2006.12.007•

Support vector machine in machine condition monitoring and fault diagnosis

[...]

Achmad Widodo¹, Bo-Suk Yang¹•Institutions (1)

Pukyong National University¹

01 Aug 2007-Mechanical Systems and Signal Processing

TL;DR: This paper presents a survey of machine condition monitoring and fault diagnosis using support vector machine (SVM), and attempts to summarize and review the recent research and developments of SVM in machine condition Monitoring and diagnosis.

...read moreread less

1,539 citations

Book Chapter•10.1007/978-3-540-74958-5_38•

Random k-Labelsets: An Ensemble Method for Multilabel Classification

[...]

Grigorios Tsoumakas¹, Ioannis Vlahavas¹•Institutions (1)

Aristotle University of Thessaloniki¹

17 Sep 2007

TL;DR: This paper proposes an ensemble method for multilabel classification that aims to take into account label correlations using single-label classifiers that are applied on subtasks with manageable number of labels and adequate number of examples per label.

...read moreread less

Abstract: This paper proposes an ensemble method for multilabel classification. The RAndom k-labELsets (RAKEL) algorithm constructs each member of the ensemble by considering a small random subset of labels and learning a single-label classifier for the prediction of each element in the powerset of this subset. In this way, the proposed algorithm aims to take into account label correlations using single-label classifiers that are applied on subtasks with manageable number of labels and adequate number of examples per label. Experimental results on common multilabel domains involving protein, document and scene classification show that better performance can be achieved compared to popular multilabel classification approaches.

...read moreread less

1,052 citations

Proceedings Article•10.1145/1273496.1273598•

Pegasos: Primal Estimated sub-GrAdient SOlver for SVM

[...]

Shai Shalev-Shwartz¹, Yoram Singer¹, Nathan Srebro²•Institutions (2)

Hebrew University of Jerusalem¹, Toyota Technological Institute²

20 Jun 2007

TL;DR: A simple and effective iterative algorithm for solving the optimization problem cast by Support Vector Machines that alternates between stochastic gradient descent steps and projection steps that can seamlessly be adapted to employ non-linear kernels while working solely on the primal objective function.

...read moreread less

Abstract: We describe and analyze a simple and effective iterative algorithm for solving the optimization problem cast by Support Vector Machines (SVM). Our method alternates between stochastic gradient descent steps and projection steps. We prove that the number of iterations required to obtain a solution of accuracy e is O(1/e). In contrast, previous analyses of stochastic gradient descent methods require Ω (1/e2) iterations. As in previously devised SVM solvers, the number of iterations also scales linearly with 1/λ, where λ is the regularization parameter of SVM. For a linear kernel, the total run-time of our method is O (d/(λe)), where d is a bound on the number of non-zero features in each example. Since the run-time does not depend directly on the size of the training set, the resulting algorithm is especially suited for learning from large datasets. Our approach can seamlessly be adapted to employ non-linear kernels while working solely on the primal objective function. We demonstrate the efficiency and applicability of our approach by conducting experiments on large text classification problems, comparing our solver to existing state-of-the-art SVM solvers. For example, it takes less than 5 seconds for our solver to converge when solving a text classification problem from Reuters Corpus Volume 1 (RCV1) with 800,000 training examples.

...read moreread less

1,039 citations

Journal Article•10.1198/004017007000000245•

Large-Scale Bayesian Logistic Regression for Text Categorization

[...]

Alexander Genkin, David D. Lewis, David Madigan

01 Aug 2007-Technometrics

TL;DR: In this article, a simple Bayesian logistic regression approach that uses a Laplace prior to avoid overfitting and produces sparse predictive models for text data is presented. But this approach is not suitable for document classification problems.

...read moreread less

Abstract: Logistic regression analysis of high-dimensional data, such as natural language text, poses computational and statistical challenges. Maximum likelihood estimation often fails in these applications. We present a simple Bayesian logistic regression approach that uses a Laplace prior to avoid overfitting and produces sparse predictive models for text data. We apply this approach to a range of document classification problems and show that it produces compact predictive models at least as effective as those produced by support vector machine classifiers or ridge logistic regression combined with feature selection. We describe our model fitting algorithm, our open source implementations (BBR and BMR), and experimental results.

...read moreread less

996 citations

TMVA - Toolkit for Multivariate Data Analysis

[...]

Andreas Hocker¹, P. Speckmayer¹, J. Stelzer¹, F. Tegenfeldt, H. Voss, K. Voss, A. Christov, Sophie Henrot-Versille, M. Jachowski, Attila Krasznahorkay¹, Y. Mahalalel, Rustem Ospanov, X. Prudent, Marcin Wladyslaw Wolter, Andrzej Zemla - Show less +11 more•Institutions (1)

CERN¹

28 Jun 2007

TL;DR: TMVA as mentioned in this paper is a toolkit that hosts a large variety of multivariate classification algorithms, ranging from rectangular cut optimization using a genetic algorithm and from one-dimensional likelihood estimators, over linear and nonlinear discriminants and neural networks, to sophisticated more recent classifiers such as a support vector machine, boosted decision trees and rule ensemble fitting.

...read moreread less

Abstract: n high-energy physics, with the search for ever smaller signals in ever larger data sets, it has become essential to extract a maximum of the available information from the data. Multivariate classification methods based on machine learning techniques have become a fundamental ingredient to most analyses. Also the multivariate classifiers themselves have significantly evolved in recent years. Statisticians have found new ways to tune and to combine classifiers to further gain in performance. Integrated into the analysis framework ROOT, TMVA is a toolkit which hosts a large variety of multivariate classification algorithms. They range from rectangular cut optimization using a genetic algorithm and from one- and multidimensional likelihood estimators, over linear and nonlinear discriminants and neural networks, to sophisticated more recent classifiers such as a support vector machine, boosted decision trees and rule ensemble fitting. TMVA manages the simultaneous training, testing, and performance evaluation of all these classifiers with a user-friendly interface, and expedites the application of the trained classifiers to data.

...read moreread less

950 citations

Journal Article•10.1109/TMI.2007.898551•

Retinal Blood Vessel Segmentation Using Line Operators and Support Vector Classification

[...]

Elisa Ricci¹, Renzo Perfetti¹•Institutions (1)

University of Perugia¹

01 Oct 2007-IEEE Transactions on Medical Imaging

TL;DR: In the framework of computer-aided diagnosis of eye diseases, retinal vessel segmentation based on line operators is proposed and two segmentation methods are considered.

...read moreread less

Abstract: In the framework of computer-aided diagnosis of eye diseases, retinal vessel segmentation based on line operators is proposed. A line detector, previously used in mammography, is applied to the green channel of the retinal image. It is based on the evaluation of the average grey level along lines of fixed length passing through the target pixel at different orientations. Two segmentation methods are considered. The first uses the basic line detector whose response is thresholded to obtain unsupervised pixel classification. As a further development, we employ two orthogonal line detectors along with the grey level of the target pixel to construct a feature vector for supervised classification using a support vector machine. The effectiveness of both methods is demonstrated through receiver operating characteristic analysis on two publicly available databases of color fundus images.

...read moreread less

932 citations

Journal Article•10.1162/NECO.2007.19.5.1155•

Training a Support Vector Machine in the Primal

[...]

Olivier Chapelle¹•Institutions (1)

Max Planck Society¹

01 May 2007-Neural Computation

TL;DR: It is pointed out that the primal problem can also be solved efficiently for both linear and nonlinear SVMs and that there is no reason for ignoring this possibility.

...read moreread less

Abstract: Most literature on support vector machines (SVMs) concentrates on the dual optimization problem. In this letter, we point out that the primal problem can also be solved efficiently for both linear and nonlinear SVMs and that there is no reason for ignoring this possibility. On the contrary, from the primal point of view, new families of algorithms for large-scale SVM training can be investigated.

...read moreread less

Proceedings Article•10.1145/1277741.1277790•

A support vector method for optimizing average precision

[...]

Yisong Yue¹, Thomas Finley¹, Filip Radlinski¹, Thorsten Joachims¹•Institutions (1)

Cornell University¹

23 Jul 2007

TL;DR: This work presents a general SVM learning algorithm that efficiently finds a globally optimal solution to a straightforward relaxation of MAP, and shows its method to produce statistically significant improvements in MAP scores.

...read moreread less

Abstract: Machine learning is commonly used to improve ranked retrieval systems. Due to computational difficulties, few learning techniques have been developed to directly optimize for mean average precision (MAP), despite its widespread use in evaluating such systems. Existing approaches optimizing MAP either do not find a globally optimal solution, or are computationally expensive. In contrast, we present a general SVM learning algorithm that efficiently finds a globally optimal solution to a straightforward relaxation of MAP. We evaluate our approach using the TREC 9 and TREC 10 Web Track corpora (WT10g), comparing against SVMs optimized for accuracy and ROCArea. In most cases we show our method to produce statistically significant improvements in MAP scores.

...read moreread less

Journal Article•10.1016/J.ESWA.2006.07.007•

Credit scoring with a data mining approach based on support vector machines

[...]

Cheng-Lung Huang¹, Mu-Chen Chen², Chieh-Jen Wang³•Institutions (3)

National Kaohsiung First University of Science and Technology¹, National Chiao Tung University², Huafan University³

01 Nov 2007-Expert Systems With Applications

TL;DR: Experimental results show that SVM is a promising addition to the existing data mining methods and three strategies to construct the hybrid SVM-based credit scoring models are used.

...read moreread less

Abstract: The credit card industry has been growing rapidly recently, and thus huge numbers of consumers' credit data are collected by the credit department of the bank. The credit scoring manager often evaluates the consumer's credit with intuitive experience. However, with the support of the credit classification model, the manager can accurately evaluate the applicant's credit score. Support Vector Machine (SVM) classification is currently an active research area and successfully solves classification problems in many domains. This study used three strategies to construct the hybrid SVM-based credit scoring models to evaluate the applicant's credit score from the applicant's input features. Two credit datasets in UCI database are selected as the experimental data to demonstrate the accuracy of the SVM classifier. Compared with neural networks, genetic programming, and decision tree classifiers, the SVM classifier achieved an identical classificatory accuracy with relatively few input features. Additionally, combining genetic algorithms with SVM classifier, the proposed hybrid GA-SVM strategy can simultaneously perform feature selection task and model parameters optimization. Experimental results show that SVM is a promising addition to the existing data mining methods.

...read moreread less

Journal Article•10.1109/TITS.2007.895311•

Road-Sign Detection and Recognition Based on Support Vector Machines

[...]

Saturnino Maldonado-Bascón, S. Lafuente-Arroyo, P. Gil-Jimenez, H. Gomez-Moreno, Francisco López-Ferreras - Show less +1 more

01 Jun 2007-IEEE Transactions on Intelligent Transportation Systems

TL;DR: An automatic road-sign detection and recognition system based on support vector machines that is able to detect and recognize circular, rectangular, triangular, and octagonal signs and, hence, covers all existing Spanish traffic-sign shapes.

...read moreread less

Abstract: This paper presents an automatic road-sign detection and recognition system based on support vector machines (SVMs). In automatic traffic-sign maintenance and in a visual driver-assistance system, road-sign detection and recognition are two of the most important functions. Our system is able to detect and recognize circular, rectangular, triangular, and octagonal signs and, hence, covers all existing Spanish traffic-sign shapes. Road signs provide drivers important information and help them to drive more safely and more easily by guiding and warning them and thus regulating their actions. The proposed recognition system is based on the generalization properties of SVMs. The system consists of three stages: 1) segmentation according to the color of the pixel; 2) traffic-sign detection by shape classification using linear SVMs; and 3) content recognition based on Gaussian-kernel SVMs. Because of the used segmentation stage by red, blue, yellow, white, or combinations of these colors, all traffic signs can be detected, and some of them can be detected by several colors. Results show a high success rate and a very low amount of false positives in the final recognition stage. From these results, we can conclude that the proposed algorithm is invariant to translation, rotation, scale, and, in many situations, even to partial occlusions

...read moreread less

Journal Article•10.1109/TGRS.2007.895416•

Semi-Supervised Graph-Based Hyperspectral Image Classification

[...]

G. Camps-Valls, T. Bandos Marsheva, Dengyong Zhou¹•Institutions (1)

Microsoft¹

24 Sep 2007-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: The introduction of the composite-kernel framework drastically improves results, and the new fast formulation ranks almost linearly in the computational cost, rather than cubic as in the original method, thus allowing the use of this method in remote-sensing applications.

...read moreread less

Abstract: This paper presents a semi-supervised graph-based method for the classification of hyperspectral images. The method is designed to handle the special characteristics of hyperspectral images, namely, high-input dimension of pixels, low number of labeled samples, and spatial variability of the spectral signature. To alleviate these problems, the method incorporates three ingredients, respectively. First, being a kernel-based method, it combats the curse of dimensionality efficiently. Second, following a semi-supervised approach, it exploits the wealth of unlabeled samples in the image, and naturally gives relative importance to the labeled ones through a graph-based methodology. Finally, it incorporates contextual information through a full family of composite kernels. Noting that the graph method relies on inverting a huge kernel matrix formed by both labeled and unlabeled samples, we originally introduce the Nystro umlm method in the formulation to speed up the classification process. The presented semi-supervised-graph-based method is compared to state-of-the-art support vector machines in the classification of hyperspectral data. The proposed method produces better classification maps, which capture the intrinsic structure collectively revealed by labeled and unlabeled points. Good and stable accuracy is produced in ill-posed classification problems (high dimensional spaces and low number of labeled samples). In addition, the introduction of the composite-kernel framework drastically improves results, and the new fast formulation ranks almost linearly in the computational cost, rather than cubic as in the original method, thus allowing the use of this method in remote-sensing applications.

...read moreread less

Journal Article•10.1016/J.COMPMEDIMAG.2007.01.003•

A Methodological Approach to the Classification of Dermoscopy Images

[...]

M. Emre Celebi¹, Hassan A. Kingravi¹, Bakhtiyar Uddin¹, Hitoshi Iyatomi², Y. Alp Aslandogan¹, William V. Stoecker, Randy Hays Moss³ - Show less +3 more•Institutions (3)

University of Texas at Arlington¹, Hosei University², Missouri University of Science and Technology³

01 Sep 2007-Computerized Medical Imaging and Graphics

TL;DR: A methodological approach to the classification of pigmented skin lesions in dermoscopy images is presented and the issue of class imbalance is addressed using various sampling strategies and the classifier generalization error is estimated using Monte Carlo cross validation.

...read moreread less

Journal Article•10.1093/BIOSTATISTICS/KXJ035•

Regularized linear discriminant analysis and its application in microarrays

[...]

Yaqian Guo¹, Trevor Hastie¹, Robert Tibshirani¹•Institutions (1)

Stanford University¹

01 Jan 2007-Biostatistics

TL;DR: Through both simulated data and real life data, it is shown that this method performs very well in multivariate classification problems, often outperforms the PAM method and can be as competitive as the support vector machines classifiers.

...read moreread less

Abstract: In this paper, we introduce a modified version of linear discriminant analysis, called the "shrunken centroids regularized discriminant analysis" (SCRDA). This method generalizes the idea of the "nearest shrunken centroids" (NSC) (Tibshirani and others, 2003) into the classical discriminant analysis. The SCRDA method is specially designed for classification problems in high dimension low sample size situations, for example, microarray data. Through both simulated data and real life data, it is shown that this method performs very well in multivariate classification problems, often outperforms the PAM method (using the NSC algorithm) and can be as competitive as the support vector machines classifiers. It is also suitable for feature elimination purpose and can be used as gene selection method. The open source R package for this method (named "rda") is available on CRAN (http://www.r-project.org) for download and testing.

...read moreread less

Journal Article•10.1016/J.NEUROIMAGE.2006.11.005•

Temporal classification of multichannel near-infrared spectroscopy signals of motor imagery for developing a brain-computer interface.

[...]

Ranganatha Sitaram¹, Haihong Zhang¹, Cuntai Guan¹, M. Thulasidas¹, Yoko Hoshi, Akihiro Ishikawa², Koji Shimizu², Niels Birbaumer³ - Show less +4 more•Institutions (3)

Institute for Infocomm Research Singapore¹, Shimadzu Corp.², University of Tübingen³

15 Feb 2007-NeuroImage

TL;DR: Results indicate potential application of NIRS in the development of BCIs and present results of signal analysis indicating that there exist distinct patterns of hemodynamic responses which could be utilized in a pattern classifier towards developing a BCI.

...read moreread less

Proceedings Article•10.1117/12.696774•

Merging Markov and DCT Features for Multi-Class JPEG Steganalysis

[...]

Tomas Pevny¹, Jessica Fridrich¹•Institutions (1)

Binghamton University¹

15 Feb 2007

TL;DR: In this article, a support vector machine (SVM) was used to construct a new multi-class JPEG steganalyzer with markedly improved performance by extending the 23 DCT feature set and applying calibration to the Markov features.

...read moreread less

Abstract: Blind steganalysis based on classifying feature vectors derived from images is becoming increasingly more powerful. For steganalysis of JPEG images, features derived directly in the embedding domain from DCT coefficients appear to achieve the best performance (e.g., the DCT features10 and Markov features21). The goal of this paper is to construct a new multi-class JPEG steganalyzer with markedly improved performance. We do so first by extending the 23 DCT feature set,10 then applying calibration to the Markov features described in21 and reducing their dimension. The resulting feature sets are merged, producing a 274-dimensional feature vector. The new feature set is then used to construct a Support Vector Machine multi-classifier capable of assigning stego images to six popular steganographic algorithms-F5,22 OutGuess,18 Model Based Steganography without ,19 and with20 deblocking, JP Hide&Seek,1 and Steghide.14 Comparing to our previous work on multi-classification,11, 12 the new feature set provides significantly more reliable results.

...read moreread less

Journal Article•10.1016/J.YMSSP.2006.05.004•

Feature selection using Decision Tree and classification through Proximal Support Vector Machine for fault diagnostics of roller bearing

[...]

V. Sugumaran¹, V. Muralidharan¹, K. I. Ramachandran¹•Institutions (1)

Amrita Vishwa Vidyapeetham¹

01 Feb 2007-Mechanical Systems and Signal Processing

TL;DR: This paper illustrates the use of a Decision Tree that identifies the best features from a given set of samples for the purpose of classification using Proximal Support Vector Machine (PSVM), which has the capability to efficiently classify the faults using statistical features.

...read moreread less

Proceedings Article•10.1145/1299015.1299021•

A comparison of machine learning techniques for phishing detection

[...]

Saeed Abu-Nimeh¹, D. Nappa¹, Xinlei Wang¹, Suku Nair¹•Institutions (1)

Southern Methodist University¹

4 Oct 2007

TL;DR: This study compares the predictive accuracy of several machine learning methods including Logistic Regression (LR), Classification and Regression Trees (CART), Bayesian Additive Regression trees (BART), Support Vector Machines (SVM), Random Forests (RF), and Neural Networks (NNet) for predicting phishing emails.

...read moreread less

Abstract: There are many applications available for phishing detection. However, unlike predicting spam, there are only few studies that compare machine learning techniques in predicting phishing. The present study compares the predictive accuracy of several machine learning methods including Logistic Regression (LR), Classification and Regression Trees (CART), Bayesian Additive Regression Trees (BART), Support Vector Machines (SVM), Random Forests (RF), and Neural Networks (NNet) for predicting phishing emails. A data set of 2889 phishing and legitimate emails is used in the comparative study. In addition, 43 features are used to train and test the classifiers.

...read moreread less

Journal Article•10.1016/J.INS.2007.03.025•

A hybrid machine learning approach to network anomaly detection

[...]

Taeshik Shon, Jongsub Moon¹•Institutions (1)

Korea University¹

01 Sep 2007-Information Sciences

TL;DR: A new SVM approach is proposed, named Enhanced SVM, which combines these two methods in order to provide unsupervised learning and low false alarm capability, similar to that of a supervised S VM approach.

...read moreread less

Book•

PAC-BAYESIAN SUPERVISED CLASSIFICATION: The Thermodynamics of Statistical Learning

[...]

Olivier Catoni

30 Jun 2007

TL;DR: An alternative selection scheme based on relative bounds between estimators is described and study, and a two step localization technique which can handle the selection of a parametric model from a family of those is presented.

...read moreread less

Abstract: This monograph deals with adaptive supervised classification, using tools borrowed from statistical mechanics and information theory, stemming from the PACBayesian approach pioneered by David McAllester and applied to a conception of statistical learning theory forged by Vladimir Vapnik. Using convex analysis on the set of posterior probability measures, we show how to get local measures of the complexity of the classification model involving the relative entropy of posterior distributions with respect to Gibbs posterior measures. We then discuss relative bounds, comparing the generalization error of two classification rules, showing how the margin assumption of Mammen and Tsybakov can be replaced with some empirical measure of the covariance structure of the classification model.We show how to associate to any posterior distribution an effective temperature relating it to the Gibbs prior distribution with the same level of expected error rate, and how to estimate this effective temperature from data, resulting in an estimator whose expected error rate converges according to the best possible power of the sample size adaptively under any margin and parametric complexity assumptions. We describe and study an alternative selection scheme based on relative bounds between estimators, and present a two step localization technique which can handle the selection of a parametric model from a family of those. We show how to extend systematically all the results obtained in the inductive setting to transductive learning, and use this to improve Vapnik's generalization bounds, extending them to the case when the sample is made of independent non-identically distributed pairs of patterns and labels. Finally we review briefly the construction of Support Vector Machines and show how to derive generalization bounds for them, measuring the complexity either through the number of support vectors or through the value of the transductive or inductive margin.

...read moreread less

Journal Article•10.1007/S00778-006-0002-5•

A new intrusion detection system using support vector machines and hierarchical clustering

[...]

Latifur Khan¹, Mamoun Awad¹, Bhavani Thuraisingham¹•Institutions (1)

University of Texas at Dallas¹

1 Oct 2007

TL;DR: This paper presents a new approach of combination of SVM and DGSOT, which starts with an initial training set and expands it gradually using the clustering structure produced by the D GSOT algorithm, which has proved to overcome the drawbacks of traditional hierarchical clustering algorithms.

...read moreread less

Abstract: Whenever an intrusion occurs, the security and value of a computer system is compromised. Network-based attacks make it difficult for legitimate users to access various network services by purposely occupying or sabotaging network resources and services. This can be done by sending large amounts of network traffic, exploiting well-known faults in networking services, and by overloading network hosts. Intrusion Detection attempts to detect computer attacks by examining various data records observed in processes on the network and it is split into two groups, anomaly detection systems and misuse detection systems. Anomaly detection is an attempt to search for malicious behavior that deviates from established normal patterns. Misuse detection is used to identify intrusions that match known attack scenarios. Our interest here is in anomaly detection and our proposed method is a scalable solution for detecting network-based anomalies. We use Support Vector Machines (SVM) for classification. The SVM is one of the most successful classification algorithms in the data mining area, but its long training time limits its use. This paper presents a study for enhancing the training time of SVM, specifically when dealing with large data sets, using hierarchical clustering analysis. We use the Dynamically Growing Self-Organizing Tree (DGSOT) algorithm for clustering because it has proved to overcome the drawbacks of traditional hierarchical clustering algorithms (e.g., hierarchical agglomerative clustering). Clustering analysis helps find the boundary points, which are the most qualified data points to train SVM, between two classes. We present a new approach of combination of SVM and DGSOT, which starts with an initial training set and expands it gradually using the clustering structure produced by the DGSOT algorithm. We compare our approach with the Rocchio Bundling technique and random selection in terms of accuracy loss and training time gain using a single benchmark real data set. We show that our proposed variations contribute significantly in improving the training process of SVM with high generalization accuracy and outperform the Rocchio Bundling technique.

...read moreread less

Journal Article•10.4319/LOM.2007.5.204•

Automated taxonomic classification of phytoplankton sampled with imaging-in-flow cytometry

[...]

Heidi M. Sosik¹, Robert J. Olson¹•Institutions (1)

Woods Hole Oceanographic Institution¹

01 Jun 2007-Limnology and Oceanography-methods

TL;DR: This work developed an approach that relies on extraction of image features, which are then presented to a machine learning algorithm for classification, which provides taxonomically resolved estimates of phytoplankton abundance with fine temporal resolution and permits access to scales of variability from tidal to seasonal and longer.

...read moreread less

Abstract: High-resolution photomicrographs of phytoplankton cells and chains can now be acquired with imaging-in-flow systems at rates that make manual identification impractical for many applications. To address the challenge for automated taxonomic identification of images generated by our custom-built submersible Imaging FlowCytobot, we developed an approach that relies on extraction of image features, which are then presented to a machine learning algorithm for classification. Our approach uses a combination of image feature types including size, shape, symmetry, and texture characteristics, plus orientation invariant moments, diffraction pattern sampling, and co-occurrence matrix statistics. Some of these features required preprocessing with image analysis techniques including edge detection after phase congruency calculations, morphological operations, boundary representation and simplification, and rotation. For the machine learning strategy, we developed an approach that combines a feature selection algorithm and use of a support vector machine specified with a rigorous parameter selection and training approach. After training, a 22-category classifier provides 88% overall accuracy for an independent test set, with individual category accuracies ranging from 68% to 99%. We demonstrate application of this classifier to a nearly uninterrupted 2-month time series of images acquired in Woods Hole Harbor, including use of statistical error correction to derive quantitative concentration estimates, which are shown to be unbiased with respect to manual estimates for random subsamples. Our approach, which provides taxonomically resolved estimates of phytoplankton abundance with fine temporal resolution (hours for many species), permits access to scales of variability from tidal to seasonal and longer.

...read moreread less

Proceedings Article•10.1145/1321440.1321461•

Learning on the border: active learning in imbalanced data classification

[...]

Seyda Ertekin¹, Jian Huang¹, Léon Bottou², C. Lee Giles¹•Institutions (2)

Pennsylvania State University¹, Princeton University²

6 Nov 2007

TL;DR: It is demonstrated that active learning is capable of solving the class imbalance problem by providing the learner more balanced classes and an efficient way of selecting informative instances from a smaller pool of samples for active learning which does not necessitate a search through the entire dataset.

...read moreread less

Abstract: This paper is concerned with the class imbalance problem which has been known to hinder the learning performance of classification algorithms. The problem occurs when there are significantly less number of observations of the target concept. Various real-world classification tasks, such as medical diagnosis, text categorization and fraud detection suffer from this phenomenon. The standard machine learning algorithms yield better prediction performance with balanced datasets. In this paper, we demonstrate that active learning is capable of solving the class imbalance problem by providing the learner more balanced classes. We also propose an efficient way of selecting informative instances from a smaller pool of samples for active learning which does not necessitate a search through the entire dataset. The proposed method yields an efficient querying system and allows active learning to be applied to very large datasets. Our experimental results show that with an early stopping criteria, active learning achieves a fast solution with competitive prediction performance in imbalanced data classification.

...read moreread less

Journal Article•10.1198/016214507000000617•

Robust Truncated Hinge Loss Support Vector Machines

[...]

Yichao Wu¹, Yufeng Liu•Institutions (1)

Princeton University¹

01 Sep 2007-Journal of the American Statistical Association

TL;DR: The robust truncated hinge loss SVM (RSVM) is proposed, which is shown to be more robust to outliers and to deliver more accurate classifiers using a smaller set of SVs than the standard SVM.

...read moreread less

Abstract: The support vector machine (SVM) has been widely applied for classification problems in both machine learning and statistics. Despite its popularity, however, SVM has some drawbacks in certain situations. In particular, the SVM classifier can be very sensitive to outliers in the training sample. Moreover, the number of support vectors (SVs) can be very large in many applications. To circumvent these drawbacks, we propose the robust truncated hinge loss SVM (RSVM), which uses a truncated hinge loss. The RSVM is shown to be more robust to outliers and to deliver more accurate classifiers using a smaller set of SVs than the standard SVM. Our theoretical results show that the RSVM is Fisher-consistent, even when there is no dominating class, a scenario that is particularly challenging for multicategory classification. Similar results are obtained for a class of margin-based classifiers.

...read moreread less

Book Chapter•10.1002/9780470116449.CH6•

Applications of Support Vector Machines in Chemistry

[...]

Ovidiu Ivanciuc¹•Institutions (1)

University of Texas Medical Branch¹

14 Feb 2007

TL;DR: Support vector machines represent an extension to nonlinear models of the generalized portrait algorithm developed by Vapnik and Lerner, and are a group of supervised learning methods that can be applied to classification or regression.

...read moreread less

Abstract: Kernel-based techniques (such as support vector machines, Bayes point machines, kernel principal component analysis, and Gaussian processes) represent a major development in machine learning algorithms. Support vector machines (SVM) are a group of supervised learning methods that can be applied to classification or regression. In a short period of time, SVM found numerous applications in chemistry, such as in drug design (discriminating between ligands and nonligands, inhibitors and noninhibitors, etc.), quantitative structure-activity relationships (QSAR, where SVM regression is used to predict various physical, chemical, or biological properties), chemometrics (optimization of chromatographic separation or compound concentration prediction from spectral data as examples), sensors (for qualitative and quantitative prediction from sensor data), chemical engineering (fault detection and modeling of industrial processes), and text mining (automatic recognition of scientific information). Support vector machines represent an extension to nonlinear models of the generalized portrait algorithm developed by Vapnik and Lerner. The SVM algorithm is based on the statistical learning theory and the Vapnik–Chervonenkis

...read moreread less

A data mining approach to predict forest fires using meteorological data

[...]

Paulo Cortez¹, Aníbal de Jesus Raimundo Morais•Institutions (1)

University of Minho¹

1 Dec 2007

TL;DR: This work explores a Data Mining (DM) approach to predict the burned area of forest fires and finds that the best configuration uses a SVM and four meteorological inputs and it is capable of predicting the burned Area of small fires, which are more frequent.

...read moreread less

Abstract: Forest fires are a major environmental issue, creating economical and ecological damage while endangering human lives. Fast detection is a key element for controlling such phenomenon. To achieve this, one alternative is to use automatic tools based on local sensors, such as provided by meteorological stations. In effect, meteorological conditions (e.g. temperature, wind) are known to influence forest fires and several fire indexes, such as the forest Fire Weather Index (FWI), use such data. In this work, we explore a Data Mining (DM) approach to predict the burned area of forest fires. Five different DM techniques, e.g. Support Vector Machines (SVM) and Random Forests, and four distinct feature selection setups (using spatial, temporal, FWI components and weather attributes), were tested on recent real-world data collected from the northeast region of Portugal. The best configuration uses a SVM and four meteorological inputs (i.e. temperature, relative humidity, rain and wind) and it is capable of predicting the burned area of small fires, which are more frequent. Such knowledge is particularly useful for improving firefighting resource management (e.g. prioritizing targets for air tankers and ground crews).

...read moreread less

Support Vector Machine Solvers

[...]

Léon Bottou, Olivier Chapelle¹, Dennis DeCoste, Jason Weston•Institutions (1)

National Taiwan University¹

1 Jan 2007

TL;DR: This chapter contains sections titled: Introduction, Support Vector Machines, Duality, Sparsity, Early SVM Algorithms, The Decomposition Method, A Case Study: LIBSVM, Conclusion and Outlook.

...read moreread less

Abstract: This chapter contains sections titled: Introduction, Support Vector Machines, Duality, Sparsity, Early SVM Algorithms, The Decomposition Method, A Case Study: LIBSVM, Conclusion and Outlook, Appendix

...read moreread less

...

Expand