Top 980 papers published in the topic of Support vector machine in 2003

Showing papers on "Support vector machine published in 2003"

Journal Article•10.1198/JASA.2003.S269•

Learning With Kernels: Support Vector Machines, Regularization, Optimization, and Beyond

[...]

01 Jun 2003-Journal of the American Statistical Association

TL;DR: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond as discussed by the authors The Journal of the American Statistical Association: Vol 98, No 462, pp 489-489

...read moreread less

Abstract: (2003) Learning With Kernels: Support Vector Machines, Regularization, Optimization, and Beyond Journal of the American Statistical Association: Vol 98, No 462, pp 489-489

...read moreread less

4,057 citations

Journal Article•10.1016/S0925-2312(03)00372-2•

Financial time series forecasting using support vector machines

[...]

Kyoung-jae Kim¹•Institutions (1)

College of Business Administration¹

01 Sep 2003-Neurocomputing

TL;DR: The experimental results show that SVM provides a promising alternative to stock market prediction and the feasibility of applying SVM in financial forecasting is examined by comparing it with back-propagation neural networks and case-based reasoning.

...read moreread less

1,867 citations

Journal Article•10.1162/089976603321891855•

Asymptotic behaviors of support vector machines with Gaussian kernel

[...]

S. Sathiya Keerthi¹, Chih-Jen Lin²•Institutions (2)

National University of Singapore¹, National Taiwan University²

01 Jul 2003-Neural Computation

TL;DR: The behavior of the SVM classifier when these hyper parameters take very small or very large values is analyzed, which helps in understanding thehyperparameter space that leads to an efficient heuristic method of searching for hyperparameter values with small generalization errors.

...read moreread less

Abstract: Support vector machines (SVMs) with the gaussian (RBF) kernel have been popular for practical use. Model selection in this class of SVMs involves two hyperparameters: the penalty parameter C and the kernel width σ. This letter analyzes the behavior of the SVM classifier when these hyperparameters take very small or very large values. Our results help in understanding the hyperparameter space that leads to an efficient heuristic method of searching for hyperparameter values with small generalization errors. The analysis also indicates that if complete model selection using the gaussian kernel has been conducted, there is no need to consider linear SVM.

...read moreread less

1,742 citations

Journal Article•10.1057/PALGRAVE.JORS.2601545•

Benchmarking state-of-the-art classification algorithms for credit scoring

[...]

Bart Baesens¹, T. Van Gestel¹, Stijn Viaene¹, M Stepanova², Johan A. K. Suykens¹, Jan Vanthienen¹ - Show less +2 more•Institutions (2)

Katholieke Universiteit Leuven¹, UBS²

09 Jun 2003-Journal of the Operational Research Society

TL;DR: It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.

...read moreread less

Abstract: In this paper, we study the performance of various state-of-the-art classification algorithms applied to eight real-life credit scoring data sets. Some of the data sets originate from major Benelux and UK financial institutions. Different types of classifiers are evaluated and compared. Besides the well-known classification algorithms (eg logistic regression, discriminant analysis, k-nearest neighbour, neural networks and decision trees), this study also investigates the suitability and performance of some recently proposed, advanced kernel-based classification algorithms such as support vector machines and least-squares support vector machines (LS-SVMs). The performance is assessed using the classification accuracy and the area under the receiver operating characteristic curve. Statistically significant performance differences are identified using the appropriate test statistics. It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.

...read moreread less

1,351 citations

Proceedings Article•10.1145/956750.956759•

Adaptive duplicate detection using learnable string similarity measures

[...]

Mikhail Bilenko¹, Raymond J. Mooney¹•Institutions (1)

University of Texas at Austin¹

24 Aug 2003

TL;DR: This paper proposes to employ learnable text distance functions for each database field, and shows that such measures are capable of adapting to the specific notion of similarity that is appropriate for the field's domain.

...read moreread less

Abstract: The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied on generic or manually tuned distance metrics for estimating the similarity of potential duplicates. In this paper, we present a framework for improving duplicate detection using trainable measures of textual similarity. We propose to employ learnable text distance functions for each database field, and show that such measures are capable of adapting to the specific notion of similarity that is appropriate for the field's domain. We present two learnable text similarity measures suitable for this task: an extended variant of learnable string edit distance, and a novel vector-space based measure that employs a Support Vector Machine (SVM) for training. Experimental results on a range of datasets show that our framework can improve duplicate detection accuracy over traditional techniques.

...read moreread less

1,103 citations

Proceedings Article•

1-norm Support Vector Machines

[...]

Ji Zhu¹, Saharon Rosset¹, Robert Tibshirani¹, Trevor Hastie¹•Institutions (1)

Stanford University¹

9 Dec 2003

TL;DR: It is argued that the 1-norm SVM may have some advantage over the standard 2- norm SVM, especially when there are redundant noise features, and an efficient algorithm is proposed that computes the whole solution path of the1-normSVM, hence facilitates adaptive selection of the tuning parameter for the 1

...read moreread less

Abstract: The standard 2-norm SVM is known for its good performance in two-class classification. In this paper, we consider the 1-norm SVM. We argue that the 1-norm SVM may have some advantage over the standard 2-norm SVM, especially when there are redundant noise features. We also propose an efficient algorithm that computes the whole solution path of the 1-norm SVM, hence facilitates adaptive selection of the tuning parameter for the 1-norm SVM.

...read moreread less

1,061 citations

Journal Article•10.1109/TNN.2003.820556•

Support vector machine with adaptive parameters in financial time series forecasting

[...]

L.J. Cao¹, Francis E. H. Tay¹•Institutions (1)

National University of Singapore¹

01 Nov 2003-IEEE Transactions on Neural Networks

TL;DR: SVM with adaptive parameters can both achieve higher generalization performance and use fewer support vectors than the standard SVM in financial forecasting.

...read moreread less

Abstract: A novel type of learning machine called support vector machine (SVM) has been receiving increasing interest in areas ranging from its original application in pattern recognition to other applications such as regression estimation due to its remarkable generalization performance. This paper deals with the application of SVM in financial time series forecasting. The feasibility of applying SVM in financial forecasting is first examined by comparing it with the multilayer back-propagation (BP) neural network and the regularized radial basis function (RBF) neural network. The variability in performance of SVM with respect to the free parameters is investigated experimentally. Adaptive parameters are then proposed by incorporating the nonstationarity of financial time series into SVM. Five real futures contracts collated from the Chicago Mercantile Market are used as the data sets. The simulation shows that among the three methods, SVM outperforms the BP neural network in financial forecasting, and there are comparable generalization performance between SVM and the regularized RBF neural network. Furthermore, the free parameters of SVM have a great effect on the generalization performance. SVM with adaptive parameters can both achieve higher generalization performance and use fewer support vectors than the standard SVM in financial forecasting.

...read moreread less

1,044 citations

Journal Article•10.1214/AOS/1079120130•

Statistical behavior and consistency of classification methods based on convex risk minimization

[...]

Tong Zhang

01 Feb 2003-Annals of Statistics

TL;DR: This study sheds light on the good performance of some recently proposed linear classification methods including boosting and support vector machines and shows their limitations and suggests possible improvements.

...read moreread less

Abstract: We study how closely the optimal Bayes error rate can be approximately reached using a classification algorithm that computes a classifier by minimizing a convex upper bound of the classification error function. The measurement of closeness is characterized by the loss function used in the estimation. We show that such a classification scheme can be generally regarded as a (nonmaximum-likelihood) conditional in-class probability estimate, and we use this analysis to compare various convex loss functions that have appeared in the literature. Furthermore, the theoretical insight allows us to design good loss functions with desirable properties. Another aspect of our analysis is to demonstrate the consistency of certain classification methods using convex risk minimization. This study sheds light on the good performance of some recently proposed linear classification methods including boosting and support vector machines. It also shows their limitations and suggests possible improvements.

...read moreread less

960 citations

Journal Article•10.1016/S0925-2312(03)00431-4•

The support vector machine under test

[...]

David Meyer¹, Friedrich Leisch¹, Kurt Hornik²•Institutions (2)

Vienna University of Technology¹, Vienna University of Economics and Business²

01 Sep 2003-Neurocomputing

TL;DR: A popular SVM implementation is compared to 16 classification methods and 9 regression methods accessible through the software R by the means of standard performance measures and bias-variance decompositions which showed mostly good performances both on classification and regression tasks, but other methods proved to be very competitive.

...read moreread less

907 citations

Proceedings Article•10.1109/ICDM.2003.1250918•

Building text classifiers using positive and unlabeled examples

[...]

Bing Liu¹, Yang Dai, Xiaoli Li², Wee Sun Lee², Philip S. Yu³ - Show less +1 more•Institutions (3)

University of Illinois at Chicago¹, National University of Singapore², IBM³

19 Nov 2003

TL;DR: A more principled approach to solving the problem of building text classifiers using positive and unlabeled examples based on a biased formulation of SVM is proposed, and it is shown experimentally that it is more accurate than the existing techniques.

...read moreread less

Abstract: We study the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. We first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques.

...read moreread less

865 citations

Proceedings Article•

Learning a Distance Metric from Relative Comparisons

[...]

Matthew Schultz¹, Thorsten Joachims¹•Institutions (1)

Cornell University¹

9 Dec 2003

TL;DR: Taking a Support Vector Machine (SVM) approach, an algorithm is developed that provides a flexible way of describing qualitative training data as a set of constraints that leads to a convex quadratic programming problem that can be solved by adapting standard methods for SVM training.

...read moreread less

Abstract: This paper presents a method for learning a distance metric from relative comparison such as "A is closer to B than A is to C". Taking a Support Vector Machine (SVM) approach, we develop an algorithm that provides a flexible way of describing qualitative training data as a set of constraints. We show that such constraints lead to a convex quadratic programming problem that can be solved by adapting standard methods for SVM training. We empirically evaluate the performance and the modelling flexibility of the algorithm on a collection of text documents.

...read moreread less

Proceedings Article•

Transductive learning via spectral graph partitioning

[...]

Thorsten Joachims¹•Institutions (1)

Cornell University¹

21 Aug 2003

TL;DR: This work proposes an algorithm that robustly achieves good generalization performance and that can be trained efficiently, and shows a connection to transductive Support Vector Machines, and that an effective Co-Training algorithm arises as a special case.

...read moreread less

Abstract: We present a new method for transductive learning, which can be seen as a transductive version of the k nearest-neighbor classifier. Unlike for many other transductive learning methods, the training problem has a meaningful relaxation that can be solved globally optimally using spectral methods. We propose an algorithm that robustly achieves good generalization performance and that can be trained efficiently. A key advantage of the algorithm is that it does not require additional heuristics to avoid unbalanced splits. Furthermore, we show a connection to transductive Support Vector Machines, and that an effective Co-Training algorithm arises as a special case.

...read moreread less

Proceedings Article•10.1109/ICDM.2003.1250950•

Cost-sensitive learning by cost-proportionate example weighting

[...]

Bianca Zadrozny¹, John Langford¹, Naoki Abe¹•Institutions (1)

IBM¹

19 Nov 2003

TL;DR: Costing is proposed, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.

...read moreread less

Abstract: We propose and evaluate a family of methods for converting classifier learning algorithms and classification theory into cost-sensitive algorithms and theory. The proposed conversion is based on cost-proportionate weighting of the training examples, which can be realized either by feeding the weights to the classification algorithm (as often done in boosting), or by careful subsampling. We give some theoretical performance guarantees on the proposed methods, as well as empirical evidence that they are practical alternatives to existing approaches. In particular, we propose costing, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.

...read moreread less

Journal Article•10.1109/TNSRE.2003.814441•

Comparison of linear, nonlinear, and feature selection methods for EEG signal classification

[...]

Deon Garrett¹, David A. Peterson¹, Charles W. Anderson¹, Michael H. Thaut¹•Institutions (1)

Colorado State University¹

28 Jul 2003

TL;DR: The results of a linear (linear discriminant analysis) and two nonlinear classifiers applied to the classification of spontaneous EEG during five mental tasks are reported, showing that non linear classifiers produce only slightly better classification results.

...read moreread less

Abstract: The reliable operation of brain-computer interfaces (BCIs) based on spontaneous electroencephalogram (EEG) signals requires accurate classification of multichannel EEG. The design of EEG representations and classifiers for BCI are open research questions whose difficulty stems from the need to extract complex spatial and temporal patterns from noisy multidimensional time series obtained from EEG measurements. The high-dimensional and noisy nature of EEG may limit the advantage of nonlinear classification methods over linear ones. This paper reports the results of a linear (linear discriminant analysis) and two nonlinear classifiers (neural networks and support vector machines) applied to the classification of spontaneous EEG during five mental tasks, showing that nonlinear classifiers produce only slightly better classification results. An approach to feature selection based on genetic algorithms is also presented with preliminary results of application to EEG during finger movement.

...read moreread less

Journal Article•

Use of the zero norm with linear models and kernel methods

[...]

Jason Weston¹, André Elisseeff¹, Bernhard Schölkopf¹, Michael E. Tipping²•Institutions (2)

Max Planck Society¹, Microsoft²

01 Mar 2003-Journal of Machine Learning Research

TL;DR: In this article, the authors explore the use of the zero-norm of the parameters of linear models in learning and derive a simple but practical method for variable or feature selection, minimizing training error and ensuring sparsity in solutions.

...read moreread less

Abstract: We explore the use of the so-called zero-norm of the parameters of linear models in learning. Minimization of such a quantity has many uses in a machine learning context: for variable or feature selection, minimizing training error and ensuring sparsity in solutions. We derive a simple but practical method for achieving these goals and discuss its relationship to existing techniques of minimizing the zero-norm. The method boils down to implementing a simple modification of vanilla SVM, namely via an iterative multiplicative rescaling of the training data. Applications we investigate which aid our discussion include variable and feature selection on biological microarray data, and multicategory classification.

...read moreread less

Journal Article•

Variable selection using svm based criteria

[...]

Alain Rakotomamonjy¹•Institutions (1)

Institut national des sciences appliquées de Rouen¹

01 Mar 2003-Journal of Machine Learning Research

TL;DR: New methods to evaluate variable subset relevance with a view to variable selection based on weight vector derivative achieves good results and performs consistently well over the datasets used.

...read moreread less

Abstract: We propose new methods to evaluate variable subset relevance with a view to variable selection. Relevance criteria are derived from Support Vector Machines and are based on weight vector ||w||2 or generalization error bounds sensitivity with respect to a variable. Experiments on linear and non-linear toy problems and real-world datasets have been carried out to assess the effectiveness of these criteria. Results show that the criterion based on weight vector derivative achieves good results and performs consistently well over the datasets we used.

...read moreread less

Journal Article•10.1016/S0925-2312(02)00601-X•

Evaluation of simple performance measures for tuning SVM hyperparameters

[...]

Kaibo Duan¹, S. Sathiya Keerthi¹, Aun-Neow Poo¹•Institutions (1)

National University of Singapore¹

01 Apr 2003-Neurocomputing

TL;DR: The empirically study the usefulness of several simple performance measures that are inexpensive to compute (in the sense that they do not require expensive matrix operations involving the kernel matrix) for tuning SVM hyperparameters.

...read moreread less

Journal Article•10.1016/S0925-2312(03)00433-8•

A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine

[...]

Cao Lijuan¹, Kok Seng Chua², W. K. Chong², Heow Pueh Lee¹, Q. M. Gu - Show less +1 more•Institutions (2)

Institute of High Performance Computing Singapore¹, National University of Singapore²

01 Sep 2003-Neurocomputing

TL;DR: The experiment shows that SVM by feature extraction using PCA, KPCA or ICA can perform better than that without feature extraction, and among the three methods, there is the best performance in K PCA feature extraction; followed by ICA feature extraction.

...read moreread less

Journal Article•10.1109/TPAMI.2003.1227989•

Adaptive sparseness for supervised learning

[...]

Mário A. T. Figueiredo

01 Sep 2003-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A Bayesian approach to supervised learning, which leads to sparse solutions; that is, in which irrelevant parameters are automatically set exactly to zero, and involves no tuning or adjustment of sparseness-controlling hyperparameters.

...read moreread less

Abstract: The goal of supervised learning is to infer a functional mapping based on a set of training examples. To achieve good generalization, it is necessary to control the "complexity" of the learned function. In Bayesian approaches, this is done by adopting a prior for the parameters of the function being learned. We propose a Bayesian approach to supervised learning, which leads to sparse solutions; that is, in which irrelevant parameters are automatically set exactly to zero. Other ways to obtain sparse classifiers (such as Laplacian priors, support vector machines) involve (hyper)parameters which control the degree of sparseness of the resulting classifiers; these parameters have to be somehow adjusted/estimated from the training data. In contrast, our approach does not involve any (hyper)parameters to be adjusted or estimated. This is achieved by a hierarchical-Bayes interpretation of the Laplacian prior, which is then modified by the adoption of a Jeffreys' noninformative hyperprior. Implementation is carried out by an expectation-maximization (EM) algorithm. Experiments with several benchmark data sets show that the proposed approach yields state-of-the-art performance. In particular, our method outperforms SVMs and performs competitively with the best alternative techniques, although it involves no tuning or adjustment of sparseness-controlling hyperparameters.

...read moreread less

Journal Article•10.1016/S0031-3203(03)00085-2•

Handwritten digit recognition: benchmarking of state-of-the-art techniques

[...]

Cheng-Lin Liu¹, Kazuki Nakashima¹, Hiroshi Sako¹, Hiromichi Fujisawa¹•Institutions (1)

Hitachi¹

01 Oct 2003-Pattern Recognition

TL;DR: The results of handwritten digit recognition on well-known image databases using state-of-the-art feature extraction and classification techniques are competitive to the best ones previously reported on the same databases.

...read moreread less

Journal Article•10.1016/S0169-7439(03)00111-4•

Using support vector machines for time series prediction

[...]

Uwe Thissen¹, R van Brakel¹, A.P. de Weijer, Willem J. Melssen¹, Lutgarde M. C. Buydens¹ - Show less +1 more•Institutions (1)

Radboud University Nijmegen¹

28 Nov 2003-Chemometrics and Intelligent Laboratory Systems

TL;DR: Time series prediction is performed by support vector machines, Elman recurrent neural networks, and autoregressive moving average (ARMA) models and it appears that the AR MA model performs best for the ARMA data set while the SVM and the Elman networks perform similarly.

...read moreread less

Journal Article•10.1021/CI0341161•

Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification

[...]

Evgeny Byvatov¹, Uli Fechner¹, Jens Sadowski¹, Gisbert Schneider¹•Institutions (1)

Goethe University Frankfurt¹

27 Sep 2003-Journal of Chemical Information and Computer Sciences

TL;DR: Although SVM outperformed the ANN classifiers with regard to overall prediction accuracy, both methods were shown to complement each other, as the sets of true positives, false positives, true negatives, and false negatives produced by the two classifiers were not identical.

...read moreread less

Abstract: Support vector machine (SVM) and artificial neural network (ANN) systems were applied to a drug/nondrug classification problem as an example of binary decision problems in early-phase virtual compound filtering and screening. The results indicate that solutions obtained by SVM training seem to be more robust with a smaller standard error compared to ANN training. Generally, the SVM classifier yielded slightly higher prediction accuracy than ANN, irrespective of the type of descriptors used for molecule encoding, the size of the training data sets, and the algorithm employed for neural network training. The performance was compared using various different descriptor sets and descriptor combinations based on the 120 standard Ghose-Crippen fragment descriptors, a wide range of 180 different properties and physicochemical descriptors from the Molecular Operating Environment (MOE) package, and 225 topological pharmacophore (CATS) descriptors. For the complete set of 525 descriptors cross-validated classificati...

...read moreread less

Journal Article•10.1093/BIOINFORMATICS/BTG102•

Classification of multiple cancer types by multicategory support vector machines using gene expression data.

[...]

Yoonkyung Lee¹, Cheol Koo Lee²•Institutions (2)

Ohio State University¹, University of Wisconsin-Madison²

12 Jun 2003-Bioinformatics

TL;DR: The Multicategory SVM is introduced, which is a recently proposed extension of the binary SVM, and applied to multiclass cancer diagnosis problems, which renders the MSVM a viable alternative to other classification methods.

...read moreread less

Abstract: Motivation: High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to the cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might impose limitations in its application to multiple tumor types. We briefly introduce the Multicategory SVM, which is a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. Results: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al. ,2 001). Comparable classification accuracy shown in the applications and its flexibility render the MSVM a viable alternative to other classification methods. Supplementary Information: http://www.stat.ohio-state. edu/∼yklee/msvm.html

...read moreread less

Proceedings Article•

Incorporating diversity in active learning with support vector machines

[...]

Klaus Brinker¹•Institutions (1)

University of Paderborn¹

21 Aug 2003

TL;DR: This work presents a new approach that is especially designed to construct batches and incorporates a diversity measure that has low computational requirements making it feasible for large scale problems with several thousands of examples.

...read moreread less

Abstract: In many real world applications, active selection of training examples can significantly reduce the number of labelled training examples to learn a classification function. Different strategies in the field of support vector machines have been proposed that iteratively select a single new example from a set of unlabelled examples, query the corresponding class label and then perform retraining of the current classifier. However, to reduce computational time for training, it might be necessary to select batches of new training examples instead of single examples. Strategies for single examples can be extended straightforwardly to select batches by choosing the h > 1 examples that get the highest values for the individual selection criterion. We present a new approach that is especially designed to construct batches and incorporates a diversity measure. It has low computational requirements making it feasible for large scale problems with several thousands of examples. Experimental results indicate that this approach provides a faster method to attain a level of generalization accuracy in terms of the number of labelled examples.

...read moreread less

Journal Article•10.1016/S0031-3203(03)00175-4•

Constructing support vector machine ensemble

[...]

Hyun-Chul Kim¹, Shaoning Pang¹, Je Hongmo¹, Daijin Kim¹, Sung Yang Bang¹ - Show less +1 more•Institutions (1)

Pohang University of Science and Technology¹

01 Dec 2003-Pattern Recognition

TL;DR: Simulation results for the IRIS data classification and the hand-written digit recognition, and the fraud detection show that the proposed SVM ensemble with bagging or boosting outperforms a single SVM in terms of classification accuracy greatly.

...read moreread less

Journal Article•10.1016/J.ENGAPPAI.2003.09.006•

Artificial neural networks and support vector machines with genetic algorithm for bearing fault detection

[...]

Biswanath Samanta¹, K.R. Al-Balushi¹, S.A. Al-Araimi¹•Institutions (1)

Sultan Qaboos University¹

01 Oct 2003-Engineering Applications of Artificial Intelligence

TL;DR: A study to compare the performance of bearing fault detection using two different classifiers, namely, artificial neural networks and support vector machines (SMVs), using time-domain vibration signals of a rotating machine with normal and defective bearings.

...read moreread less

Proceedings Article•

Learning to classify texts using positive and unlabeled data

[...]

Xiaoli Li¹, Bing Liu²•Institutions (2)

National University of Singapore¹, University of Illinois at Chicago²

9 Aug 2003

TL;DR: An effective technique to solve the problem of labeled negative document classification is proposed that combines the Rocchio method and the SVM technique for classifier building and outperforms existing methods significantly.

...read moreread less

Abstract: In traditional text classification, a classifier is built using labeled training documents of every class. This paper studies a different problem. Given a set P of documents of a particular class (called positive class) and a set U of unlabeled documents that contains documents from class P and also other types of documents (called negative class documents), we want to build a classifier to classify the documents in U into documents from P and documents not from P. The key feature of this problem is that there is no labeled negative document, which makes traditional text classification techniques inapplicable. In this paper, we propose an effective technique to solve the problem. It combines the Rocchio method and the SVM technique for classifier building. Experimental results show that the new method outperforms existing methods significantly.

...read moreread less

Journal Article•10.1109/TNN.2002.806626•

Content-based audio classification and retrieval by support vector machines

[...]

Guodong Guo¹, Stan Z. Li²•Institutions (2)

University of Wisconsin-Madison¹, Microsoft²

01 Jan 2003-IEEE Transactions on Neural Networks

TL;DR: The SVMs with a binary tree recognition strategy are used to tackle the audio classification problem and experimental comparisons for audio retrieval are presented to show the superiority of this novel metric, called distance-from-boundary (DFB).

...read moreread less

Abstract: Support vector machines (SVMs) have been recently proposed as a new learning algorithm for pattern recognition. In this paper, the SVMs with a binary tree recognition strategy are used to tackle the audio classification problem. We illustrate the potential of SVMs on a common audio database, which consists of 409 sounds of 16 classes. We compare the SVMs based classification with other popular approaches. For audio retrieval, we propose a new metric, called distance-from-boundary (DFB). When a query audio is given, the system first finds a boundary inside which the query pattern is located. Then, all the audio patterns in the database are sorted by their distances to this boundary. All boundaries are learned by the SVMs and stored together with the audio database. Experimental comparisons for audio retrieval are presented to show the superiority of this novel metric to other similarity measures.

...read moreread less

Journal Article•10.1016/S1071-5819(02)00141-6•

The production and recognition of emotions in speech: features and algorithms

[...]

Oudeyer Pierre-Yves

01 Jul 2003-International Journal of Human-computer Studies \/ International Journal of Man-machine Studies

TL;DR: A technique which allows to continuously control both the age of a synthetic voice and the quantity of emotions that are expressed and the first large-scale data mining experiment about the automatic recognition of basic emotions in informal everyday short utterances is presented.

...read moreread less

Abstract: This paper presents algorithms that allow a robot to express its emotions by modulating the intonation of its voice. They are very simple and efficiently provide life-like speech thanks to the use of concatenative speech synthesis. We describe a technique which allows to continuously control both the age of a synthetic voice and the quantity of emotions that are expressed. Also, we present the first large-scale data mining experiment about the automatic recognition of basic emotions in informal everyday short utterances. We focus on the speaker-dependent problem. We compare a large set of machine learning algorithms, ranging from neural networks, Support Vector Machines or decision trees, together with 200 features, using a large database of several thousands examples. We show that the difference of performance among learning schemes can be substantial, and that some features which were previously unexplored are of crucial importance. An optimal feature set is derived through the use of a genetic algorithm. Finally, we explain how this study can be applied to real world situations in which very few examples are available. Furthermore, we describe a game to play with a personal robot which facilitates teaching of examples of emotional utterances in a natural and rather unconstrained manner.

...read moreread less

Proceedings Article•10.1145/952532.952688•

Supervised term weighting for automated text categorization

[...]

Franca Debole, Fabrizio Sebastiani¹•Institutions (1)

Istituto di Scienza e Tecnologie dell'Informazione¹

9 Mar 2003

TL;DR: It is proposed that learning from training data should also affect phase (ii), i.e. that information on the membership of training documents to categories be used to determine term weights, and is called supervised term weighting (STW).

...read moreread less

Abstract: The construction of a text classifier usually involves (i) a phase of term selection, in which the most relevant terms for the classification task are identified, (ii) a phase of term weighting, in which document weights for the selected terms are computed, and (iii) a phase of classifier learning, in which a classifier is generated from the weighted representations of the training documents. This process involves an activity of supervised learning, in which information on the membership of training documents in categories is used. Traditionally, supervised learning enters only phases (i) and (iii). In this paper we propose instead that learning from training data should also affect phase (ii), i.e. that information on the membership of training documents to categories be used to determine term weights. We call this idea supervised term weighting (STW). As an example, we propose a number of "supervised variants" of t f idf weighting, obtained by replacing the idf function with the function that has been used in phase (i) for term selection. We present experimental results obtained on the standard Reuters-21578 benchmark with one classifier learning method (support vector machines), three term selection functions (information gain, chi-square, and gain ratio), and both local and global term selection and weighting.

...read moreread less

...

Expand