Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Support vector machine
  4. 2003
  1. Home
  2. Topics
  3. Support vector machine
  4. 2003
Showing papers on "Support vector machine published in 2003"
Journal Article•10.1198/JASA.2003.S269•
Learning With Kernels: Support Vector Machines, Regularization, Optimization, and Beyond

[...]

Christopher Williams1•
University of Edinburgh1
01 Jun 2003-Journal of the American Statistical Association
TL;DR: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond as discussed by the authors The Journal of the American Statistical Association: Vol 98, No 462, pp 489-489
Abstract: (2003) Learning With Kernels: Support Vector Machines, Regularization, Optimization, and Beyond Journal of the American Statistical Association: Vol 98, No 462, pp 489-489

4,057 citations

Journal Article•10.1016/S0925-2312(03)00372-2•
Financial time series forecasting using support vector machines

[...]

Kyoung-jae Kim1•
College of Business Administration1
01 Sep 2003-Neurocomputing
TL;DR: The experimental results show that SVM provides a promising alternative to stock market prediction and the feasibility of applying SVM in financial forecasting is examined by comparing it with back-propagation neural networks and case-based reasoning.

1,867 citations

Journal Article•10.1162/089976603321891855•
Asymptotic behaviors of support vector machines with Gaussian kernel

[...]

S. Sathiya Keerthi1, Chih-Jen Lin2•
National University of Singapore1, National Taiwan University2
01 Jul 2003-Neural Computation
TL;DR: The behavior of the SVM classifier when these hyper parameters take very small or very large values is analyzed, which helps in understanding thehyperparameter space that leads to an efficient heuristic method of searching for hyperparameter values with small generalization errors.
Abstract: Support vector machines (SVMs) with the gaussian (RBF) kernel have been popular for practical use. Model selection in this class of SVMs involves two hyperparameters: the penalty parameter C and the kernel width σ. This letter analyzes the behavior of the SVM classifier when these hyperparameters take very small or very large values. Our results help in understanding the hyperparameter space that leads to an efficient heuristic method of searching for hyperparameter values with small generalization errors. The analysis also indicates that if complete model selection using the gaussian kernel has been conducted, there is no need to consider linear SVM.

1,742 citations

Journal Article•10.1057/PALGRAVE.JORS.2601545•
Benchmarking state-of-the-art classification algorithms for credit scoring

[...]

Bart Baesens1, T. Van Gestel1, Stijn Viaene1, M Stepanova2, Johan A. K. Suykens1, Jan Vanthienen1 •
Katholieke Universiteit Leuven1, UBS2
09 Jun 2003-Journal of the Operational Research Society
TL;DR: It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.
Abstract: In this paper, we study the performance of various state-of-the-art classification algorithms applied to eight real-life credit scoring data sets. Some of the data sets originate from major Benelux and UK financial institutions. Different types of classifiers are evaluated and compared. Besides the well-known classification algorithms (eg logistic regression, discriminant analysis, k-nearest neighbour, neural networks and decision trees), this study also investigates the suitability and performance of some recently proposed, advanced kernel-based classification algorithms such as support vector machines and least-squares support vector machines (LS-SVMs). The performance is assessed using the classification accuracy and the area under the receiver operating characteristic curve. Statistically significant performance differences are identified using the appropriate test statistics. It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.

1,351 citations

Proceedings Article•10.1145/956750.956759•
Adaptive duplicate detection using learnable string similarity measures

[...]

Mikhail Bilenko1, Raymond J. Mooney1•
University of Texas at Austin1
24 Aug 2003
TL;DR: This paper proposes to employ learnable text distance functions for each database field, and shows that such measures are capable of adapting to the specific notion of similarity that is appropriate for the field's domain.
Abstract: The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied on generic or manually tuned distance metrics for estimating the similarity of potential duplicates. In this paper, we present a framework for improving duplicate detection using trainable measures of textual similarity. We propose to employ learnable text distance functions for each database field, and show that such measures are capable of adapting to the specific notion of similarity that is appropriate for the field's domain. We present two learnable text similarity measures suitable for this task: an extended variant of learnable string edit distance, and a novel vector-space based measure that employs a Support Vector Machine (SVM) for training. Experimental results on a range of datasets show that our framework can improve duplicate detection accuracy over traditional techniques.

1,103 citations

Proceedings Article•
1-norm Support Vector Machines

[...]

Ji Zhu1, Saharon Rosset1, Robert Tibshirani1, Trevor Hastie1•
Stanford University1
9 Dec 2003
TL;DR: It is argued that the 1-norm SVM may have some advantage over the standard 2- norm SVM, especially when there are redundant noise features, and an efficient algorithm is proposed that computes the whole solution path of the1-normSVM, hence facilitates adaptive selection of the tuning parameter for the 1
Abstract: The standard 2-norm SVM is known for its good performance in two-class classification. In this paper, we consider the 1-norm SVM. We argue that the 1-norm SVM may have some advantage over the standard 2-norm SVM, especially when there are redundant noise features. We also propose an efficient algorithm that computes the whole solution path of the 1-norm SVM, hence facilitates adaptive selection of the tuning parameter for the 1-norm SVM.

1,061 citations

Journal Article•10.1109/TNN.2003.820556•
Support vector machine with adaptive parameters in financial time series forecasting

[...]

L.J. Cao1, Francis E. H. Tay1•
National University of Singapore1
01 Nov 2003-IEEE Transactions on Neural Networks
TL;DR: SVM with adaptive parameters can both achieve higher generalization performance and use fewer support vectors than the standard SVM in financial forecasting.
Abstract: A novel type of learning machine called support vector machine (SVM) has been receiving increasing interest in areas ranging from its original application in pattern recognition to other applications such as regression estimation due to its remarkable generalization performance. This paper deals with the application of SVM in financial time series forecasting. The feasibility of applying SVM in financial forecasting is first examined by comparing it with the multilayer back-propagation (BP) neural network and the regularized radial basis function (RBF) neural network. The variability in performance of SVM with respect to the free parameters is investigated experimentally. Adaptive parameters are then proposed by incorporating the nonstationarity of financial time series into SVM. Five real futures contracts collated from the Chicago Mercantile Market are used as the data sets. The simulation shows that among the three methods, SVM outperforms the BP neural network in financial forecasting, and there are comparable generalization performance between SVM and the regularized RBF neural network. Furthermore, the free parameters of SVM have a great effect on the generalization performance. SVM with adaptive parameters can both achieve higher generalization performance and use fewer support vectors than the standard SVM in financial forecasting.

1,044 citations

Journal Article•10.1214/AOS/1079120130•
Statistical behavior and consistency of classification methods based on convex risk minimization

[...]

Tong Zhang
01 Feb 2003-Annals of Statistics
TL;DR: This study sheds light on the good performance of some recently proposed linear classification methods including boosting and support vector machines and shows their limitations and suggests possible improvements.
Abstract: We study how closely the optimal Bayes error rate can be approximately reached using a classification algorithm that computes a classifier by minimizing a convex upper bound of the classification error function. The measurement of closeness is characterized by the loss function used in the estimation. We show that such a classification scheme can be generally regarded as a (nonmaximum-likelihood) conditional in-class probability estimate, and we use this analysis to compare various convex loss functions that have appeared in the literature. Furthermore, the theoretical insight allows us to design good loss functions with desirable properties. Another aspect of our analysis is to demonstrate the consistency of certain classification methods using convex risk minimization. This study sheds light on the good performance of some recently proposed linear classification methods including boosting and support vector machines. It also shows their limitations and suggests possible improvements.

960 citations

Journal Article•10.1016/S0925-2312(03)00431-4•
The support vector machine under test

[...]

David Meyer1, Friedrich Leisch1, Kurt Hornik2•
Vienna University of Technology1, Vienna University of Economics and Business2
01 Sep 2003-Neurocomputing
TL;DR: A popular SVM implementation is compared to 16 classification methods and 9 regression methods accessible through the software R by the means of standard performance measures and bias-variance decompositions which showed mostly good performances both on classification and regression tasks, but other methods proved to be very competitive.

907 citations

Proceedings Article•10.1109/ICDM.2003.1250918•
Building text classifiers using positive and unlabeled examples

[...]

Bing Liu1, Yang Dai, Xiaoli Li2, Wee Sun Lee2, Philip S. Yu3 •
University of Illinois at Chicago1, National University of Singapore2, IBM3
19 Nov 2003
TL;DR: A more principled approach to solving the problem of building text classifiers using positive and unlabeled examples based on a biased formulation of SVM is proposed, and it is shown experimentally that it is more accurate than the existing techniques.
Abstract: We study the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. We first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques.

865 citations

Proceedings Article•
Learning a Distance Metric from Relative Comparisons

[...]

Matthew Schultz1, Thorsten Joachims1•
Cornell University1
9 Dec 2003
TL;DR: Taking a Support Vector Machine (SVM) approach, an algorithm is developed that provides a flexible way of describing qualitative training data as a set of constraints that leads to a convex quadratic programming problem that can be solved by adapting standard methods for SVM training.
Abstract: This paper presents a method for learning a distance metric from relative comparison such as "A is closer to B than A is to C". Taking a Support Vector Machine (SVM) approach, we develop an algorithm that provides a flexible way of describing qualitative training data as a set of constraints. We show that such constraints lead to a convex quadratic programming problem that can be solved by adapting standard methods for SVM training. We empirically evaluate the performance and the modelling flexibility of the algorithm on a collection of text documents.
Proceedings Article•
Transductive learning via spectral graph partitioning

[...]

Thorsten Joachims1•
Cornell University1
21 Aug 2003
TL;DR: This work proposes an algorithm that robustly achieves good generalization performance and that can be trained efficiently, and shows a connection to transductive Support Vector Machines, and that an effective Co-Training algorithm arises as a special case.
Abstract: We present a new method for transductive learning, which can be seen as a transductive version of the k nearest-neighbor classifier. Unlike for many other transductive learning methods, the training problem has a meaningful relaxation that can be solved globally optimally using spectral methods. We propose an algorithm that robustly achieves good generalization performance and that can be trained efficiently. A key advantage of the algorithm is that it does not require additional heuristics to avoid unbalanced splits. Furthermore, we show a connection to transductive Support Vector Machines, and that an effective Co-Training algorithm arises as a special case.
Proceedings Article•10.1109/ICDM.2003.1250950•
Cost-sensitive learning by cost-proportionate example weighting

[...]

Bianca Zadrozny1, John Langford1, Naoki Abe1•
IBM1
19 Nov 2003
TL;DR: Costing is proposed, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.
Abstract: We propose and evaluate a family of methods for converting classifier learning algorithms and classification theory into cost-sensitive algorithms and theory. The proposed conversion is based on cost-proportionate weighting of the training examples, which can be realized either by feeding the weights to the classification algorithm (as often done in boosting), or by careful subsampling. We give some theoretical performance guarantees on the proposed methods, as well as empirical evidence that they are practical alternatives to existing approaches. In particular, we propose costing, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.
Journal Article•10.1109/TNSRE.2003.814441•
Comparison of linear, nonlinear, and feature selection methods for EEG signal classification

[...]

Deon Garrett1, David A. Peterson1, Charles W. Anderson1, Michael H. Thaut1•
Colorado State University1
28 Jul 2003
TL;DR: The results of a linear (linear discriminant analysis) and two nonlinear classifiers applied to the classification of spontaneous EEG during five mental tasks are reported, showing that non linear classifiers produce only slightly better classification results.
Abstract: The reliable operation of brain-computer interfaces (BCIs) based on spontaneous electroencephalogram (EEG) signals requires accurate classification of multichannel EEG. The design of EEG representations and classifiers for BCI are open research questions whose difficulty stems from the need to extract complex spatial and temporal patterns from noisy multidimensional time series obtained from EEG measurements. The high-dimensional and noisy nature of EEG may limit the advantage of nonlinear classification methods over linear ones. This paper reports the results of a linear (linear discriminant analysis) and two nonlinear classifiers (neural networks and support vector machines) applied to the classification of spontaneous EEG during five mental tasks, showing that nonlinear classifiers produce only slightly better classification results. An approach to feature selection based on genetic algorithms is also presented with preliminary results of application to EEG during finger movement.
Journal Article•
Use of the zero norm with linear models and kernel methods

[...]

Jason Weston1, André Elisseeff1, Bernhard Schölkopf1, Michael E. Tipping2•
Max Planck Society1, Microsoft2
01 Mar 2003-Journal of Machine Learning Research
TL;DR: In this article, the authors explore the use of the zero-norm of the parameters of linear models in learning and derive a simple but practical method for variable or feature selection, minimizing training error and ensuring sparsity in solutions.
Abstract: We explore the use of the so-called zero-norm of the parameters of linear models in learning. Minimization of such a quantity has many uses in a machine learning context: for variable or feature selection, minimizing training error and ensuring sparsity in solutions. We derive a simple but practical method for achieving these goals and discuss its relationship to existing techniques of minimizing the zero-norm. The method boils down to implementing a simple modification of vanilla SVM, namely via an iterative multiplicative rescaling of the training data. Applications we investigate which aid our discussion include variable and feature selection on biological microarray data, and multicategory classification.
Journal Article•
Variable selection using svm based criteria

[...]

Alain Rakotomamonjy1•
Institut national des sciences appliquées de Rouen1
01 Mar 2003-Journal of Machine Learning Research
TL;DR: New methods to evaluate variable subset relevance with a view to variable selection based on weight vector derivative achieves good results and performs consistently well over the datasets used.
Abstract: We propose new methods to evaluate variable subset relevance with a view to variable selection. Relevance criteria are derived from Support Vector Machines and are based on weight vector ||w||2 or generalization error bounds sensitivity with respect to a variable. Experiments on linear and non-linear toy problems and real-world datasets have been carried out to assess the effectiveness of these criteria. Results show that the criterion based on weight vector derivative achieves good results and performs consistently well over the datasets we used.
Journal Article•10.1016/S0925-2312(02)00601-X•
Evaluation of simple performance measures for tuning SVM hyperparameters

[...]

Kaibo Duan1, S. Sathiya Keerthi1, Aun-Neow Poo1•
National University of Singapore1
01 Apr 2003-Neurocomputing
TL;DR: The empirically study the usefulness of several simple performance measures that are inexpensive to compute (in the sense that they do not require expensive matrix operations involving the kernel matrix) for tuning SVM hyperparameters.
Journal Article•10.1016/S0925-2312(03)00433-8•
A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine

[...]

Cao Lijuan1, Kok Seng Chua2, W. K. Chong2, Heow Pueh Lee1, Q. M. Gu •
Institute of High Performance Computing Singapore1, National University of Singapore2
01 Sep 2003-Neurocomputing
TL;DR: The experiment shows that SVM by feature extraction using PCA, KPCA or ICA can perform better than that without feature extraction, and among the three methods, there is the best performance in K PCA feature extraction; followed by ICA feature extraction.
Journal Article•10.1109/TPAMI.2003.1227989•
Adaptive sparseness for supervised learning

[...]

Mário A. T. Figueiredo
01 Sep 2003-IEEE Transactions on Pattern Analysis and Machine Intelligence
TL;DR: A Bayesian approach to supervised learning, which leads to sparse solutions; that is, in which irrelevant parameters are automatically set exactly to zero, and involves no tuning or adjustment of sparseness-controlling hyperparameters.
Abstract: The goal of supervised learning is to infer a functional mapping based on a set of training examples. To achieve good generalization, it is necessary to control the "complexity" of the learned function. In Bayesian approaches, this is done by adopting a prior for the parameters of the function being learned. We propose a Bayesian approach to supervised learning, which leads to sparse solutions; that is, in which irrelevant parameters are automatically set exactly to zero. Other ways to obtain sparse classifiers (such as Laplacian priors, support vector machines) involve (hyper)parameters which control the degree of sparseness of the resulting classifiers; these parameters have to be somehow adjusted/estimated from the training data. In contrast, our approach does not involve any (hyper)parameters to be adjusted or estimated. This is achieved by a hierarchical-Bayes interpretation of the Laplacian prior, which is then modified by the adoption of a Jeffreys' noninformative hyperprior. Implementation is carried out by an expectation-maximization (EM) algorithm. Experiments with several benchmark data sets show that the proposed approach yields state-of-the-art performance. In particular, our method outperforms SVMs and performs competitively with the best alternative techniques, although it involves no tuning or adjustment of sparseness-controlling hyperparameters.
Journal Article•10.1016/S0031-3203(03)00085-2•
Handwritten digit recognition: benchmarking of state-of-the-art techniques

[...]

Cheng-Lin Liu1, Kazuki Nakashima1, Hiroshi Sako1, Hiromichi Fujisawa1•
Hitachi1
01 Oct 2003-Pattern Recognition
TL;DR: The results of handwritten digit recognition on well-known image databases using state-of-the-art feature extraction and classification techniques are competitive to the best ones previously reported on the same databases.
Journal Article•10.1016/S0169-7439(03)00111-4•
Using support vector machines for time series prediction

[...]

Uwe Thissen1, R van Brakel1, A.P. de Weijer, Willem J. Melssen1, Lutgarde M. C. Buydens1 •
Radboud University Nijmegen1
28 Nov 2003-Chemometrics and Intelligent Laboratory Systems
TL;DR: Time series prediction is performed by support vector machines, Elman recurrent neural networks, and autoregressive moving average (ARMA) models and it appears that the AR MA model performs best for the ARMA data set while the SVM and the Elman networks perform similarly.
Journal Article•10.1021/CI0341161•
Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification

[...]

Evgeny Byvatov1, Uli Fechner1, Jens Sadowski1, Gisbert Schneider1•
Goethe University Frankfurt1
27 Sep 2003-Journal of Chemical Information and Computer Sciences
TL;DR: Although SVM outperformed the ANN classifiers with regard to overall prediction accuracy, both methods were shown to complement each other, as the sets of true positives, false positives, true negatives, and false negatives produced by the two classifiers were not identical.
Abstract: Support vector machine (SVM) and artificial neural network (ANN) systems were applied to a drug/nondrug classification problem as an example of binary decision problems in early-phase virtual compound filtering and screening. The results indicate that solutions obtained by SVM training seem to be more robust with a smaller standard error compared to ANN training. Generally, the SVM classifier yielded slightly higher prediction accuracy than ANN, irrespective of the type of descriptors used for molecule encoding, the size of the training data sets, and the algorithm employed for neural network training. The performance was compared using various different descriptor sets and descriptor combinations based on the 120 standard Ghose-Crippen fragment descriptors, a wide range of 180 different properties and physicochemical descriptors from the Molecular Operating Environment (MOE) package, and 225 topological pharmacophore (CATS) descriptors. For the complete set of 525 descriptors cross-validated classificati...
Journal Article•10.1093/BIOINFORMATICS/BTG102•
Classification of multiple cancer types by multicategory support vector machines using gene expression data.

[...]

Yoonkyung Lee1, Cheol Koo Lee2•
Ohio State University1, University of Wisconsin-Madison2
12 Jun 2003-Bioinformatics
TL;DR: The Multicategory SVM is introduced, which is a recently proposed extension of the binary SVM, and applied to multiclass cancer diagnosis problems, which renders the MSVM a viable alternative to other classification methods.
Abstract: Motivation: High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to the cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might impose limitations in its application to multiple tumor types. We briefly introduce the Multicategory SVM, which is a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. Results: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al. ,2 001). Comparable classification accuracy shown in the applications and its flexibility render the MSVM a viable alternative to other classification methods. Supplementary Information: http://www.stat.ohio-state. edu/∼yklee/msvm.html
Proceedings Article•
Incorporating diversity in active learning with support vector machines

[...]

Klaus Brinker1•
University of Paderborn1
21 Aug 2003
TL;DR: This work presents a new approach that is especially designed to construct batches and incorporates a diversity measure that has low computational requirements making it feasible for large scale problems with several thousands of examples.
Abstract: In many real world applications, active selection of training examples can significantly reduce the number of labelled training examples to learn a classification function. Different strategies in the field of support vector machines have been proposed that iteratively select a single new example from a set of unlabelled examples, query the corresponding class label and then perform retraining of the current classifier. However, to reduce computational time for training, it might be necessary to select batches of new training examples instead of single examples. Strategies for single examples can be extended straightforwardly to select batches by choosing the h > 1 examples that get the highest values for the individual selection criterion. We present a new approach that is especially designed to construct batches and incorporates a diversity measure. It has low computational requirements making it feasible for large scale problems with several thousands of examples. Experimental results indicate that this approach provides a faster method to attain a level of generalization accuracy in terms of the number of labelled examples.
Journal Article•10.1016/S0031-3203(03)00175-4•
Constructing support vector machine ensemble

[...]

Hyun-Chul Kim1, Shaoning Pang1, Je Hongmo1, Daijin Kim1, Sung Yang Bang1 •
Pohang University of Science and Technology1
01 Dec 2003-Pattern Recognition
TL;DR: Simulation results for the IRIS data classification and the hand-written digit recognition, and the fraud detection show that the proposed SVM ensemble with bagging or boosting outperforms a single SVM in terms of classification accuracy greatly.
Journal Article•10.1016/J.ENGAPPAI.2003.09.006•
Artificial neural networks and support vector machines with genetic algorithm for bearing fault detection

[...]

Biswanath Samanta1, K.R. Al-Balushi1, S.A. Al-Araimi1•
Sultan Qaboos University1
01 Oct 2003-Engineering Applications of Artificial Intelligence
TL;DR: A study to compare the performance of bearing fault detection using two different classifiers, namely, artificial neural networks and support vector machines (SMVs), using time-domain vibration signals of a rotating machine with normal and defective bearings.
Proceedings Article•
Learning to classify texts using positive and unlabeled data

[...]

Xiaoli Li1, Bing Liu2•
National University of Singapore1, University of Illinois at Chicago2
9 Aug 2003
TL;DR: An effective technique to solve the problem of labeled negative document classification is proposed that combines the Rocchio method and the SVM technique for classifier building and outperforms existing methods significantly.
Abstract: In traditional text classification, a classifier is built using labeled training documents of every class. This paper studies a different problem. Given a set P of documents of a particular class (called positive class) and a set U of unlabeled documents that contains documents from class P and also other types of documents (called negative class documents), we want to build a classifier to classify the documents in U into documents from P and documents not from P. The key feature of this problem is that there is no labeled negative document, which makes traditional text classification techniques inapplicable. In this paper, we propose an effective technique to solve the problem. It combines the Rocchio method and the SVM technique for classifier building. Experimental results show that the new method outperforms existing methods significantly.
Journal Article•10.1109/TNN.2002.806626•
Content-based audio classification and retrieval by support vector machines

[...]

Guodong Guo1, Stan Z. Li2•
University of Wisconsin-Madison1, Microsoft2
01 Jan 2003-IEEE Transactions on Neural Networks
TL;DR: The SVMs with a binary tree recognition strategy are used to tackle the audio classification problem and experimental comparisons for audio retrieval are presented to show the superiority of this novel metric, called distance-from-boundary (DFB).
Abstract: Support vector machines (SVMs) have been recently proposed as a new learning algorithm for pattern recognition. In this paper, the SVMs with a binary tree recognition strategy are used to tackle the audio classification problem. We illustrate the potential of SVMs on a common audio database, which consists of 409 sounds of 16 classes. We compare the SVMs based classification with other popular approaches. For audio retrieval, we propose a new metric, called distance-from-boundary (DFB). When a query audio is given, the system first finds a boundary inside which the query pattern is located. Then, all the audio patterns in the database are sorted by their distances to this boundary. All boundaries are learned by the SVMs and stored together with the audio database. Experimental comparisons for audio retrieval are presented to show the superiority of this novel metric to other similarity measures.
Journal Article•10.1016/S1071-5819(02)00141-6•
The production and recognition of emotions in speech: features and algorithms

[...]

Oudeyer Pierre-Yves
01 Jul 2003-International Journal of Human-computer Studies \/ International Journal of Man-machine Studies
TL;DR: A technique which allows to continuously control both the age of a synthetic voice and the quantity of emotions that are expressed and the first large-scale data mining experiment about the automatic recognition of basic emotions in informal everyday short utterances is presented.
Abstract: This paper presents algorithms that allow a robot to express its emotions by modulating the intonation of its voice. They are very simple and efficiently provide life-like speech thanks to the use of concatenative speech synthesis. We describe a technique which allows to continuously control both the age of a synthetic voice and the quantity of emotions that are expressed. Also, we present the first large-scale data mining experiment about the automatic recognition of basic emotions in informal everyday short utterances. We focus on the speaker-dependent problem. We compare a large set of machine learning algorithms, ranging from neural networks, Support Vector Machines or decision trees, together with 200 features, using a large database of several thousands examples. We show that the difference of performance among learning schemes can be substantial, and that some features which were previously unexplored are of crucial importance. An optimal feature set is derived through the use of a genetic algorithm. Finally, we explain how this study can be applied to real world situations in which very few examples are available. Furthermore, we describe a game to play with a personal robot which facilitates teaching of examples of emotional utterances in a natural and rather unconstrained manner.
Proceedings Article•10.1145/952532.952688•
Supervised term weighting for automated text categorization

[...]

Franca Debole, Fabrizio Sebastiani1•
Istituto di Scienza e Tecnologie dell'Informazione1
9 Mar 2003
TL;DR: It is proposed that learning from training data should also affect phase (ii), i.e. that information on the membership of training documents to categories be used to determine term weights, and is called supervised term weighting (STW).
Abstract: The construction of a text classifier usually involves (i) a phase of term selection, in which the most relevant terms for the classification task are identified, (ii) a phase of term weighting, in which document weights for the selected terms are computed, and (iii) a phase of classifier learning, in which a classifier is generated from the weighted representations of the training documents. This process involves an activity of supervised learning, in which information on the membership of training documents in categories is used. Traditionally, supervised learning enters only phases (i) and (iii). In this paper we propose instead that learning from training data should also affect phase (ii), i.e. that information on the membership of training documents to categories be used to determine term weights. We call this idea supervised term weighting (STW). As an example, we propose a number of "supervised variants" of t f idf weighting, obtained by replacing the idf function with the function that has been used in phase (i) for term selection. We present experimental results obtained on the standard Reuters-21578 benchmark with one classifier learning method (support vector machines), three term selection functions (information gain, chi-square, and gain ratio), and both local and global term selection and weighting.
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve