Open Access
Predictive modeling using sparse logistic regression with applications
Tapio Manninen
- 31 Jan 2014
3
TL;DR: It is shown that a combination of a careful model assessment scheme and automatic feature selection by means of logistic regression model and coefficient regularization create a powerful, yet simple and practical, tool chain for applications of supervised learning and classification.
read more
Abstract: In this thesis, sparse logistic regression models are applied in a set of real world machine learning applications. The studied cases include supervised image segmentation, cancer diagnosis, and MEG data classification. Image segmentation is applied both in component detection in inkjet printed electronics manufacturing and in cell detection from microscope images. The results indicate that a simple linear classification method such as logistic regression often outperforms more sophisticated methods. Further, it is shown that the interpretability of the linear model offers great advantage in many applications. Model validation and automatic feature selection by means of `1 regularized parameter estimation have a significant role in this thesis. It is shown that a combination of a careful model assessment scheme and automatic feature selection by means of logistic regression model and coefficient regularization create a powerful, yet simple and practical, tool chain for applications of supervised learning and classification.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Predicting the recognition of natural scenes from single trial MEG recordings
Jochem W. Rieger,F Plum,Karl R. Gegenfurtner,Braun C, Preissl, H,Heinrich H. Bülthoff +4 more
- 01 May 2000
TL;DR: In this article, the authors used Support Vector Machines (SVM) to predict scene recognition success with natural scene photographs using single-trial magnetoencephalography (MEG) measures of brain activation.
44
Heart Disease Prediction System Using Optimization Techniques
Namariq Ayad Saeed,Ziyad Tariq Mustafa Al-Ta’i +1 more
- 15 Jun 2020
TL;DR: In this article, a binary particle swarm optimization algorithm combined with mutual information filter for feature selection and employed logistic regression for classification was proposed for predicting heart disease in the Cleveland heart disease dataset.
12
Techniques for Noise Robustness in Automatic Speech Recognition
Jort F. Gemmeke,Ulpu Remes +1 more
- 01 Jan 2012
TL;DR: For some reasons, this techniques for noise robustness in automatic speech recognition tends to be the representative book in this website.
References
A new look at the statistical model identification
TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Estimating the Dimension of a Model
TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.
Support-Vector Networks
Corinna Cortes,Vladimir Vapnik +1 more
TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Estimating the dimension of a model
Gideon Schwarz
- 01 Jan 2005
TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.
40.6K