Predicting sample size required for classification performance

doi:10.1186/1472-6947-12-8

Open AccessJournal Article10.1186/1472-6947-12-8

Predicting sample size required for classification performance

Rosa L. Figueroa, +3 more

- 15 Feb 2012

- BMC Medical Informatics and Decision Mak...

- Vol. 12, Iss: 1, pp 8-8

516

TL;DR: A simple and effective sample size prediction algorithm that conducts weighted fitting of learning curves and outperformed an un-weighted algorithm described in previous literature can help researchers determine annotation sample size for supervised machine learning.

Abstract: Supervised learning methods need annotated data in order to generate efficient models. Annotated data, however, is a relatively scarce resource and can be expensive to obtain. For both passive and active learning methods, there is a need to estimate the size of the annotated sample required to reach a performance target. We designed and implemented a method that fits an inverse power law model to points of a given learning curve created using a small annotated training set. Fitting is carried out using nonlinear weighted least squares optimization. The fitted model is then used to predict the classifier's performance and confidence interval for larger sample sizes. For evaluation, the nonlinear weighted curve fitting method was applied to a set of learning curves generated using clinical text and waveform classification tasks with active and passive sampling methods, and predictions were validated using standard goodness of fit measures. As control we used an un-weighted fitting method. A total of 568 models were fitted and the model predictions were compared with the observed performances. Depending on the data set and sampling method, it took between 80 to 560 annotated samples to achieve mean average and root mean squared error below 0.01. Results also show that our weighted fitting method outperformed the baseline un-weighted method (p < 0.05). This paper describes a simple and effective sample size prediction algorithm that conducts weighted fitting of learning curves. The algorithm outperformed an un-weighted algorithm described in previous literature. It can help researchers determine annotation sample size for supervised machine learning.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1371/JOURNAL.PONE.0224365

Machine learning algorithm validation with a limited sample size

Andrius Vabalas, +3 more

- 07 Nov 2019

- PLOS ONE

TL;DR: The authors' simulations show that K-fold Cross-Validation (CV) produces strongly biased performance estimates with small sample sizes, and the bias is still evident with sample size of 1000, while Nested CV and train/test split approaches produce robust and unbiased performance estimates regardless of sample size.

...read moreread less

1.1K

•Journal Article•10.1016/J.JBI.2014.02.013

Sample size estimation in diagnostic test studies of biomedical informatics

Karimollah Hajian-Tilaki

- 01 Apr 2014

- Journal of Biomedical Informatics

TL;DR: This review provided a conceptual framework of sample size calculations in the studies of diagnostic test accuracy in various conditions and test outcomes to help clinicians when designing diagnostic test studies that an adequate sample size is chosen based on statistical principles in order to guarantee the reliability of study.

...read moreread less

829

•Journal Article•10.1148/RADIOL.2018171820

Current Applications and Future Impact of Machine Learning in Radiology.

Garry Choy, +10 more

- 26 Jun 2018

- Radiology

TL;DR: Examples of current applications of machine learning and artificial intelligence techniques in diagnostic radiology and the future impact and natural extension of these techniques in radiology practice are discussed.

...read moreread less

727

Journal Article•10.1126/SCITRANSLMED.3005623

Intraoperative tissue identification using rapid evaporative ionization mass spectrometry.

Julia Balog, +11 more

- 17 Jul 2013

- Science Translational Medicine

TL;DR: This first-in-human demonstration shows that the iKnife technology is ready for widespread use in the operating room to improve the accuracy of surgical intervention in cancer, and to demonstrate the translation to real-time use in vivo in a surgical environment.

...read moreread less

601

•Journal Article•10.1016/J.ACA.2012.11.007

Sample size planning for classification models.

Claudia Beleites, +4 more

- 14 Jan 2013

- Analytica Chimica Acta

TL;DR: The test sample sizes necessary to achieve reasonable precision in the validation of classifier training and testing are determined and it is found that 75-100 samples will usually be needed to test a good but not perfect classifier.

...read moreread less

463

...

Expand

References

•Book

Statistical Power Analysis for the Behavioral Sciences

Jacob Cohen

- 01 Dec 1969

TL;DR: The concepts of power analysis are discussed in this paper, where Chi-square Tests for Goodness of Fit and Contingency Tables, t-Test for Means, and Sign Test are used.

...read moreread less

124.4K

•Journal Article•10.1023/A:1007692713085

Text Classification from Labeled and Unlabeled Documents using EM

Kamal Nigam, +3 more

- 01 May 2000

- Machine Learning

TL;DR: This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents, and presents two extensions to the algorithm that improve classification accuracy under these conditions.

...read moreread less

3.4K

•Journal Article•10.1162/153244302760185243

Support vector machine active learning with applications to text classification

Simon Tong, +1 more

- 01 Mar 2002

- Journal of Machine Learning Research

TL;DR: Experimental results showing that employing the active learning method can significantly reduce the need for labeled training instances in both the standard inductive and transductive settings are presented.

...read moreread less

3.4K

Journal Article•10.1111/J.1540-5915.1979.TB00026.X

The learning curve: historical review and comprehensive survey

Louis E. Yelle

- 01 Apr 1979

- Decision Sciences

TL;DR: The use of the learning curve has been receiving increasing attention in recent years as discussed by the authors, and much of this increase has been due to learning curve applications other than in the traditional learning curve areas.

...read moreread less

1.6K

Journal Article•10.1198/000313001317098149

Some Practical Guidelines for Effective Sample Size Determination

Russell V. Lenth

- 01 Aug 2001

- The American Statistician

TL;DR: Suggestions for successful and meaningful sample size determination are offered and criticism is made of some ill-advised shortcuts relating to power and sample size.

...read moreread less

1.2K

...

Expand

Predicting sample size required for classification performance

Chat with Paper

AI Agents for this Paper

Citations

Machine learning algorithm validation with a limited sample size

Sample size estimation in diagnostic test studies of biomedical informatics

Current Applications and Future Impact of Machine Learning in Radiology.

Intraoperative tissue identification using rapid evaporative ionization mass spectrometry.

Sample size planning for classification models.

References

Statistical Power Analysis for the Behavioral Sciences

Text Classification from Labeled and Unlabeled Documents using EM

Support vector machine active learning with applications to text classification

The learning curve: historical review and comprehensive survey

Some Practical Guidelines for Effective Sample Size Determination

Related Papers (5)

Scikit-learn: Machine Learning in Python

Random Forests

A study of cross-validation and bootstrap for accuracy estimation and model selection

Deep learning

Deep Residual Learning for Image Recognition