Towards Algorithmic Analytics for Large-scale Datasets.
TL;DR: Trends in learning from "big data" are reviewed and examples from imaging neuroscience are illustrated, showing how more elaborate, less interpretable models are embraced in order to maximize prediction accuracy.
Abstract: The traditional goals of quantitative analytics cherish simple, transparent models to generate explainable insights. Large-scale data acquisition, enabled for instance by brain scanning and genomic profiling with microarray-type techniques, has prompted a wave of statistical inventions and innovative applications. Modern analysis approaches 1) tame large variable arrays capitalizing on regularization and dimensionality-reduction strategies, 2) are increasingly backed up by empirical model validations rather than justified by mathematical proofs, 3) will compare against and build on open data and consortium repositories, as well as 4) often embrace more elaborate, less interpretable models in order to maximize prediction accuracy. Here we review these trends in learning from "big data" and illustrate examples from imaging neuroscience.
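As a rough illustration of points 1) and 2) above, the sketch below pairs a regularized linear model (ridge regression) with an empirical validation step (k-fold cross-validation) in a setting with more variables than samples. It is not the authors' method: the synthetic data, the helper names `ridge_fit` and `cv_mse`, and the candidate penalty values are all assumptions chosen for illustration.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge solution: (X^T X + lam * I)^{-1} X^T y."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(n_features), X.T @ y)

def cv_mse(X, y, lam, n_folds=5, seed=0):
    """Mean squared error on held-out folds, averaged over folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    errors = []
    for k in range(n_folds):
        test = folds[k]
        train = np.concatenate([folds[j] for j in range(n_folds) if j != k])
        w = ridge_fit(X[train], y[train], lam)
        errors.append(np.mean((X[test] @ w - y[test]) ** 2))
    return float(np.mean(errors))

# Synthetic data (an assumption for illustration): 80 samples, 200 variables,
# of which only the first 5 carry signal.
rng = np.random.default_rng(42)
X = rng.standard_normal((80, 200))
true_w = np.zeros(200)
true_w[:5] = 2.0
y = X @ true_w + 0.5 * rng.standard_normal(80)

# Hypothetical grid of penalty strengths; the one with the lowest
# held-out error is selected empirically rather than derived analytically.
lambdas = [0.01, 0.1, 1.0, 10.0, 100.0]
scores = {lam: cv_mse(X, y, lam) for lam in lambdas}
best_lam = min(scores, key=scores.get)
print(scores, best_lam)
```

The penalty term keeps the problem well-posed even though the 200 variables outnumber the 80 samples, and the choice of penalty strength is justified by held-out error alone, which mirrors the shift from mathematical proofs to empirical validation that the abstract describes.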
Citations
Ensemble deep learning in bioinformatics
TL;DR: Recent key developments in ensemble deep learning are reviewed, along with how they have benefited a wide range of bioinformatics research from basic sequence analysis to systems biology.
277
Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets.
Marc-Andre Schulz,B.T. Thomas Yeo,Joshua T. Vogelstein,Janaina Mourao-Miranda,Jakob Nikolas Kather,Konrad P. Kording,Blake A. Richards,Danilo Bzdok +8 more
TL;DR: This work systematically profiled the performance of deep, kernel, and linear models as a function of sample size on UKBiobank brain images, benchmarking how performance scales with increasingly sophisticated prediction algorithms and with increasing sample size against established machine-learning reference datasets.
Finding the needle in a high-dimensional haystack: Canonical correlation analysis for neuroscientists.
Hao-Ting Wang,Jonathan Smallwood,Janaina Mourao-Miranda,Cedric Huchuan Xia,Theodore D. Satterthwaite,Danielle S. Bassett,Danilo Bzdok +7 more
TL;DR: Canonical correlation analysis is a prototypical family of methods that is useful in identifying the links between variable sets from different modalities and so is well suited to the analysis of big neuroscience datasets.
230
Autism spectrum heterogeneity: fact or artifact?
Laurent Mottron,Danilo Bzdok +1 more
TL;DR: Several remedies to the problem of heterogeneity compatible with a categorical diagnosis of ASD are proposed, including maintaining a line of research on prototypical autism and reintroducing the qualitative properties of autism presentations and of current dimensional specifiers, language, intelligence, comorbidity, and severity.
Functional specialization within the inferior parietal lobes across cognitive domains
TL;DR: The inferior parietal lobe (IPL) is a key neural substrate underlying diverse mental processes, from basic attention to language and social cognition, that define human interactions. Its putative domain-global role appears to tie into poorly understood differences between cognitive domains in both hemispheres.
120
References
Deep learning
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
67K
•Book
An introduction to the bootstrap
Bradley Efron,Robert Tibshirani +1 more
- 01 Jan 1993
TL;DR: This article presents bootstrap methods for estimation, using simple arguments, with Minitab macros for implementing these methods, as well as some examples of how these methods could be used for estimation purposes.
•Book
The Elements of Statistical Learning
Trevor Hastie,Robert Tibshirani,Jerome H. Friedman +2 more
- 01 Jan 2001
29.4K
Bagging predictors
Leo Breiman
- 01 Aug 1996
TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.
Statistical parametric maps in functional imaging: A general linear approach
Karl J. Friston,Andrew P. Holmes,Keith J. Worsley,J-B. Poline,Chris D. Frith,Richard S. J. Frackowiak +5 more
TL;DR: In this paper, the authors present a general approach that accommodates most forms of experimental layout and ensuing analysis (designed experiments with fixed effects for factors, covariates and interaction of factors).