A statistical explanation of MaxEnt for ecologists
TL;DR: A new statistical explanation of MaxEnt is described, showing that the model minimizes the relative entropy between two probability densities defined in covariate space, which is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts.
read more
Abstract: MaxEnt is a program for modelling species distributions from presence-only species records. This paper is written for ecologists and describes the MaxEnt model from a statistical perspective, making explicit links between the structure of the model, decisions required in producing a modelled distribution, and knowledge about the species and the data that might affect those decisions. To begin we discuss the characteristics of presence-only data, highlighting implications for modelling distributions. We particularly focus on the problems of sample bias and lack of information on species prevalence. The keystone of the paper is a new statistical explanation of MaxEnt which shows that the model minimizes the relative entropy between two probability densities (one estimated from the presence data and one, from the landscape) defined in covariate space. For many users, this viewpoint is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts. We then step through a detailed explanation of MaxEnt describing key components (e.g. covariates and features, and definition of the landscape extent), the mechanics of model fitting (e.g. feature selection, constraints and regularization) and outputs. Using case studies for a Banksia species native to south-west Australia and a riverine fish, we fit models and interpret them, exploring why certain choices affect the result and what this means. The fish example illustrates use of the model with vector data for linear river segments rather than raster (gridded) data. Appropriate treatments for survey bias, unprojected data, locally restricted species, and predicting to environments outside the range of the training data are demonstrated, and new capabilities discussed. Online appendices include additional details of the model and the mathematical links between previous explanations and this one, example code and data, and further information on the case studies.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
An assessment of angler education and bait trade regulations to prevent invasive species introductions in the Laurentian Great Lakes.
TL;DR: In this paper, the authors quantify the current distribution of AIS signage in retail bait shops in the Great Lakes region and estimate the long term viability of using retail bait shop as platform for angler education.
Estimating hantavirus risk in southern Argentina: a GIS-based approach combining human cases and host distribution.
Verónica Andreo,Markus Neteler,Duccio Rocchini,Cecilia Provensal,Silvana del Carmen Levis,Ximena Porcasi,Annapaola Rizzoli,Mario Lanfri,Marcelo Scavuzzo,Noemí Pini,Delia Enria,Jaime Polop +11 more
TL;DR: The implementation and use of thematic maps, such as the one built here, are proposed as a basic tool allowing public health authorities to focus surveillance efforts and normally scarce resources for prevention and control actions in vast areas like southern Argentina.
Influence of Host and Environmental Factors on the Distribution of the Japanese Encephalitis Vector Culex tritaeniorhynchus in China.
TL;DR: The predicted suitable habitats of the JE vector were correlated with areas of high JE incidence in parts of China, and human population density and the maximum NDVI were the most important predictors in the models.
Factors influencing the distribution of leopard in a semiarid landscape of Western India
TL;DR: In this paper, the influence of different ecogeographic variables determining the distribution of leopards in and around Sariska Tiger Reserve through MaxEnt habitat suitability model based on camera trapping was assessed.
31
References
A new look at the statistical model identification
TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Climate change 2007: the physical science basis
TL;DR: In this article, Chen et al. present a survey of the state of the art in the field of computer vision and artificial intelligence, including a discussion of the role of the human brain in computer vision.
22.4K
•Book
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
Trevor Hastie,Robert Tibshirani,Jerome H. Friedman +2 more
- 28 Jul 2013
TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.
21.3K
The elements of statistical learning. 2001
Trevor Hastie,Robert Tibshirani,Jerome H. Friedman +2 more
- 01 Jan 2001
17.2K
Related Papers (5)
Jane Elith,Catherine H. Graham,Robert P. Anderson,Miroslav Dudík,Simon Ferrier,Antoine Guisan,Robert J. Hijmans,Falk Huettmann,John R. Leathwick,Anthony Lehmann,Jin Li,Lúcia G. Lohmann,Bette A. Loiselle,Glenn Manion,Craig Moritz,Miguel Nakamura,Yoshinori Nakazawa,Jacob C. M. Mc Overton,A. Townsend Peterson,Steven J. Phillips,Karen Richardson,Ricardo Scachetti-Pereira,Robert E. Schapire,Jorge Soberón,Stephen E. Williams,Mary S. Wisz,Niklaus E. Zimmermann +26 more