A statistical explanation of MaxEnt for ecologists
TL;DR: A new statistical explanation of MaxEnt is described, showing that the model minimizes the relative entropy between two probability densities defined in covariate space, which is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts.
read more
Abstract: MaxEnt is a program for modelling species distributions from presence-only species records. This paper is written for ecologists and describes the MaxEnt model from a statistical perspective, making explicit links between the structure of the model, decisions required in producing a modelled distribution, and knowledge about the species and the data that might affect those decisions. To begin we discuss the characteristics of presence-only data, highlighting implications for modelling distributions. We particularly focus on the problems of sample bias and lack of information on species prevalence. The keystone of the paper is a new statistical explanation of MaxEnt which shows that the model minimizes the relative entropy between two probability densities (one estimated from the presence data and one, from the landscape) defined in covariate space. For many users, this viewpoint is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts. We then step through a detailed explanation of MaxEnt describing key components (e.g. covariates and features, and definition of the landscape extent), the mechanics of model fitting (e.g. feature selection, constraints and regularization) and outputs. Using case studies for a Banksia species native to south-west Australia and a riverine fish, we fit models and interpret them, exploring why certain choices affect the result and what this means. The fish example illustrates use of the model with vector data for linear river segments rather than raster (gridded) data. Appropriate treatments for survey bias, unprojected data, locally restricted species, and predicting to environments outside the range of the training data are demonstrated, and new capabilities discussed. Online appendices include additional details of the model and the mathematical links between previous explanations and this one, example code and data, and further information on the case studies.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Predicting the impact of climate change on the distribution of two threatened Himalayan medicinal plants of Liliaceae in Nepal
Santosh Kumar Rana,Santosh Kumar Rana,Hum Kala Rana,Suresh Kumar Ghimire,Krishna Kumar Shrestha,Sailesh Ranjitkar,Sailesh Ranjitkar +6 more
TL;DR: In this paper, the authors used MaxEnt to predict the distribution of two threatened medicinal plants, Fritillaria cirrhosa and Lilium nepalense, in response to climate change.
70
Trailing edges projected to move faster than leading edges for large pelagic fish habitats under climate change
Lucy M. Robinson,Lucy M. Robinson,Alistair J. Hobday,Hugh P. Possingham,Hugh P. Possingham,Anthony J. Richardson,Anthony J. Richardson +6 more
TL;DR: In this article, the authors compared projected shifts in the core habitats of nine large pelagic fish species (five tuna, two billfish and two shark species) off the east coast of Australia at different spatial points (centre, leading and trailing edges of the core habitat), during different seasons (summer and winter), in the near-2030 and long-term (2070) using independent species distribution models and habitat suitability models.
Rich diversity, strong endemism, but poor protection: addressing the neglect of sandy beach ecosystems in coastal conservation planning
TL;DR: In this article, the authors quantified trends in species richness and endemism on sandy shores to assess representation of beach ecosystems in existing reserve networks and compared the relative importance of different drivers of species distributions through species distribution modelling.
70
Model Thresholds are More Important than Presence Location Type: Understanding the Distribution of Lowland tapir (Tapirus Terrestris) in a Continuous Atlantic Forest of Southeast Brazil:
Darren Norris,Darren Norris +1 more
TL;DR: In this article, the authors investigated what species distribution models (SDMs) actually represent and found that SDMs represent the distribution of rare and endangered species in the United States and Europe.
69
Maxent modeling of ancient and modern agricultural terraces in the Troodos foothills, Cyprus
TL;DR: In this paper, the authors developed a predictive model using the environmental variables that differentiate ancient and modern terrace locations in the foothills of the Troodos Mountains, Cyprus, using the maximum entropy principle as applied by Maxent software to estimate probability distributions for both terrace types across geographical space.
69
References
A new look at the statistical model identification
TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Climate change 2007: the physical science basis
TL;DR: In this article, Chen et al. present a survey of the state of the art in the field of computer vision and artificial intelligence, including a discussion of the role of the human brain in computer vision.
22.4K
•Book
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
Trevor Hastie,Robert Tibshirani,Jerome H. Friedman +2 more
- 28 Jul 2013
TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.
21.3K
The elements of statistical learning. 2001
Trevor Hastie,Robert Tibshirani,Jerome H. Friedman +2 more
- 01 Jan 2001
17.2K
Related Papers (5)
Jane Elith,Catherine H. Graham,Robert P. Anderson,Miroslav Dudík,Simon Ferrier,Antoine Guisan,Robert J. Hijmans,Falk Huettmann,John R. Leathwick,Anthony Lehmann,Jin Li,Lúcia G. Lohmann,Bette A. Loiselle,Glenn Manion,Craig Moritz,Miguel Nakamura,Yoshinori Nakazawa,Jacob C. M. Mc Overton,A. Townsend Peterson,Steven J. Phillips,Karen Richardson,Ricardo Scachetti-Pereira,Robert E. Schapire,Jorge Soberón,Stephen E. Williams,Mary S. Wisz,Niklaus E. Zimmermann +26 more