A statistical explanation of MaxEnt for ecologists

doi:10.1111/J.1472-4642.2010.00725.X

Open AccessJournal Article10.1111/J.1472-4642.2010.00725.X

A statistical explanation of MaxEnt for ecologists

Jane Elith, +5 more

- 01 Jan 2011

- Diversity and Distributions

- Vol. 17, Iss: 1, pp 43-57

5.8K

TL;DR: A new statistical explanation of MaxEnt is described, showing that the model minimizes the relative entropy between two probability densities defined in covariate space, which is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts.

Abstract: MaxEnt is a program for modelling species distributions from presence-only species records. This paper is written for ecologists and describes the MaxEnt model from a statistical perspective, making explicit links between the structure of the model, decisions required in producing a modelled distribution, and knowledge about the species and the data that might affect those decisions. To begin we discuss the characteristics of presence-only data, highlighting implications for modelling distributions. We particularly focus on the problems of sample bias and lack of information on species prevalence. The keystone of the paper is a new statistical explanation of MaxEnt which shows that the model minimizes the relative entropy between two probability densities (one estimated from the presence data and one, from the landscape) defined in covariate space. For many users, this viewpoint is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts. We then step through a detailed explanation of MaxEnt describing key components (e.g. covariates and features, and definition of the landscape extent), the mechanics of model fitting (e.g. feature selection, constraints and regularization) and outputs. Using case studies for a Banksia species native to south-west Australia and a riverine fish, we fit models and interpret them, exploring why certain choices affect the result and what this means. The fish example illustrates use of the model with vector data for linear river segments rather than raster (gridded) data. Appropriate treatments for survey bias, unprojected data, locally restricted species, and predicting to environments outside the range of the training data are demonstrated, and new capabilities discussed. Online appendices include additional details of the model and the mathematical links between previous explanations and this one, example code and data, and further information on the case studies.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.3391/MBI.2014.5.4.02

An assessment of angler education and bait trade regulations to prevent invasive species introductions in the Laurentian Great Lakes.

Lucas R. Nathan, +3 more

- 01 Nov 2014

- Management of Biological Invasions

TL;DR: In this paper, the authors quantify the current distribution of AIS signage in retail bait shops in the Great Lakes region and estimate the long term viability of using retail bait shop as platform for angler education.

...read moreread less

32

•Journal Article•10.3390/V6010201

Estimating hantavirus risk in southern Argentina: a GIS-based approach combining human cases and host distribution.

Verónica Andreo, +11 more

- 14 Jan 2014

- Viruses

TL;DR: The implementation and use of thematic maps, such as the one built here, are proposed as a basic tool allowing public health authorities to focus surveillance efforts and normally scarce resources for prevention and control actions in vast areas like southern Argentina.

...read moreread less

32

•Journal Article•10.3390/IJERPH15091848

Influence of Host and Environmental Factors on the Distribution of the Japanese Encephalitis Vector Culex tritaeniorhynchus in China.

Boyang Liu, +5 more

- 27 Aug 2018

- International Journal of Environmental R...

TL;DR: The predicted suitable habitats of the JE vector were correlated with areas of high JE incidence in parts of China, and human population density and the maximum NDVI were the most important predictors in the models.

...read moreread less

32

•Journal Article•10.1111/JFR3.12549

Estimating flood extent during Hurricane Harvey using maximum entropy to build a hazard distribution model

William Mobley, +4 more

- 13 Jun 2019

- Journal of Flood Risk Management

31

Journal Article•10.1007/S13364-012-0109-6

Factors influencing the distribution of leopard in a semiarid landscape of Western India

Krishnendu Mondal, +2 more

- 01 Apr 2013

- Acta Theriologica

TL;DR: In this paper, the influence of different ecogeographic variables determining the distribution of leopards in and around Sariska Tiger Reserve through MaxEnt habitat suitability model based on camera trapping was assessed.

...read moreread less

31

...

Expand

References

•Journal Article•10.1109/TAC.1974.1100705

A new look at the statistical model identification

Hirotugu Akaike

- 01 Dec 1974

- IEEE Transactions on Automatic Control

TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.

...read moreread less

53.1K

Journal Article•10.1111/J.2517-6161.1996.TB02080.X

Regression Shrinkage and Selection via the Lasso

Robert Tibshirani

- 01 Jan 1996

- Journal of the royal statistical society...

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.

...read moreread less

45.4K

•Journal Article•10.1080/03736245.2010.480842

Climate change 2007: the physical science basis

Willem A. Landman

- 16 Jun 2010

- South African Geographical Journal

TL;DR: In this article, Chen et al. present a survey of the state of the art in the field of computer vision and artificial intelligence, including a discussion of the role of the human brain in computer vision.

...read moreread less

22.4K

•Book

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

Trevor Hastie, +2 more

- 28 Jul 2013

TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.

...read moreread less

21.3K