M statistic commands: interpoint distance distribution analysis.

doi:10.1177/1536867X1101100207

Open AccessJournal Article10.1177/1536867X1101100207

M statistic commands: interpoint distance distribution analysis.

Pietro Tebaldi, +2 more

- 01 Jul 2011

- Stata Journal

- Vol. 11, Iss: 2, pp 271-289

6

TL;DR: In this paper, the M statistic is used to compare the interpoint distance distribution across groups of observations in a k-dimensional setting, where the locations are distributed in a region of the plane.

Abstract: We implement the commands mstat and mtest to perform inference based on the M statistic, a statistic that can be used to compare the interpoint distance distribution across groups of observations. The analyses are based on the study of the interpoint distances between n points in a k-dimensional setting to produce a one-dimensional real-valued test statistic. The locations are distributed in a region of the plane. When we consider allninterpoint distances, the dependencies among them are difficult to express analytically, but their distribution is informative, and the M statistic can be built to summarize one aspect of this information. The two commands can be used on aw ide class of datasets to test the null hypothesis that two groups have the same (spatial) distribution. mstat and mtest return the exact M test statistic. Moreover, mtest executes a Monte Carlo-type permutation test, which returns the empirical p-value together with its confidence interval. This is the command to use in most situations, because the convergence of M to its asymptotic chi-squared distribution is slow. Both commands can be used to obtain graphical output of the empirical density function of the interpoint distance distributions in the two groups and the two- dimensional map of the n observations in the plane. The descriptions of the commands are accompanied by examples of applications with real and simulated data. We run the test on the Alt and Vach grave site dataset (Manjourides and Pagano, forthcoming, Statistics in Medicine) and reject the null hypothesis, in contradiction to other published analyses. We also show how to adapt the techniques to discrete datasets with more than one unit in each location. Finally, we report an extensive application on breast cancer data in Massachusetts; in the application, we show the compatibility of the M commands with Pisati's spmap package.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1111/ACPS.12384

Social and spatial heterogeneity in psychosis proneness in a multilevel case-prodrome-control study.

James B. Kirkbride, +13 more

- 01 Oct 2015

- Acta Psychiatrica Scandinavica

TL;DR: To test whether spatial and social neighbourhood patterning of people at ultra‐high risk of psychosis differs from first‐episode psychosis (FEP) participants or controls and to determine whether exposure to different social environments is evident before disorder onset.

...read moreread less

29

•Journal Article•10.1016/J.JMVA.2015.09.017

Shape classification based on interpoint distance distributions

José R. Berrendero, +2 more

- 01 Apr 2016

- Journal of Multivariate Analysis

TL;DR: The Lipschitz continuity of the transformation taking every shape to its corresponding interpoint distance distribution is shown and a partial identifiability result is obtained showing that, under some geometrical restrictions, shapes with different planar area must have different inter point distance distributions.

...read moreread less

23

Journal Article•10.1007/S00362-016-0766-7

Multinomial interpoint distances

Reza Modarres

- 01 Mar 2018

TL;DR: The properties of the squared Euclidean interpoint distances (IDs) drawn from multinomial distributions are explored and applications of IDs for testing goodness of fit, the equality of high dimensional mult inomial distributions, classification and outliers detection are discussed.

...read moreread less

9

Journal Article•10.1016/J.SPL.2016.01.025

Multivariate Poisson interpoint distances

Reza Modarres

- 01 May 2016

- Statistics & Probability Letters

TL;DR: In this article, the properties of the squared interpoint distances (IDs) in samples taken from multivariate Poisson distributions were studied and the means and covariances of the average IDs were derived.

...read moreread less

6

Journal Article•10.1080/03610918.2016.1148726

Modeling a minimal spanning tree

Haigang Liu, +1 more

- 28 Feb 2017

- Communications in Statistics - Simulatio...

TL;DR: A multivariate Gini index is defined to measure the scatter of a data cloud based on the normalized ordered IDs of the MST, using a multivariate normal copula with beta marginals and a Dirichlet distribution to obtain beta vectors.

...read moreread less

3

References

•Book

Applied Spatial Statistics for Public Health Data

Lance A. Waller, +1 more

- 15 Jul 2004

TL;DR: In this paper, the authors present a method for estimating risk and risk of cancer in public health data using statistical methods for spatial data in the context of geographic information systems (GISs).

...read moreread less

1.1K

Book•10.1002/0471662682

Applied Spatial Statistics for Public Health Data: Waller/Applied Spatial Statistics

Lance A. Waller, +1 more

- 01 Jul 2004

1K

Journal Article•10.1002/SIM.1947

The interpoint distance distribution as a descriptor of point patterns, with an application to spatial disease clustering

Marco Bonetti, +1 more

- 15 Mar 2005

- Statistics in Medicine

TL;DR: Its use in the detection of spatial clustering by application to a well‐known leukaemia data set is illustrated, and the results of a simulation experiment designed to study the power characteristics of the methods within that study region and in an artificial homogenous setting are reported.

...read moreread less

68

•Journal Article•10.1177/1536867X0400400401

Simple thematic mapping

Maurizio Pisati

- 01 Dec 2004

- Stata Journal

TL;DR: The tmap package is presented, a set of Stata programs designed to draw five kinds of thematic maps: choropleth, proportional symbol, deviation, dot, and label maps, intended to depict area data.

...read moreread less

19

•Journal Article•10.1016/J.CSDA.2009.03.005

The choice of the number of bins for the M statistic

Laura F. White, +2 more

- 01 Aug 2009

- Computational Statistics & Data Analysis

TL;DR: The relationship of M to Pearson's Chi Square statistic, xn2, is shown and it is shown that spatial data provides a unique insight into the problem through examples with simulated data and spatial data from a health care provider and results indicate that the number of bins does not appear to vary with m, thenumber of spatial locations.

...read moreread less

12