M statistic commands: interpoint distance distribution analysis.
TL;DR: In this paper, the M statistic is used to compare the interpoint distance distribution across groups of observations in a k-dimensional setting, where the locations are distributed in a region of the plane.
read more
Abstract: We implement the commands mstat and mtest to perform inference based on the M statistic, a statistic that can be used to compare the interpoint distance distribution across groups of observations. The analyses are based on the study of the interpoint distances between n points in a k-dimensional setting to produce a one-dimensional real-valued test statistic. The locations are distributed in a region of the plane. When we consider allninterpoint distances, the dependencies among them are difficult to express analytically, but their distribution is informative, and the M statistic can be built to summarize one aspect of this information. The two commands can be used on aw ide class of datasets to test the null hypothesis that two groups have the same (spatial) distribution. mstat and mtest return the exact M test statistic. Moreover, mtest executes a Monte Carlo-type permutation test, which returns the empirical p-value together with its confidence interval. This is the command to use in most situations, because the convergence of M to its asymptotic chi-squared distribution is slow. Both commands can be used to obtain graphical output of the empirical density function of the interpoint distance distributions in the two groups and the two- dimensional map of the n observations in the plane. The descriptions of the commands are accompanied by examples of applications with real and simulated data. We run the test on the Alt and Vach grave site dataset (Manjourides and Pagano, forthcoming, Statistics in Medicine) and reject the null hypothesis, in contradiction to other published analyses. We also show how to adapt the techniques to discrete datasets with more than one unit in each location. Finally, we report an extensive application on breast cancer data in Massachusetts; in the application, we show the compatibility of the M commands with Pisati's spmap package.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Social and spatial heterogeneity in psychosis proneness in a multilevel case-prodrome-control study.
James B. Kirkbride,Jan Stochl,Jorge Zimbron,Jorge Zimbron,Carolyn M Crane,Carolyn M Crane,Antonio Metastasio,E. Aguilar,R. Webster,S. Theegala,Nikolett Kabacs,Peter B. Jones,Jesus Perez,Jesus Perez +13 more
TL;DR: To test whether spatial and social neighbourhood patterning of people at ultra‐high risk of psychosis differs from first‐episode psychosis (FEP) participants or controls and to determine whether exposure to different social environments is evident before disorder onset.
Shape classification based on interpoint distance distributions
TL;DR: The Lipschitz continuity of the transformation taking every shape to its corresponding interpoint distance distribution is shown and a partial identifiability result is obtained showing that, under some geometrical restrictions, shapes with different planar area must have different inter point distance distributions.
23
Multinomial interpoint distances
Reza Modarres
- 01 Mar 2018
TL;DR: The properties of the squared Euclidean interpoint distances (IDs) drawn from multinomial distributions are explored and applications of IDs for testing goodness of fit, the equality of high dimensional mult inomial distributions, classification and outliers detection are discussed.
9
Multivariate Poisson interpoint distances
TL;DR: In this article, the properties of the squared interpoint distances (IDs) in samples taken from multivariate Poisson distributions were studied and the means and covariances of the average IDs were derived.
6
Modeling a minimal spanning tree
Haigang Liu,Reza Modarres +1 more
TL;DR: A multivariate Gini index is defined to measure the scatter of a data cloud based on the normalized ordered IDs of the MST, using a multivariate normal copula with beta marginals and a Dirichlet distribution to obtain beta vectors.
3
References
•Book
Applied Spatial Statistics for Public Health Data
Lance A. Waller,Carol A. Gotway +1 more
- 15 Jul 2004
TL;DR: In this paper, the authors present a method for estimating risk and risk of cancer in public health data using statistical methods for spatial data in the context of geographic information systems (GISs).
The interpoint distance distribution as a descriptor of point patterns, with an application to spatial disease clustering
Marco Bonetti,Marcello Pagano +1 more
TL;DR: Its use in the detection of spatial clustering by application to a well‐known leukaemia data set is illustrated, and the results of a simulation experiment designed to study the power characteristics of the methods within that study region and in an artificial homogenous setting are reported.
68
Simple thematic mapping
TL;DR: The tmap package is presented, a set of Stata programs designed to draw five kinds of thematic maps: choropleth, proportional symbol, deviation, dot, and label maps, intended to depict area data.
The choice of the number of bins for the M statistic
TL;DR: The relationship of M to Pearson's Chi Square statistic, xn2, is shown and it is shown that spatial data provides a unique insight into the problem through examples with simulated data and spatial data from a health care provider and results indicate that the number of bins does not appear to vary with m, thenumber of spatial locations.
12