On the Distribution of the Two-Sample Cramer-von Mises Criterion
TL;DR: This note studies the distribution of the two-sample Cramer-von Mises criterion $T$ for small sample sizes $N$ and $M$, tabulates it at some conventional significance levels, and finds that the limiting distribution is a surprisingly good approximation to the exact distribution for moderate sample sizes.
Abstract: The Cramer-von Mises $\omega^2$ criterion for testing that a sample, $x_1, \cdots, x_N$, has been drawn from a specified continuous distribution $F(x)$ is \begin{equation*}\tag{1}\omega^2 = \int_{-\infty}^{\infty} [F_N(x) - F(x)]^2 \, dF(x),\end{equation*} where $F_N(x)$ is the empirical distribution function of the sample; that is, $F_N(x) = k/N$ if exactly $k$ observations are less than or equal to $x$ $(k = 0, 1, \cdots, N)$. If there is a second sample, $y_1, \cdots, y_M$, a test of the hypothesis that the two samples come from the same (unspecified) continuous distribution can be based on the analogue of $N\omega^2$, namely \begin{equation*}\tag{2} T = [NM/(N + M)] \int_{-\infty}^{\infty} [F_N(x) - G_M(x)]^2 \, dH_{N+M}(x),\end{equation*} where $G_M(x)$ is the empirical distribution function of the second sample and $H_{N+M}(x)$ is the empirical distribution function of the two samples together [that is, $(N + M)H_{N+M}(x) = NF_N(x) + MG_M(x)$]. The limiting distribution of $N\omega^2$ as $N \rightarrow \infty$ has been tabulated [2], and it has been shown ([3], [4a], and [7]) that $T$ has the same limiting distribution as $N \rightarrow \infty$, $M \rightarrow \infty$, and $N/M \rightarrow \lambda$, where $\lambda$ is any finite positive constant. In this note we consider the distribution of $T$ for small values of $N$ and $M$ and present tables to permit use of the criterion at some conventional significance levels for small values of $N$ and $M$. The limiting distribution seems a surprisingly good approximation to the exact distribution for moderate sample sizes (corresponding to the same feature for $N\omega^2$ [6]). The accuracy of approximation is better than in the case of the two-sample Kolmogorov-Smirnov statistic studied by Hodges [4].
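As an illustration of definition (2), here is a minimal Python sketch; it is not the paper's own computation. It uses the fact that $H_{N+M}$ puts mass $1/(N+M)$ on each pooled observation, so the integral becomes an average of $[F_N(z) - G_M(z)]^2$ over the pooled sample. The function names `two_sample_cvm` and `exact_null_distribution` are hypothetical. The second function assumes no ties, so that under the null hypothesis every split of the ranks $1, \cdots, N+M$ into groups of size $N$ and $M$ is equally likely. Brute-force enumeration is only practical for small $N$ and $M$.

```python
from bisect import bisect_right
from collections import Counter
from itertools import combinations


def two_sample_cvm(x, y):
    """Two-sample Cramer-von Mises statistic T from definition (2)."""
    xs, ys = sorted(x), sorted(y)
    n, m = len(xs), len(ys)
    total = 0.0
    # dH_{N+M} puts mass 1/(N+M) on each pooled observation.
    for z in xs + ys:
        f_n = bisect_right(xs, z) / n  # F_N(z): share of x values <= z
        g_m = bisect_right(ys, z) / m  # G_M(z): share of y values <= z
        total += (f_n - g_m) ** 2
    # [NM/(N+M)] * (1/(N+M)) * sum of squared differences
    return n * m / (n + m) ** 2 * total


def exact_null_distribution(n, m):
    """Exact null distribution of T by enumerating all rank splits.

    Assumes continuous data (no ties), so T depends only on the ranks and
    all C(n+m, n) assignments of ranks to the first sample are equally likely.
    """
    ranks = range(1, n + m + 1)
    counts = Counter()
    for first in combinations(ranks, n):
        chosen = set(first)
        second = [r for r in ranks if r not in chosen]
        # Round so that equal values of T are grouped despite float noise.
        counts[round(two_sample_cvm(first, second), 12)] += 1
    n_splits = sum(counts.values())
    return {t: c / n_splits for t, c in sorted(counts.items())}


if __name__ == "__main__":
    print(two_sample_cvm([1, 2], [3, 4]))
    print(exact_null_distribution(2, 2))
```

For $N = M = 2$ the enumeration covers the six possible splits. An exact tail probability for an observed value of $T$ can be read off by summing the probabilities of all values at least as large.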
Citations
•Dissertation
Traitements avancés pour l’augmentation de la disponibilité et de l’intégrité de la mesure de vitesse 3D par LiDAR, dans le domaine aéronautique.
Gregory Baral-Baron
- 16 Jul 2014
TL;DR: Thales is developing an aircraft-mounted laser Doppler anemometer, composed of four LiDAR (Light Detection And Ranging) axes distributed around the aircraft, that estimates airspeed by analyzing the reflection of the emitted laser wave off particles present in the air.
•Dissertation
Image Analysis Applications of the Maximum Mean Discrepancy Distance Measure
Michael Diu
- 23 May 2013
TL;DR: It is proposed that dissimilarity-based classification and changepoint detection using MMD can lead to enhanced separation between different populations, and improvements over the difference of means, measured primarily using precision/recall for scene change detection, and k-nearest neighbour classification accuracy for tumor response assessment, are obtained.
Constraining ozone-precursor responsiveness using ambient measurements
TL;DR: In this paper, uncertainties in model formulations and input parameters are jointly considered to identify factors that strongly influence ozone (O3) concentrations and sensitivities in the Dallas-Fort Worth region in Texas.
•Posted Content
Two Sample Testing in High Dimension via Maximum Mean Discrepancy
Hanjia Gao, Xiaofeng Shao +1 more
TL;DR: In this paper, the authors investigate the behavior of the sample MMD in a high-dimensional environment and develop a new studentized test statistic, which can detect difference between two distributions in the moderately high dimensional regime.
Forward cycle time distributions for returnable transport items
Barry R. Cobb, Linda Li +1 more
- 17 Sep 2021
TL;DR: In this article, the authors used an adaptive exponential smoothing method that accounts for seasonality to forecast the parameters of a lognormal distribution for FCT in future periods, which is used to calculate discrete FCT probabilities, estimate container returns, and calculate the estimated mean and standard deviation of cycle time for periods with incomplete data.