Null distribution

Papers published on a yearly basis

Papers

Journal Article•10.1016/0304-4076(92)90104-Y•

Testing the null hypothesis of stationarity against the alternative of a unit root: How sure are we that economic time series have a unit root?

Denis Kwiatkowski¹, Peter C.B. Phillips², Peter Schmidt³, Yongcheol Shin³•Institutions (3)

Central Michigan University¹, Yale University², Michigan State University³

01 Oct 1992-Journal of Econometrics

TL;DR: This paper proposes a test of the null hypothesis that an observable series is stationary around a deterministic trend, where the series is expressed as the sum of a deterministic trend, a random walk, and a stationary error.
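The decomposition named in the summary above can be written out as a short worked model. The symbols below are illustrative notation chosen for this sketch, not taken from the listing; under this reading, the stationarity null corresponds to the random-walk innovation having zero variance.

```latex
% Components model: deterministic trend + random walk + stationary error
y_t = \xi t + r_t + \varepsilon_t, \qquad t = 1, \dots, T
% Random-walk component with i.i.d. innovations u_t of mean 0 and variance \sigma_u^2
r_t = r_{t-1} + u_t
% Null hypothesis of trend stationarity versus the unit-root alternative
H_0 : \sigma_u^2 = 0 \qquad \text{against} \qquad H_1 : \sigma_u^2 > 0
```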

12,931 citations

Journal Article•10.1080/01621459.1988.10478722•

A Test of Missing Completely at Random for Multivariate Data with Missing Values

Roderick J. A. Little¹•Institutions (1)

University of California, Los Angeles¹

01 Jan 1988-Journal of the American Statistical Association

TL;DR: In this article, the author proposes a single global test statistic for whether multivariate data with missing values are missing completely at random (MCAR), that is, whether missingness depends on the variables in the data set.

Abstract: A common concern when faced with multivariate data with missing values is whether the missing data are missing completely at random (MCAR); that is, whether missingness depends on the variables in the data set. One way of assessing this is to compare the means of recorded values of each variable between groups defined by whether other variables in the data set are missing or not. Although informative, this procedure yields potentially many correlated statistics for testing MCAR, resulting in multiple-comparison problems. This article proposes a single global test statistic for MCAR that uses all of the available data. The asymptotic null distribution is given, and the small-sample null distribution is derived for multivariate normal data with a monotone pattern of missing data. The test reduces to a standard t test when the data are bivariate with missing data confined to a single variable. A limited simulation study of empirical sizes for the test applied to normal and nonnormal data suggests th...
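The abstract notes that the test reduces to a standard t test when the data are bivariate with missing values confined to one variable. A minimal sketch of that special case follows, assuming "standard t test" means the pooled-variance two-sample t test and that missing values are encoded as `None`. The function name `mcar_t_statistic` is hypothetical, and the general global statistic from the article is not implemented here.

```python
import math
from statistics import mean, variance


def mcar_t_statistic(x, y):
    """Compare the mean of x between rows where y is observed and rows where y is missing.

    Returns the pooled-variance two-sample t statistic and its degrees of freedom.
    Missing y values are encoded as None (an assumption of this sketch).
    Each group needs at least two rows so that a sample variance exists.
    """
    x_obs = [xi for xi, yi in zip(x, y) if yi is not None]
    x_mis = [xi for xi, yi in zip(x, y) if yi is None]
    n1, n2 = len(x_obs), len(x_mis)
    df = n1 + n2 - 2
    # Pooled variance weights each group's sample variance by its degrees of freedom.
    pooled_var = ((n1 - 1) * variance(x_obs) + (n2 - 1) * variance(x_mis)) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean(x_obs) - mean(x_mis)) / se, df


# Toy data: x is fully observed, y is missing in the last three rows.
x = [1.0, 2.0, 3.0, 2.0, 3.0, 4.0]
y = [0.5, 0.1, 0.9, None, None, None]
t_stat, df = mcar_t_statistic(x, y)
```

A large absolute t statistic relative to the t distribution with `df` degrees of freedom would be evidence against MCAR in this bivariate setting.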

7,613 citations

Journal Article•10.1111/1467-9868.00293•

Estimating the number of clusters in a data set via the gap statistic

Robert Tibshirani¹, Guenther Walther¹, Trevor Hastie¹•Institutions (1)

Stanford University¹

01 Jan 2001-Journal of The Royal Statistical Society Series B-statistical Methodology

TL;DR: In this paper, the authors proposed a method called the "gap statistic" for estimating the number of clusters (groups) in a set of data, which uses the output of any clustering algorithm (e.g. K-means or hierarchical), comparing the change in within-cluster dispersion with that expected under an appropriate reference null distribution.

Abstract: We propose a method (the ‘gap statistic’) for estimating the number of clusters (groups) in a set of data. The technique uses the output of any clustering algorithm (e.g. K-means or hierarchical), comparing the change in within-cluster dispersion with that expected under an appropriate reference null distribution. Some theory is developed for the proposal and a simulation study shows that the gap statistic usually outperforms other methods that have been proposed in the literature.
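The comparison described in the abstract can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the authors' implementation: the data are one-dimensional, the clustering algorithm is a plain Lloyd-style k-means written here for self-containment, the reference null distribution is uniform over the observed range, and the names `kmeans_wss` and `gap_statistic` are hypothetical. The gap is taken as the average log within-cluster dispersion of the reference samples minus the log dispersion of the data.

```python
import math
import random


def kmeans_wss(xs, k, iters=50):
    """Lloyd-style k-means on 1-D data; returns the within-cluster sum of squares."""
    s = sorted(xs)
    n = len(s)
    # Deterministic initialisation: centres at evenly spaced quantiles of the data.
    centres = [s[(2 * j + 1) * n // (2 * k)] for j in range(k)]
    groups = [[] for _ in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for x in xs:
            nearest = min(range(k), key=lambda c: (x - centres[c]) ** 2)
            groups[nearest].append(x)
        # An empty cluster keeps its previous centre.
        new = [sum(g) / len(g) if g else centres[j] for j, g in enumerate(groups)]
        if new == centres:
            break
        centres = new
    return sum((x - centres[j]) ** 2 for j, g in enumerate(groups) for x in g)


def gap_statistic(xs, k, n_ref=20, seed=0):
    """Mean log dispersion of uniform reference samples minus log dispersion of the data."""
    rng = random.Random(seed)
    lo, hi = min(xs), max(xs)
    log_w = math.log(kmeans_wss(xs, k))
    ref_logs = []
    for _ in range(n_ref):
        ref = [rng.uniform(lo, hi) for _ in xs]
        ref_logs.append(math.log(kmeans_wss(ref, k)))
    return sum(ref_logs) / n_ref - log_w


# Toy data with two well-separated groups.
data = [0.0, 0.3, 0.5, 0.8, 1.0, 1.2, 10.0, 10.2, 10.5, 10.7, 11.0, 11.3]
gaps = {k: gap_statistic(data, k) for k in (1, 2, 3)}
```

On data like this, the gap for two clusters should exceed the gap for one cluster, because the data's dispersion drops far more sharply at k = 2 than a uniform reference sample's does. The sketch omits the standard-error adjustment used when choosing the final number of clusters.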

6,064 citations

Journal Article•10.1016/S0304-4076(98)00023-2•

Spurious regression and residual-based tests for cointegration in panel data

Chihwa Kao¹•Institutions (1)

Syracuse University¹

01 May 1999-Journal of Econometrics

TL;DR: According to this paper, the null distribution of residual-based cointegration tests in panel data depends on the asymptotics of the least-squares dummy variable (LSDV) estimator and other conventional statistics.

5,308 citations

Journal Article•10.1038/NPRE.2010.4282.1•

Differential expression analysis for sequence count data

Simon Anders¹, Wolfgang Huber¹•Institutions (1)

European Bioinformatics Institute¹

15 Mar 2010-Nature Precedings

TL;DR: An error model that uses the negative binomial distribution, with variance and mean linked by local regression, to model the null distribution of the count data is proposed and provides good detection power.

Abstract: Motivation: High throughput nucleotide sequencing provides quantitative readouts in assays for RNA expression (RNA-Seq), protein-DNA binding (ChIP-Seq), and cell counting. Statistical inference of differential signal in these data needs to take into account their natural variability throughout the dynamic range. When the number of replicates is small, error modeling is needed to achieve statistical power. Results: We propose an error model that uses the negative binomial distribution, with variance and mean linked by local regression, to model the null distribution of the count data. The method controls type-I error and provides good detection power. Availability: A free open-source R/Bioconductor software package, called "DESeq", is available from http://www-huber.embl.de/users/anders/DESeq

2,182 citations

Year	Papers
2025	4
2024	5
2023	49
2022	76
2021	94
2020	91
