Standard deviation

Topic Tools

Papers published on a yearly basis

1 / 3

Papers

Journal Article•10.1162/003355304772839588•

How Much Should We Trust Differences-In-Differences Estimates?

[...]

Marianne Bertrand¹, Esther Duflo², Sendhil Mullainathan²•Institutions (2)

University of Chicago¹, National Bureau of Economic Research²

01 Feb 2004-Quarterly Journal of Economics

TL;DR: In this article, the authors randomly generate placebo laws in state-level data on female wages from the Current Population Survey and use OLS to compute the DD estimate of its "effect" as well as the standard error of this estimate.

...read moreread less

Abstract: Most papers that employ Differences-in-Differences estimation (DD) use many years of data and focus on serially correlated outcomes but ignore that the resulting standard errors are inconsistent. To illustrate the severity of this issue, we randomly generate placebo laws in state-level data on female wages from the Current Population Survey. For each law, we use OLS to compute the DD estimate of its “effect” as well as the standard error of this estimate. These conventional DD standard errors severely understate the standard deviation of the estimators: we find an “effect” significant at the 5 percent level for up to 45 percent of the placebo interventions. We use Monte Carlo simulations to investigate how well existing methods help solve this problem. Econometric corrections that place a specific parametric form on the time-series process do not perform well. Bootstrap (taking into account the autocorrelation of the data) works well when the number of states is large enough. Two corrections based on asymptotic approximation of the variance-covariance matrix work well for moderate numbers of states and one correction that collapses the time series information into a “pre”- and “post”-period and explicitly takes into account the effective sample size works well even for small numbers of states.

...read moreread less

11,946 citations

Journal Article•10.1177/096228029900800204•

Measuring agreement in method comparison studies

[...]

J M Bland¹, Douglas G. Altman test²•Institutions (2)

St George's Hospital¹, University of Oxford²

01 Apr 1999-Statistical Methods in Medical Research

TL;DR: The 95% limits of agreement, estimated by mean difference 1.96 standard deviation of the differences, provide an interval within which 95% of differences between measurements by the two methods are expected to lie.

...read moreread less

Abstract: Agreement between two methods of clinical measurement can be quantified using the differences between observations made using the two methods on the same subjects. The 95% limits of agreement, estimated by mean difference +/- 1.96 standard deviation of the differences, provide an interval within which 95% of differences between measurements by the two methods are expected to lie. We describe how graphical methods can be used to investigate the assumptions of the method and we also give confidence intervals. We extend the basic approach to data where there is a relationship between difference and magnitude, both with a simple logarithmic transformation approach and a new, more general, regression approach. We discuss the importance of the repeatability of each method separately and compare an estimate of this to the limits of agreement. We extend the limits of agreement approach to data with repeated measurements, proposing new estimates for equal numbers of replicates by each method on each subject, for unequal numbers of replicates, and for replicated data collected in pairs, where the underlying value of the quantity being measured is changing. Finally, we describe a nonparametric approach to comparing methods.

...read moreread less

8,970 citations

Journal Article•10.1186/1471-2288-5-13•

Estimating the mean and variance from the median, range, and the size of a sample.

[...]

Stela Pudar Hozo¹, Benjamin Djulbegovic², Iztok Hozo¹•Institutions (2)

Indiana University¹, University of South Florida²

20 Apr 2005-BMC Medical Research Methodology

TL;DR: Two simple formulas are found that estimate the mean using the values of the median, low and high end of the range, and n (the sample size) and these hope to help meta-analysts use clinical trials in their analysis even when not all of the information is available and/or reported.

...read moreread less

Abstract: Usually the researchers performing meta-analysis of continuous outcomes from clinical trials need their mean value and the variance (or standard deviation) in order to pool data. However, sometimes the published reports of clinical trials only report the median, range and the size of the trial. In this article we use simple and elementary inequalities and approximations in order to estimate the mean and the variance for such trials. Our estimation is distribution-free, i.e., it makes no assumption on the distribution of the underlying data. We found two simple formulas that estimate the mean using the values of the median (m), low and high end of the range (a and b, respectively), and n (the sample size). Using simulations, we show that median can be used to estimate mean when the sample size is larger than 25. For smaller samples our new formula, devised in this paper, should be used. We also estimated the variance of an unknown sample using the median, low and high end of the range, and the sample size. Our estimate is performing as the best estimate in our simulations for very small samples (n ≤ 15). For moderately sized samples (15 70), the formula range/6 gives the best estimator for the standard deviation (variance). We also include an illustrative example of the potential value of our method using reports from the Cochrane review on the role of erythropoietin in anemia due to malignancy. Using these formulas, we hope to help meta-analysts use clinical trials in their analysis even when not all of the information is available and/or reported.

...read moreread less

8,602 citations

Journal Article•10.1186/1471-2288-14-135•

Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range

[...]

Xiang Wan¹, Wenqian Wang², Jiming Liu¹, Tiejun Tong¹•Institutions (2)

Hong Kong Baptist University¹, Northwestern University²

19 Dec 2014-BMC Medical Research Methodology

TL;DR: In this article, the authors proposed a new estimation method by incorporating the sample size and compared the estimators of the sample mean and standard deviation under all three scenarios and presented some suggestions on which scenario is preferred in real-world applications.

...read moreread less

Abstract: In systematic reviews and meta-analysis, researchers often pool the results of the sample mean and standard deviation from a set of similar clinical trials. A number of the trials, however, reported the study using the median, the minimum and maximum values, and/or the first and third quartiles. Hence, in order to combine results, one may have to estimate the sample mean and standard deviation for such trials. In this paper, we propose to improve the existing literature in several directions. First, we show that the sample standard deviation estimation in Hozo et al.’s method (BMC Med Res Methodol 5:13, 2005) has some serious limitations and is always less satisfactory in practice. Inspired by this, we propose a new estimation method by incorporating the sample size. Second, we systematically study the sample mean and standard deviation estimation problem under several other interesting settings where the interquartile range is also available for the trials. We demonstrate the performance of the proposed methods through simulation studies for the three frequently encountered scenarios, respectively. For the first two scenarios, our method greatly improves existing methods and provides a nearly unbiased estimate of the true sample standard deviation for normal data and a slightly biased estimate for skewed data. For the third scenario, our method still performs very well for both normal data and skewed data. Furthermore, we compare the estimators of the sample mean and standard deviation under all three scenarios and present some suggestions on which scenario is preferred in real-world applications. In this paper, we discuss different approximation methods in the estimation of the sample mean and standard deviation and propose some new estimation methods to improve the existing literature. We conclude our work with a summary table (an Excel spread sheet including all formulas) that serves as a comprehensive guidance for performing meta-analysis in different situations.

...read moreread less

8,014 citations

Journal Article•10.1561/104.00000024•

The Cross-section of Expected Stock Returns

[...]

Jonathan Lewellen¹•Institutions (1)

National Bureau of Economic Research¹

28 Jun 2015

TL;DR: In this paper, the cross-sectional properties of return forecasts derived from Fama-MacBeth regressions were studied, and the authors found that the forecasts vary substantially across stocks and have strong predictive power for actual returns.

...read moreread less

Abstract: This paper studies the cross-sectional properties of return forecasts derived from Fama-MacBeth regressions. These forecasts mimic how an investor could, in real time, combine many firm characteristics to obtain a composite estimate of a stockâs expected return. Empirically, the forecasts vary substantially across stocks and have strong predictive power for actual returns. For example, using ten-year rolling estimates of Fama- MacBeth slopes and a cross-sectional model with 15 firm characteristics (all based on low-frequency data), the expected-return estimates have a cross-sectional standard deviation of 0.87% monthly and a predictive slope for future monthly returns of 0.74, with a standard error of 0.07.

...read moreread less

5,274 citations

...

Expand

Year	Papers
2026	3
2025	224
2024	486
2023	1,268
2022	2,078
2021	600

Topic Tools

Papers published on a yearly basis

Papers

How Much Should We Trust Differences-In-Differences Estimates?

Measuring agreement in method comparison studies

Estimating the mean and variance from the median, range, and the size of a sample.

Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range

The Cross-section of Expected Stock Returns

Related Topics (5)

Performance Metrics