Type I and type II errors

Papers published on a yearly basis

Papers

Journal Article•10.1037/1082-989X.7.1.83•

A comparison of methods to test mediation and other intervening variable effects.

David P. MacKinnon¹, Chondra M. Lockwood¹, Jeanne M. Hoffman¹, Stephen G. West¹, Virgil L. Sheets¹•Institutions (1)

Arizona State University¹

01 Mar 2002-Psychological Methods

TL;DR: A Monte Carlo study compared 14 methods to test the statistical significance of the intervening variable effect and found that two methods based on the distribution of the product and two difference-in-coefficients methods have the most accurate Type I error rates and greatest statistical power, except in one important case in which Type I error rates are too high.

Abstract: A Monte Carlo study compared 14 methods to test the statistical significance of the intervening variable effect. An intervening variable (mediator) transmits the effect of an independent variable to a dependent variable. The commonly used R. M. Baron and D. A. Kenny (1986) approach has low statistical power. Two methods based on the distribution of the product and 2 difference-in-coefficients methods have the most accurate Type I error rates and greatest statistical power except in 1 important case in which Type I error rates are too high. The best balance of Type I error and statistical power across all cases is the test of the joint significance of the two effects comprising the intervening variable effect.

9,453 citations

Journal Article•10.1111/J.2041-210X.2009.00001.X•

A protocol for data exploration to avoid common statistical problems

Alain F. Zuur¹, Elena N. Ieno¹, Chris S. Elphick²•Institutions (2)

University of Aberdeen¹, University of Connecticut²

01 Mar 2010-Methods in Ecology and Evolution

TL;DR: A protocol for data exploration is provided; current tools to detect outliers, heterogeneity of variance, collinearity, dependence of observations, problems with interactions, double zeros in multivariate analysis, zero inflation in generalized linear modelling, and the correct type of relationships between dependent and independent variables are discussed; and advice on how to address these problems when they arise is provided.

Abstract: Summary

1. While teaching statistics to ecologists, the lead authors of this paper have noticed common statistical problems. If a random sample of their work (including scientific papers) produced before doing these courses were selected, half would probably contain violations of the underlying assumptions of the statistical techniques employed.

2. Some violations have little impact on the results or ecological conclusions; yet others increase type I or type II errors, potentially resulting in wrong ecological conclusions. Most of these violations can be avoided by applying better data exploration. These problems are especially troublesome in applied ecology, where management and policy decisions are often at stake.

3. Here, we provide a protocol for data exploration; discuss current tools to detect outliers, heterogeneity of variance, collinearity, dependence of observations, problems with interactions, double zeros in multivariate analysis, zero inflation in generalized linear modelling, and the correct type of relationships between dependent and independent variables; and provide advice on how to address these problems when they arise. We also address misconceptions about normality, and provide advice on data transformations.

4. Data exploration avoids type I and type II errors, among other problems, thereby reducing the chance of making wrong ecological conclusions and poor recommendations. It is therefore essential for good quality management and policy based on statistical analyses.

7,417 citations

Journal Article•10.1111/1467-9868.00346•

A direct approach to false discovery rates

John D. Storey¹•Institutions (1)

Stanford University¹

01 Aug 2002-Journal of the Royal Statistical Society, Series B (Statistical Methodology)

TL;DR: The q-value, the pFDR analogue of the p-value, is discussed; it eliminates the need to set the error rate beforehand as is traditionally done, and numerical examples show that the proposed approach can yield an increase of over eight times in power compared with the Benjamini–Hochberg FDR method.

Abstract: Summary. Multiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for a single-hypothesis test, a compound error rate is controlled for multiple-hypothesis tests. For example, controlling the false discovery rate FDR traditionally involves intricate sequential p-value rejection methods based on the observed data. Whereas a sequential p-value method fixes the error rate and estimates its corresponding rejection region, we propose the opposite approach—we fix the rejection region and then estimate its corresponding error rate. This new approach offers increased applicability, accuracy and power. We apply the methodology to both the positive false discovery rate pFDR and FDR, and provide evidence for its benefits. It is shown that pFDR is probably the quantity of interest over FDR. Also discussed is the calculation of the q-value, the pFDR analogue of the p-value, which eliminates the need to set the error rate beforehand as is traditionally done. Some simple numerical examples are presented that show that this new approach can yield an increase of over eight times in power compared with the Benjamini–Hochberg FDR method.

6,168 citations
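
For orientation, the Benjamini–Hochberg method that the abstract above uses as its point of comparison can be sketched in a few lines. This is a minimal illustration of the standard step-up procedure, not Storey's q-value approach and not code from the paper; the function name `benjamini_hochberg` and the example p-values are hypothetical.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return one boolean per hypothesis: True if rejected at FDR level q.

    Standard step-up rule: sort the p-values, find the largest rank k with
    p_(k) <= (k / m) * q, and reject the k smallest p-values.
    """
    m = len(p_values)
    # Indices of the hypotheses, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            k_max = rank  # keep the largest rank that satisfies the bound

    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected


# Illustrative p-values (made up for this sketch, not taken from any paper).
example_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
decisions = benjamini_hochberg(example_p, q=0.05)
print(sum(decisions), "of", len(example_p), "hypotheses rejected")
```

Here the error rate `q` is fixed in advance and the rejection region follows from the data, which is the sequential style of procedure that the abstract contrasts with fixing the rejection region and estimating the error rate.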

Journal Article•10.1136/BMJ.316.7139.1236•

What's wrong with Bonferroni adjustments

Thomas V. Perneger¹•Institutions (1)

University of Geneva¹

18 Apr 1998-BMJ

TL;DR: This paper advances the view, widely held by epidemiologists, that Bonferroni adjustments are, at best, unnecessary and, at worst, deleterious to sound statistical inference.

Abstract: When more than one statistical test is performed in analysing the data from a clinical study, some statisticians and journal editors demand that a more stringent criterion be used for “statistical significance” than the conventional P<0.05.¹ Many well meaning researchers, eager for methodological rigour, comply without fully grasping what is at stake. Recently, adjustments for multiple tests (or Bonferroni adjustments) have found their way into introductory texts on medical statistics, which has increased their apparent legitimacy. This paper advances the view, widely held by epidemiologists, that Bonferroni adjustments are, at best, unnecessary and, at worst, deleterious to sound statistical inference.

Summary points

Adjusting statistical significance for the number of tests that have been performed on study data—the Bonferroni method—creates more problems than it solves.

The Bonferroni method is concerned with the general null hypothesis (that all null hypotheses are true simultaneously), which is rarely of interest or use to researchers.

The main weakness is that the interpretation of a finding depends on the number of other tests performed.

The likelihood of type II errors is also increased, so that truly important differences are deemed non-significant.

Simply describing what tests of significance have been performed, and why, is generally the best way of dealing with multiple comparisons.

Bonferroni adjustments are based on the following reasoning.¹⁻³ If a null hypothesis is true (for instance, two treatment groups in a randomised trial do not differ in terms of cure rates), a significant difference (P<0.05) will be observed by chance once in 20 trials. This is the type I error, or α. When 20 independent tests are performed (for example, study groups are compared with regard to 20 unrelated variables) and the null hypothesis holds for all 20 comparisons, the chance of at least one test being significant is no longer 0.05, but 0.64 …

5,944 citations
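
The arithmetic in the abstract above (20 independent tests at α = 0.05 giving a 0.64 chance of at least one significant result) can be checked with a short sketch. It assumes independent tests with every null hypothesis true, as the abstract states; the function names `familywise_error_rate` and `bonferroni_alpha` are hypothetical helpers introduced only for illustration.

```python
def familywise_error_rate(alpha: float, n_tests: int) -> float:
    """Chance of at least one false positive among n independent tests,
    assuming every null hypothesis is true: 1 - (1 - alpha)^n."""
    return 1.0 - (1.0 - alpha) ** n_tests


def bonferroni_alpha(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under the Bonferroni adjustment."""
    return alpha / n_tests


# Unadjusted: 20 tests, each at the conventional 0.05 level.
unadjusted = familywise_error_rate(0.05, 20)
print(round(unadjusted, 2))  # 0.64

# Adjusted: each test is run at 0.05 / 20 = 0.0025 instead.
adjusted = familywise_error_rate(bonferroni_alpha(0.05, 20), 20)
print(adjusted < 0.05)  # True
```

The adjustment holds the family-wise rate below 0.05 only by making each individual test much stricter, which is the source of the increased type II error risk listed in the summary points.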

Journal Article•10.1097/00001648-199001000-00010•

No adjustments are needed for multiple comparisons.

Kenneth J. Rothman

01 Jan 1990-Epidemiology

TL;DR: A policy of not making adjustments for multiple comparisons is preferable because it will lead to fewer errors of interpretation when the data under evaluation are not random numbers but actual observations on nature.

Abstract: Adjustments for making multiple comparisons in large bodies of data are recommended to avoid rejecting the null hypothesis too readily. Unfortunately, reducing the type I error for null associations increases the type II error for those associations that are not null. The theoretical basis for advocating a routine adjustment for multiple comparisons is the "universal null hypothesis" that "chance" serves as the first-order explanation for observed phenomena. This hypothesis undermines the basic premises of empirical research, which holds that nature follows regular laws that may be studied through observations. A policy of not making adjustments for multiple comparisons is preferable because it will lead to fewer errors of interpretation when the data under evaluation are not random numbers but actual observations on nature. Furthermore, scientists should not be so reluctant to explore leads that may turn out to be wrong that they penalize themselves by missing possibly important findings.

5,590 citations

Year	Papers
2026	1
2025	54
2024	87
2023	208
2022	331
2021	234
