A Primer on Visualizations for Comparing Populations,Including the Issue of Overlapping Confidence Intervals |
| |
Authors: | Tommy Wright Martin Klein Jerzy Wieczorek |
| |
Affiliation: | 1. Center for Statistical Research and Methodology, U.S. Bureau of Census, Washington, DC;2. Mathematics and Statistics Department, Georgetown University, Washington, DCtommy.wright@census.gov;4. Department of Mathematics and Statistics, University of Maryland Baltimore County, Baltimore, MD;5. Department of Mathematics and Statistics, Colby College, Waterville, ME |
| |
Abstract: | In comparing a collection of K populations, it is common practice to display in one visualization confidence intervals for the corresponding population parameters θ1, θ2, …, θK. For a pair of confidence intervals that do (or do not) overlap, viewers of the visualization are cognitively compelled to declare that there is not (or there is) a statistically significant difference between the two corresponding population parameters. It is generally well known that the method of examining overlap of pairs of confidence intervals should not be used for formal hypothesis testing. However, use of a single visualization with overlapping and nonoverlapping confidence intervals leads many to draw such conclusions, despite the best efforts of statisticians toward preventing users from reaching such conclusions. In this article, we summarize some alternative visualizations from the literature that can be used to properly test equality between a pair of population parameters. We recommend that these visualizations be used with caution to avoid incorrect statistical inference. The methods presented require only that we have K sample estimates and their associated standard errors. We also assume that the sample estimators are independent, unbiased, and normally distributed. |
| |
Keywords: | Confidence intervals Overlap Two-sample test Visualizations |
|
|