Post hoc false positive control for structured hypotheses |
| |
Authors: | Guillermo Durand Gilles Blanchard Pierre Neuvial Etienne Roquain |
| |
Institution: | 1. Laboratoire de probabilités Statistique et Modélisation, LPSM, Sorbonne Université, France;2. Laboratoire de Mathématiques d'Orsay, Université Paris-Sud, CNRS, Université Paris-Saclay, France;3. Institut de Mathématiques de Toulouse, UMR 5219, Université de Toulouse, CNRS, UPS IMT, France |
| |
Abstract: | In a high-dimensional multiple testing framework, we present new confidence bounds on the false positives contained in subsets S of selected null hypotheses. These bounds are post hoc in the sense that the coverage probability holds simultaneously over all S, possibly chosen depending on the data. This article focuses on the common case of structured null hypotheses, for example, along a tree, a hierarchy, or geometrically (spatially or temporally). Following recent advances in post hoc inference, we build confidence bounds for some prespecified forest-structured subsets and deduce a bound for any subset S by interpolation. The proposed bounds are shown to improve substantially previous ones when the signal is locally structured. Our findings are supported both by theoretical results and numerical experiments. Moreover, our bounds can be obtained by an algorithm (with complexity bilinear in the sizes of the reference hierarchy and of the selected subset) that is implemented in the open-source R package sansSouci available from https://github.com/pneuvial/sanssouci , making our approach operational. |
| |
Keywords: | Dvoretzky Kiefer Wolfowitz inequality forest structure multiple testing post hoc bound selective inference |
|
|