Sequential Correct Screening and Post-Screening Inference

stat.ME arXiv:2508.14596
View PDF arXiv JSON

Abstract

Selecting the top-$m$ variables with the $m$ largest population parameters from a larger set of candidates is a fundamental problem in statistics. In this paper, we propose a novel methodology called Sequential Correct Screening (SCS), which sequentially screens out variables that are not among the top-$m$. A key feature of our method is its anytime validity; it provides a sequence of variable subsets that, with high probability, always contain the true top-$m$ variables. Furthermore, we develop a post-screening inference (PSI) procedure to construct confidence intervals for the selected parameters. Importantly, this procedure is designed to control the false coverage rate (FCR) whenever it is conducted -- an aspect that has been largely overlooked in the existing literature. We establish theoretical guarantees for both SCS and PSI, and demonstrate their performance through simulation studies and an application to a real-world dataset on suicide rates.

PDF Viewer