Anytime-valid t-tests and confidence sequences for Gaussian means with unknown variance

Read original: arXiv:2310.03722 - Published 5/15/2024 by Hongjian Wang, Aaditya Ramdas

Overview

In 1976, Lai constructed a nontrivial confidence sequence for the mean of a Gaussian distribution with unknown variance.
Lai's construction mixes over both parameters with improper distributions: an improper (right Haar) mixture over the standard deviation and an improper (flat) mixture over the mean.
This method uses generalized nonintegrable martingales and an extended Ville's inequality, resulting in a sequential t-test.
However, this approach does not yield an e-process (due to the nonintegrability of the martingale).
This paper develops two new e-processes and confidence sequences for the same setting:
1. A test martingale in a reduced filtration
2. An e-process in the canonical data filtration

Plain English Explanation

In this paper, the researchers study the problem of estimating the mean of a Gaussian (normal) distribution when the variance is unknown. This is a common situation in statistics and data analysis.

Back in 1976, a researcher named Lai came up with a method to solve this problem. Lai's approach involved using a mixture of improper distributions - that is, weightings whose total area under the curve is infinite, so they cannot be normalized into probability distributions. Specifically, he used an improper "right Haar" mixture for the standard deviation and an improper "flat" mixture for the mean.

Lai's method uses some advanced mathematical techniques, including generalized nonintegrable martingales and an extended version of Ville's inequality. The end result is a sequential t-test, which lets you keep collecting measurements and check the evidence after each one without inflating the error rate.

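However, Lai's approach has a downside - it doesn't produce what's called an "e-process", a running measure of evidence against a hypothesis that stays valid no matter when you stop collecting data. The reason is that Lai's martingale is nonintegrable.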

In this new paper, the researchers develop two alternative approaches that do produce valid e-processes for the same problem. One approach uses a Gaussian mixture instead of Lai's improper mixtures, and the other uses the maximum likelihood estimate of the variance instead of the right Haar mixture.

The researchers also analyze the width of the confidence sequences (the ranges of values that likely contain the true mean) produced by these new methods. They find that the width depends polynomially on the error probability, prove that this dependence is unavoidable, and show that for the universal inference approach it is even better than that of the classical fixed-sample t-test.

Throughout the paper, the researchers provide numerical experiments to compare the different approaches and provide insights into their strengths and weaknesses.

Technical Explanation

The paper starts by describing Lai's 1976 construction of a nontrivial confidence sequence for the mean $\mu$ of a Gaussian distribution with unknown variance $\sigma^2$. Lai used both an improper (right Haar) mixture over $\sigma$ and an improper (flat) mixture over $\mu$.

The authors then elaborate on the details of Lai's construction, which involves the use of generalized nonintegrable martingales and an extended Ville's inequality. This approach yields a sequential t-test, but does not produce an e-process due to the nonintegrability of Lai's martingale.
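For orientation, the standard definitions behind these terms can be written as follows. This is a sketch in generic notation (the symbols $\mathcal{P}$, $\mathcal{F}_t$, $M_t$, $E_t$ and $\tau$ are assumptions chosen here, not necessarily the paper's), and it states only the classical Ville's inequality, not the extended version used for Lai's nonintegrable martingale.

```latex
% Null hypothesis: a family \mathcal{P} of distributions, e.g.
% \{ N(\mu_0, \sigma^2) : \sigma > 0 \} for a fixed candidate mean \mu_0.
\begin{align*}
&\text{Test martingale:} && M_t \ge 0,\quad M_0 = 1,\quad
  \mathbb{E}_P[M_t \mid \mathcal{F}_{t-1}] = M_{t-1}
  \quad \text{for all } P \in \mathcal{P}, \\
&\text{E-process:} && E_t \ge 0,\quad \mathbb{E}_P[E_\tau] \le 1
  \quad \text{for all stopping times } \tau \text{ and all } P \in \mathcal{P}, \\
&\text{Ville's inequality:} && P\bigl(\exists\, t : M_t \ge 1/\alpha\bigr) \le \alpha
  \quad \text{for all } P \in \mathcal{P},\ \alpha \in (0,1).
\end{align*}
```

The same bound holds with $E_t$ in place of $M_t$, which is what turns an e-process into a level-$\alpha$ sequential test. A process with infinite expectation cannot satisfy the bound $\mathbb{E}_P[E_\tau] \le 1$, which is consistent with the statement that Lai's nonintegrable martingale is not an e-process.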

To address this, the authors develop two new e-processes and confidence sequences for the same setting:

1. A test martingale in a reduced filtration, obtained by swapping Lai's flat mixture for a Gaussian mixture.
2. An e-process in the canonical data filtration, obtained by swapping the right Haar mixture over $\sigma$ with the maximum likelihood estimate under the null, as done in universal inference.
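The universal inference idea behind the second construction can be sketched in Python. This is a minimal illustration under stated assumptions, not the paper's exact construction: the numerator is a product of Gaussian predictive densities whose mean and variance are fitted to strictly earlier observations (the particular smoothing rule, and the names `ui_log_e_values`, `confidence_sequence`, `init_mean`, `init_var` and `grid`, are hypothetical choices made here), and the denominator is the Gaussian likelihood maximized over $\sigma$ with the mean fixed at the null value.

```python
import math
import random


def ui_log_e_values(xs, mu0, init_mean=0.0, init_var=1.0):
    """Running log e-values for H0: mean == mu0 with the variance unknown.

    Numerator: Gaussian predictive densities fitted to strictly earlier data
    (an assumed plug-in rule). Denominator: likelihood maximized over sigma
    with the mean fixed at mu0 (the null maximum likelihood estimate).
    """
    log_num = 0.0        # log of the plug-in (predictive) likelihood
    total = 0.0          # running sum of observations
    total_sq = 0.0       # running sum of squared observations
    sq_dev_null = 0.0    # running sum of (x - mu0)^2
    mean_hat, var_hat = init_mean, init_var
    out = []
    for n, x in enumerate(xs, start=1):
        # Score x with parameters chosen before seeing it (predictability).
        log_num += (-0.5 * math.log(2 * math.pi * var_hat)
                    - (x - mean_hat) ** 2 / (2 * var_hat))
        sq_dev_null += (x - mu0) ** 2
        var_null = sq_dev_null / n
        if var_null > 0.0:
            # Maximized null log-likelihood: -(n/2) * (log(2*pi*var_null) + 1).
            log_den = -0.5 * n * (math.log(2 * math.pi * var_null) + 1.0)
            out.append(log_num - log_den)
        else:
            # Every observation equals mu0: null likelihood is unbounded.
            out.append(float("-inf"))
        # Update the plug-in estimates, smoothed by one pseudo-observation.
        total += x
        total_sq += x * x
        mean_hat = (init_mean + total) / (n + 1)
        sq_dev = total_sq - 2 * mean_hat * total + n * mean_hat ** 2
        var_hat = (init_var + max(sq_dev, 0.0)) / (n + 1)
    return out


def confidence_sequence(xs, grid, alpha=0.05):
    """Grid values of mu0 whose e-process never reached 1/alpha so far."""
    threshold = math.log(1.0 / alpha)
    return [m for m in grid if max(ui_log_e_values(xs, m)) < threshold]


random.seed(0)
data = [random.gauss(1.0, 2.0) for _ in range(500)]
log_e = ui_log_e_values(data, mu0=0.0)
grid = [i / 20 for i in range(-40, 81)]
kept = confidence_sequence(data, grid)
print("final log e-value against mu0 = 0:", log_e[-1])
print(len(kept), "of", len(grid), "grid points remain in the confidence set")
```

Under any null distribution $N(\mu_0, \sigma^2)$, the maximized denominator is at least the true likelihood, so the ratio is bounded above by a nonnegative martingale that starts at 1. That bound is what makes this a valid e-process for any predictable plug-in rule; the rule only affects how quickly evidence accumulates.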

The authors then analyze the width of the resulting confidence sequences, which they find to have a curious polynomial dependence on the error probability $\alpha$. They prove that this polynomial dependence is not only unavoidable, but (for universal inference) even better than the classical fixed-sample t-test.
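To see why a polynomial dependence on $\alpha$ is natural here, it helps to recall the classical fixed-sample t-interval. The sketch below uses standard textbook facts with assumed notation ($\bar{X}_n$ for the sample mean, $s_n$ for the sample standard deviation, $t_{\nu,q}$ and $z_q$ for Student-t and standard normal quantiles); it does not reproduce the paper's rates for the new confidence sequences.

```latex
\begin{align*}
&\text{Fixed-sample t-interval:} &&
  \bar{X}_n \pm t_{n-1,\,1-\alpha/2}\,\frac{s_n}{\sqrt{n}}, \\
&\text{Student-t tail with } \nu \text{ degrees of freedom:} &&
  P\bigl(|T_\nu| > x\bigr) \asymp x^{-\nu} \quad (x \to \infty), \\
&\text{so for fixed } \nu = n - 1: &&
  t_{\nu,\,1-\alpha/2} \asymp \alpha^{-1/\nu} \quad (\alpha \to 0), \\
&\text{whereas with known variance:} &&
  z_{1-\alpha/2} \sim \sqrt{2\log(1/\alpha)} \quad (\alpha \to 0).
\end{align*}
```

With the variance unknown and the sample size fixed, the critical value grows like a power of $1/\alpha$ rather than like $\sqrt{\log(1/\alpha)}$, which is the benchmark against which the paper compares its confidence sequences.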

Throughout the paper, the authors provide numerical experiments to compare and contrast the various approaches, including some recent suboptimal ones.

Critical Analysis

The paper presents a technical and rigorous analysis of the problem of constructing confidence sequences for the mean of a Gaussian distribution with unknown variance. The authors build upon Lai's previous work and develop two new methods that address the limitations of Lai's approach.

One potential concern is the reliance on advanced mathematical concepts such as generalized nonintegrable martingales and extended Ville's inequality. While the authors provide a detailed technical explanation, these concepts may be challenging for a general audience to fully grasp. The authors could have provided more intuitive explanations or analogies to help readers better understand the underlying ideas.

Additionally, it is Lai's original construction, not the authors' new methods, that fails to yield an e-process, which is a useful tool for sequential inference. The two new constructions close this gap, although the first is a test martingale only in a reduced filtration rather than in the canonical data filtration. It would be interesting to see whether that restriction can be relaxed while maintaining the desirable properties of the confidence sequences.

The paper's focus on the mathematical properties of the confidence sequences, such as the polynomial dependence on the error probability, is intriguing. However, the practical implications of these findings may not be immediately clear to a general audience. The authors could have discussed potential applications or real-world scenarios where these theoretical results may be particularly useful.

Overall, the paper presents a solid technical contribution to the field of sequential inference, but could benefit from additional efforts to make the concepts more accessible and to highlight the potential practical applications of the research.

Conclusion

This paper builds upon previous work by Lai to develop new methods for constructing confidence sequences for the mean of a Gaussian distribution with unknown variance. The authors introduce two new e-processes and confidence sequences that address the limitations of Lai's approach, which relied on improper distributions and nonintegrable martingales.

The key contributions of this paper are the development of these new techniques, as well as the analysis of the width of the resulting confidence sequences. The authors prove that this width has a curious polynomial dependence on the error probability that is not only unavoidable, but (for universal inference) even better than that of the classical fixed-sample t-test.

While the technical details of the paper may be challenging for a general audience, the underlying problem and the authors' efforts to improve upon previous work are important contributions to the field of sequential inference. Further research may explore ways to make the concepts more accessible and to identify real-world applications where these theoretical results can be leveraged.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Anytime-valid t-tests and confidence sequences for Gaussian means with unknown variance

Hongjian Wang, Aaditya Ramdas

In 1976, Lai constructed a nontrivial confidence sequence for the mean $\mu$ of a Gaussian distribution with unknown variance $\sigma^2$. Curiously, he employed both an improper (right Haar) mixture over $\sigma$ and an improper (flat) mixture over $\mu$. Here, we elaborate carefully on the details of his construction, which use generalized nonintegrable martingales and an extended Ville's inequality. While this does yield a sequential t-test, it does not yield an e-process (due to the nonintegrability of his martingale). In this paper, we develop two new e-processes and confidence sequences for the same setting: one is a test martingale in a reduced filtration, while the other is an e-process in the canonical data filtration. These are respectively obtained by swapping Lai's flat mixture for a Gaussian mixture, and swapping the right Haar mixture over $\sigma$ with the maximum likelihood estimate under the null, as done in universal inference. We also analyze the width of resulting confidence sequences, which have a curious polynomial dependence on the error probability $\alpha$ that we prove to be not only unavoidable, but (for universal inference) even better than the classical fixed-sample t-test. Numerical experiments are provided along the way to compare and contrast the various approaches, including some recent suboptimal ones.

5/15/2024

Nonparametric extensions of randomized response for private confidence sets

Ian Waudby-Smith, Zhiwei Steven Wu, Aaditya Ramdas

This work derives methods for performing nonparametric, nonasymptotic statistical inference for population means under the constraint of local differential privacy (LDP). Given bounded observations $(X_1, \dots, X_n)$ with mean $\mu^\star$ that are privatized into $(Z_1, \dots, Z_n)$, we present confidence intervals (CI) and time-uniform confidence sequences (CS) for $\mu^\star$ when only given access to the privatized data. To achieve this, we study a nonparametric and sequentially interactive generalization of Warner's famous "randomized response" mechanism, satisfying LDP for arbitrary bounded random variables, and then provide CIs and CSs for their means given access to the resulting privatized observations. For example, our results yield private analogues of Hoeffding's inequality in both fixed-time and time-uniform regimes. We extend these Hoeffding-type CSs to capture time-varying (non-stationary) means, and conclude by illustrating how these methods can be used to conduct private online A/B tests.

7/26/2024

Simultaneous inference for generalized linear models with unmeasured confounders

Jin-Hong Du, Larry Wasserman, Kathryn Roeder

Tens of thousands of simultaneous hypothesis tests are routinely performed in genomic studies to identify differentially expressed genes. However, due to unmeasured confounders, many standard statistical approaches may be substantially biased. This paper investigates the large-scale hypothesis testing problem for multivariate generalized linear models in the presence of confounding effects. Under arbitrary confounding mechanisms, we propose a unified statistical estimation and inference framework that harnesses orthogonal structures and integrates linear projections into three key stages. It begins by disentangling marginal and uncorrelated confounding effects to recover the latent coefficients. Subsequently, latent factors and primary effects are jointly estimated through lasso-type optimization. Finally, we incorporate projected and weighted bias-correction steps for hypothesis testing. Theoretically, we establish the identification conditions of various effects and non-asymptotic error bounds. We show effective Type-I error control of asymptotic $z$-tests as sample and response sizes approach infinity. Numerical experiments demonstrate that the proposed method controls the false discovery rate by the Benjamini-Hochberg procedure and is more powerful than alternative methods. By comparing single-cell RNA-seq counts from two groups of samples, we demonstrate the suitability of adjusting confounding effects when significant covariates are absent from the model.

4/23/2024

Sampling and estimation on manifolds using the Langevin diffusion

Karthik Bharath, Alexander Lewis, Akash Sharma, Michael V Tretyakov

Error bounds are derived for sampling and estimation using a discretization of an intrinsically defined Langevin diffusion with invariant measure $\mathrm{d}\mu_\phi \propto e^{-\phi}\, \mathrm{dvol}_g$ on a compact Riemannian manifold. Two estimators of linear functionals of $\mu_\phi$ based on the discretized Markov process are considered: a time-averaging estimator based on a single trajectory and an ensemble-averaging estimator based on multiple independent trajectories. Imposing no restrictions beyond a nominal level of smoothness on $\phi$, first-order error bounds, in discretization step size, on the bias and variance/mean-square error of both estimators are derived. The order of error matches the optimal rate in Euclidean and flat spaces, and leads to a first-order bound on distance between the invariant measure $\mu_\phi$ and a stationary measure of the discretized Markov process. This order is preserved even upon using retractions when exponential maps are unavailable in closed form, thus enhancing practicality of the proposed algorithms. Generality of the proof techniques, which exploit links between two partial differential equations and the semigroup of operators corresponding to the Langevin diffusion, renders them amenable for the study of a more general class of sampling algorithms related to the Langevin diffusion. Conditions for extending analysis to the case of non-compact manifolds are discussed. Numerical illustrations with distributions, log-concave and otherwise, on the manifolds of positive and negative curvature elucidate on the derived bounds and demonstrate practical utility of the sampling algorithm.

6/18/2024