Universally Consistent K-Sample Tests via Dependence Measures

Read original: arXiv:1910.08883 - Published 9/17/2024 by Sambit Panda, Cencheng Shen, Ronan Perry, Jelle Zorn, Antoine Lutz, Carey E. Priebe, Joshua T. Vogelstein

✅

Overview

The K-sample testing problem involves determining if K groups of data points come from the same distribution.
Analysis of variance is a common method for testing mean differences, but there are also methods for testing distributional differences.
This paper demonstrates a transformation that allows K-sample testing to be done using any dependence measure.
This enables a wide range of dependence measures to be used for K-sample testing, including universally consistent measures like distance correlation and Hilbert-Schmidt independence criterion.

Plain English Explanation

The K-sample testing problem is about figuring out whether K different groups of data come from the same underlying distribution. This is useful for comparing things like test scores between different classrooms or the effectiveness of different medical treatments.

One common approach is analysis of variance, which looks at whether the average values are different across the groups. However, there are also methods that can test for differences in the overall distribution, not just the average.

The key insight in this paper is that there's a way to transform the K-sample testing problem so that you can use any measure of dependence or similarity between the groups. This is powerful because it means you can leverage a wide range of advanced statistical tools, like distance correlation and Hilbert-Schmidt independence criterion, to test for differences between the groups.

Technical Explanation

The paper shows that the K-sample testing problem can be reduced to a problem of testing the independence between the group labels and the data. By transforming the problem in this way, any dependence measure can be used to perform the K-sample test.

This is significant because it allows the use of universally consistent dependence measures, such as distance correlation and the Hilbert-Schmidt independence criterion. These measures can detect any type of dependence between the groups, making the K-sample tests more powerful and flexible.

The paper provides theoretical results establishing the validity and consistency of this approach, as well as empirical demonstrations on simulated and real-world datasets.

Critical Analysis

The paper presents a clever and general approach to K-sample testing, but there are a few potential limitations worth considering:

The paper focuses on the theoretical properties of the method, but more empirical evaluation on a wider range of real-world datasets could help demonstrate its practical benefits.
The method requires choosing an appropriate dependence measure, which could be challenging in some cases and may require domain expertise.
The computational complexity of calculating some dependence measures, like distance correlation, may be a concern for very large datasets.

Overall, this research expands the toolbox for K-sample testing and opens up new possibilities for detecting more subtle distributional differences between groups.

Conclusion

This paper introduces a novel approach to the K-sample testing problem that allows for the use of a wide range of dependence measures. By transforming the problem, the method enables universally consistent tests that can detect any type of difference between the groups, beyond just differences in the means.

This flexible and powerful framework has the potential to significantly improve statistical testing in many domains where comparing distributions across multiple groups is important, such as in medicine, social science, and machine learning. Further research and real-world applications could help solidify the benefits and limitations of this approach.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

✅

Universally Consistent K-Sample Tests via Dependence Measures

Sambit Panda, Cencheng Shen, Ronan Perry, Jelle Zorn, Antoine Lutz, Carey E. Priebe, Joshua T. Vogelstein

The K-sample testing problem involves determining whether K groups of data points are each drawn from the same distribution. Analysis of variance is arguably the most classical method to test mean differences, along with several recent methods to test distributional differences. In this paper, we demonstrate the existence of a transformation that allows K-sample testing to be carried out using any dependence measure. Consequently, universally consistent K-sample testing can be achieved using a universally consistent dependence measure, such as distance correlation and the Hilbert-Schmidt independence criterion. This enables a wide range of dependence measures to be easily applied to K-sample testing.

9/17/2024

Refereeing the Referees: Evaluating Two-Sample Tests for Validating Generators in Precision Sciences

Samuele Grossi, Marco Letizia, Riccardo Torre

We propose a robust methodology to evaluate the performance and computational efficiency of non-parametric two-sample tests, specifically designed for high-dimensional generative models in scientific applications such as in particle physics. The study focuses on tests built from univariate integral probability measures: the sliced Wasserstein distance and the mean of the Kolmogorov-Smirnov statistics, already discussed in the literature, and the novel sliced Kolmogorov-Smirnov statistic. These metrics can be evaluated in parallel, allowing for fast and reliable estimates of their distribution under the null hypothesis. We also compare these metrics with the recently proposed unbiased Fr'echet Gaussian Distance and the unbiased quadratic Maximum Mean Discrepancy, computed with a quartic polynomial kernel. We evaluate the proposed tests on various distributions, focusing on their sensitivity to deformations parameterized by a single parameter $epsilon$. Our experiments include correlated Gaussians and mixtures of Gaussians in 5, 20, and 100 dimensions, and a particle physics dataset of gluon jets from the JetNet dataset, considering both jet- and particle-level features. Our results demonstrate that one-dimensional-based tests provide a level of sensitivity comparable to other multivariate metrics, but with significantly lower computational cost, making them ideal for evaluating generative models in high-dimensional settings. This methodology offers an efficient, standardized tool for model comparison and can serve as a benchmark for more advanced tests, including machine-learning-based approaches.

9/26/2024

🧪

Independence Testing for Temporal Data

Cencheng Shen, Jaewon Chung, Ronak Mehta, Ting Xu, Joshua T. Vogelstein

Temporal data are increasingly prevalent in modern data science. A fundamental question is whether two time series are related or not. Existing approaches often have limitations, such as relying on parametric assumptions, detecting only linear associations, and requiring multiple tests and corrections. While many non-parametric and universally consistent dependence measures have recently been proposed, directly applying them to temporal data can inflate the p-value and result in an invalid test. To address these challenges, this paper introduces the temporal dependence statistic with block permutation to test independence between temporal data. Under proper assumptions, the proposed procedure is asymptotically valid and universally consistent for testing independence between stationary time series, and capable of estimating the optimal dependence lag that maximizes the dependence. Moreover, it is compatible with a rich family of distance and kernel based dependence measures, eliminates the need for multiple testing, and exhibits excellent testing power in various simulation settings.

5/29/2024

🤿

Learning Deep Kernels for Non-Parametric Independence Testing

Nathaniel Xu, Feng Liu, Danica J. Sutherland

The Hilbert-Schmidt Independence Criterion (HSIC) is a powerful tool for nonparametric detection of dependence between random variables. It crucially depends, however, on the selection of reasonable kernels; commonly-used choices like the Gaussian kernel, or the kernel that yields the distance covariance, are sufficient only for amply sized samples from data distributions with relatively simple forms of dependence. We propose a scheme for selecting the kernels used in an HSIC-based independence test, based on maximizing an estimate of the asymptotic test power. We prove that maximizing this estimate indeed approximately maximizes the true power of the test, and demonstrate that our learned kernels can identify forms of structured dependence between random variables in various experiments.

9/12/2024