Abstract
We propose a novel approach to the analysis of covariance operators making use of concentration inequalities. First, non-asymptotic confidence sets are constructed for such operators. Then, subsequent applications including a k-sample test for equality of covariance operators, a functional data classifier, and an expectation-maximization style clustering algorithm are derived and tested on both simulated and phoneme data.
References
Abraham, C., Cornillon, P.-A., Matzner-Løber, E. and Molinari, N. (2003). Unsupervised curve clustering using B-splines. Scand. J. Stat. 30, 3, 581–595.
Arlot, S., Blanchard, G. and Roquain, E. (2010). Some nonasymptotic results on resampling in high dimension, I: Confidence regions. Ann. Stat. 38, 1, 51–82.
Bartlett, P.L. and Mendelson, S. (2003). Rademacher and Gaussian complexities: Risk bounds and structural results. J. Mach. Learn. Res. 3, 463–482.
Bartlett, P.L., Boucheron, S. and Lugosi, G. (2002). Model selection and error estimation. Mach. Learn. 48, 1–3, 85–113.
Berlinet, A., Biau, G. and Rouviere, L. (2008). Functional supervised classification with wavelets. In Annales de l’ISUP, volume 52.
Boucheron, S., Lugosi, G. and Massart, P. (2013). Concentration inequalities: A nonasymptotic theory of independence. Oxford University Press.
Cabassi, A. and Kashlak, A.B. (2016). fdcov: Analysis of Covariance Operators. R package version 1.0.0.
Casella, G. and Berger, R.L. (2002). Statistical inference, volume 2. Duxbury Pacific Grove.
Chang, C., Chen, Y. and Ogden, T. (2014). Functional data classification: a wavelet approach. Comput. Stat. 29, 6, 1497–1513.
De la Pena, V. and Giné, E. (2012). Decoupling: From dependence to independence. Springer Science & Business Media.
Delaigle, A. and Hall, P. (2012). Achieving near perfect classification for functional data. J. R. Statist. Soc. Series B (Statist. Methodol.) 74, 2, 267–286.
Fan, Z. (2011). Confidence regions for infinite-dimensional statistical parameters. Part III essay in Mathematics, University of Cambridge. http://web.stanford.edu/zhoufan/PartIIIEssay.pdf.
Ferraty, F. and Vieu, P. (2003). Curves discrimination: A nonparametric functional approach. Comput. Statist. Data Anal. 44, 1, 161–173.
Ferraty, F. and Vieu, P. (2006). Nonparametric Functional Data Analysis: Theory and Practice. Springer Science & Business Media.
Fremdt, S., Steinebach, J.G., Horváth, L. and Kokoszka, P. (2013). Testing the equality of covariance operators in functional samples. Scand. J. Stat. 40, 1, 138–152.
Giné, E. and Nickl, R. (2010). Confidence bands in density estimation. Ann. Stat. 38, 2, 1122–1170.
Giné, E. and Nickl, R. (2016). Mathematical Foundations of Infinite-Dimensional Statistical Models. Cambridge University Press.
Glendinning, R.H. and Herbert, R.A. (2003). Shape classification using smooth principal components. Pattern Recogn. Lett. 24, 12, 2021–2030.
Hall, P., Poskitt, D.S. and Presnell, B. (2001). A functional data-analytic approach to signal discrimination. Technometrics 43, 1, 1–9.
Hastie, T., Buja, A. and Tibshirani, R. (1995). Penalized discriminant analysis. Ann. Stat., 73–102.
Horváth, L. and Kokoszka, P. (2012). Inference for Functional Data with Applications, volume 200. Springer Science & Business Media.
Isserlis, L. (1918). On a formula for the product-moment coefficient of any order of a normal frequency distribution in any number of variables. Biometrika 12, 1/2, 134–139.
James, G.M. and Hastie, T.J. (2001). Functional linear discriminant analysis for irregularly sampled curves. J. R. Statist. Soc. Series B, Statist. Methodol., 533–550.
Jiang, C.-R., Aston, J.A. and Wang, J.-L. (2016). A functional approach to deconvolve dynamic neuroimaging data. J. Am. Stat. Assoc. 111, 513, 1–13.
Kerkyacharian, G., Nickl, R. and Picard, D. (2012). Concentration inequalities and confidence bands for needlet density estimators on compact homogeneous manifolds. Probab. Theory Relat. Fields 153, 1–2, 363–404.
Koltchinskii, V. (2001). Rademacher penalties and structural risk minimization. IEEE Trans. Inf. Theory 47, 5, 1902–1914.
Koltchinskii, V. (2006). Local Rademacher complexities and oracle inequalities in risk minimization. Ann. Stat. 34, 6, 2593–2656.
Ledoux, M. (2001). The Concentration of Measure Phenomenon, volume 89. American Mathematical Soc.
Lounici, K. and Nickl, R. (2011). Global uniform risk bounds for wavelet deconvolution estimators. Ann. Stat. 39, 1, 201–231.
Müller, H.-G. and Stadtmüller, U. (2005). Generalized functional linear models. Ann. Statist., 774–805.
Panaretos, V.M., Kraus, D. and Maddocks, J.H. (2010). Second-order comparison of Gaussian random functions and the geometry of DNA minicircles. J. Am. Stat. Assoc. 105, 490, 670–682.
Peng, J. and Müller, H.-G. (2008). Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions. Ann. Appl. Statist., 1056–1077.
Pigoli, D., Aston, J.A.D., Dryden, I.L. and Secchi, P. (2014). Distances and inference for covariance operators. Biometrika, page asu008.
Pigoli, D., Hadjipantelis, P.Z., Coleman, J.S. and Aston, J.A.D. (2015). The analysis of acoustic phonetic data: exploring differences in the spoken Romance languages. arXiv:1507.07587.
Ramsay, J.O. and Silverman, B.W. (2005). Functional Data Analysis. Springer, New York.
Talagrand, M. (1996). New concentration inequalities in product spaces. Inventiones mathematicae 126, 3, 505–563.
Acknowledgements
JA is grateful that this research was supported by EPSRC grant EP/K021672/2.
Appendices
Appendix A: Confidence Sets for the Mean in Banach Spaces
The goal of this section is to construct a non-asymptotic confidence region in the Banach space setting. This is specialized in Section 3 to our case of interest, covariance operators, when the Xi below are replaced with \(f_{i}^{\otimes 2}\).
Let X1, … , Xn ∈ (B, ‖⋅‖B) be mean zero independent and identically distributed Banach space valued random variables with ‖Xi‖ B ≤ U for all i = 1, … , n, where U is some positive constant. Furthermore, let \(\left \langle {\cdot },{\cdot }\right \rangle :B\times B^{*} \rightarrow \mathbb {R}\) be such that for X ∈ B and ϕ ∈ B∗ we have 〈X, ϕ〉 = ϕ(X). Define
\[ Z = \sup_{\phi} \sum_{i=1}^{n} \left\langle X_i, \phi \right\rangle \quad\text{and}\quad \sigma^2 = \sup_{\phi} \mathrm{E}\left\langle X_1, \phi \right\rangle^2, \]
where the suprema are taken over a countably dense subset of the unit ball of B∗. Furthermore, define vn = 2U EZ + nσ2. Then, Talagrand's inequality (Talagrand, 1996) gives P (Z > EZ + r) ≤ exp{−r2/(2vn + 2rU/3)}. Rewriting Z as \(n\left \lVert {\bar {X}-\mathrm {E}{ \bar {X}}}\right \rVert _{B}\) results in
\[ \mathrm{P}\left( n\left\lVert \bar{X} - \mathrm{E}\bar{X} \right\rVert_B > n\,\mathrm{E}\left\lVert \bar{X} - \mathrm{E}\bar{X} \right\rVert_B + r \right) \le \exp\left\{ \frac{-r^2}{2v_n + 2rU/3} \right\}, \]
where ‖Xi‖ B < U and \(v_{n} = 2nU\mathrm {E}\{\left \lVert {\bar {X}-\mathrm {E}{\bar {X}}}\right \rVert _{B}\} + n\sigma ^{2}\).
The above tail bound incorporates the unknown \(\mathrm {E}(\lVert {\bar {X}-\mathrm {E}{\bar {X}}}\rVert _{B})\). Consequently, a symmetrization technique is used: this term is replaced by the norm of the Rademacher average \( R_{n} = {n^{-1}}{\sum }_{i = 1}^{n}\varepsilon _{i}(X_{i}- \bar {X}) \), where the εi are independent and identically distributed Rademacher random variables, independent of the Xi. This substitution is justified by invoking the symmetrization inequality (Giné and Nickl, 2016, Theorem 3.1.21),
\[ \mathrm{E}\left\lVert \bar{X} - \mathrm{E}\bar{X} \right\rVert_B \le 2\,\mathrm{E}\left\lVert R_n \right\rVert_B. \]
If the data are symmetric about their mean, that is, when Xi − EXi and EXi − Xi are equidistributed, the coefficient of 2 is unnecessary and can be dropped, because Xi − EXi and ε{Xi − EXi} are then also equidistributed. In practice, the data may not be symmetric. However, averaging even a moderately sized data set has a symmetrizing effect on the sample mean. Assuming the data are not highly skewed, the coefficient of 2 can be safely dropped in practice to tighten the confidence set. In fact, considering the phoneme data from Section 5.1 in this setting results in the values displayed in Table 6, which shows that in the trace norm setting, the Rademacher average is much greater than half the size of EZ, and that in the Hilbert-Schmidt and operator norm settings, the Rademacher average is actually marginally less than EZ.
This symmetrization result allows us to replace the original expectation with the expectation of the Rademacher average. Furthermore, Talagrand’s inequality also applies to Rn. Hence, the Rademacher average concentrates strongly about its expectation, which justifies dropping the outer expectation. In practice, one can use the intermediary Eε‖Rn‖ B, which can be approximated for reasonably sized data sets via Monte Carlo simulation of the εi. However, this is not strictly necessary, and for large data sets, a single random draw of the εi will suffice (Giné and Nickl, 2016, Section 3.4.2).
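The Monte Carlo approximation of Eε‖Rn‖ B can be sketched as follows. This is an illustrative sketch, not code from the fdcov package: the data are discretized curves stored as rows of an array, and the Euclidean norm on the grid stands in for the Banach norm of interest.

```python
import numpy as np

def rademacher_average(X, n_sim=500, seed=None):
    """Monte Carlo estimate of E_eps || n^{-1} sum_i eps_i (X_i - Xbar) ||.

    X is an (n, d) array of discretized curves (or vectorized operators);
    the Euclidean norm on the grid stands in for the Banach norm.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    Xc = X - X.mean(axis=0)                      # center: X_i - Xbar
    norms = np.empty(n_sim)
    for k in range(n_sim):
        eps = rng.choice([-1.0, 1.0], size=n)    # i.i.d. Rademacher signs
        norms[k] = np.linalg.norm(eps @ Xc) / n  # ||R_n|| for this draw
    return norms.mean()
```

For large n, a single draw of the signs (n_sim = 1) already concentrates near the expectation, in line with the remark above.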
The resulting (1 − α)-confidence set is
\[ C_\alpha = \left\{ \mu \in B \,:\, \left\lVert \bar{X} - \mu \right\rVert_B \le 2\,\mathrm{E}_\varepsilon\left\lVert R_n \right\rVert_B + \frac{r(\alpha)}{n} \right\}, \]
where r(α) is chosen so that \(\exp\{-r(\alpha)^2/(2v_n + 2r(\alpha)U/3)\} = \alpha\).
To make use of these results in practice, the weak variance σ2 must be estimated from the data and a reasonable choice of U must be made. A main contribution of this paper is to propose theoretically motivated but practically useful non-asymptotic choices for these constants in the functional data applications we investigate.
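The radius implied by the Talagrand-type tail bound exp{−r2/(2vn + 2rU/3)} = α can be computed in closed form by solving a quadratic in r. A minimal sketch, assuming vn and U have already been estimated:

```python
import numpy as np

def talagrand_radius(alpha, v_n, U):
    """Solve exp{-r^2 / (2 v_n + 2 r U / 3)} = alpha for r > 0.

    Equivalent to the quadratic r^2 - (2 U L / 3) r - 2 v_n L = 0
    with L = log(1/alpha); we take the positive root.
    """
    L = np.log(1.0 / alpha)
    b = 2.0 * U * L / 3.0
    return 0.5 * (b + np.sqrt(b * b + 8.0 * v_n * L))
```

Substituting the returned r back into the exponential recovers the level α exactly, which gives a quick correctness check.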
Appendix B: Calculation of the Weak Variance
B.1 The Weak Variance for p ∈ [1, ∞)
To calculate the weak variance σ2, define f⊗n = f ⊗ ⋯ ⊗ f to be the n-fold tensor product of f with itself and extend the definition of \( \left \langle {\cdot },{\cdot }\right \rangle : (L^{2})^{\otimes 4}\times \{(L^{2})^{\otimes 4}\}^{*}\rightarrow \mathbb {R} \) such that 〈f⊗4, ϕ⊗4〉 = 〈f⊗2, ϕ⊗2〉2 = 〈f, ϕ〉4 = ϕ(f)4. For operators π ∈ {(L2)⊗2}∗ and Ξ ∈ {(L2)⊗4}∗ in the respective unit balls, the weak variance satisfies
\[ \sigma^2 = \sup_{\pi} \mathrm{E}\left\langle f^{\otimes 2}, \pi \right\rangle^2 = \sup_{\pi} \mathrm{E}\left\langle f^{\otimes 4}, \pi^{\otimes 2} \right\rangle \le \sup_{\Xi} \left\langle \mathrm{E} f^{\otimes 4}, \Xi \right\rangle = \left\lVert \mathrm{E} f^{\otimes 4} \right\rVert_p, \]
where the inequality stems from the fact that the supremum is being taken over a larger set. However, in the Hilbert space setting, the dual of the tensor product does coincide with the tensor product of the dual space, and thus the above inequality can be replaced with an equality if the Hilbert-Schmidt norm, the 2-Schatten norm, is used. Given a bound \(\left \lVert {f_{i}}\right \rVert _{L^{2}}^{2}\le c^{2}=U\), then \( \sigma ^{2}\le \lVert { \mathrm {E}{ f^{\otimes 4}}}\rVert _{p}\le \mathrm {E}\lVert {f}\rVert _{L^{2}}^{4} \le c^{4} = U^{2}. \)
B.2 The Weak Variance for p = ∞
Let E be a countable dense subset of the unit ball of L2(I). In the case p = ∞, we cannot use duality, but can still write Z and σ2 as suprema over the countable set and achieve the same results as above.
As before, if \(\left \lVert {f_{i}^{\otimes 2}}\right \rVert _{\infty }=\left \lVert {f_{i}}\right \rVert _{L^{2}}^{2}\le c^{2}=U\), then σ2 ≤ U2.
B.3 The Weak Variance for Gaussian Data
Similarly to the bounded case, we estimate ‖Ef⊗4 −Σ⊗2‖ p for Gaussian data. Consider f from a Gaussian process with mean zero and covariance Σ. Strictly speaking, these variables are not norm bounded, but similar concentration results for Gaussian processes can be derived. Indeed, let f1, … , fn be independent Gaussian processes with mean zero and covariance Σ. The empirical covariance kernel is \(\hat {c}(s,t) = n^{-1}{\sum }_{i = 1}^{n} f_{i}(s)f_{i}(t)\), which is a Gaussian polynomial. By the decoupling inequality (De la Pena and Giné, 2012, Theorem 4.2.27), there exists a κ > 0 such that
\[ \mathrm{P}\left( \left\lVert \hat{c} - \mathrm{E}\hat{c} \right\rVert_p > t \right) \le \kappa\, \mathrm{P}\left( \kappa \left\lVert \tilde{c} \right\rVert_p > t \right), \]
where \(\tilde {c}(s,t) = n^{-1}{\sum }_{i = 1}^{n} f_{i}(s)f_{i}^{\prime }(t)\) with \(f_{1}^{\prime },\ldots ,f_{n}^{\prime }\) independent copies of the original fi. Thus, our Gaussian polynomial can be thought of as a conditionally Gaussian random variable. Now, using concentration bounds for norms of Gaussian vectors (Giné and Nickl, 2016, Theorem 2.6.8) twice, an inequality similar to the one in the bounded case is readily obtained.
Defining fs = f(s), the integral kernel can be written as (Isserlis, 1918)
\[ \mathrm{E}\, f_s f_t f_u f_v = \Sigma_{s,t}\Sigma_{u,v} + \Sigma_{s,u}\Sigma_{t,v} + \Sigma_{s,v}\Sigma_{t,u}. \]
Hence, we have that \(\mathrm{E}\, f_s f_t f_u f_v - \Sigma_{s,t}\Sigma_{u,v} = \Sigma_{s,u}\Sigma_{t,v} + \Sigma_{s,v}\Sigma_{t,u}\), and the operator Ef⊗4 −Σ⊗2, which can be thought of as a Hilbert-Schmidt operator on the space Op(L2), can be represented by the integral kernel \(\Sigma_{s,u}\Sigma_{t,v} + \Sigma_{s,v}\Sigma_{t,u}\). These two terms are merely relabeled versions of Σ⊗2. Consequently, using the subadditivity of the norm, ‖Ef⊗4 −Σ⊗2‖ p ≤ ‖Σ⊗2‖ p + ‖Σ⊗2‖ p = 2 ‖Σ⊗2‖ p. For example, for the Hilbert-Schmidt norm,
\[ \left\lVert \mathrm{E} f^{\otimes 4} - \Sigma^{\otimes 2} \right\rVert_2 \le 2\left\lVert \Sigma^{\otimes 2} \right\rVert_2 = 2\left\lVert \Sigma \right\rVert_2^2. \]
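The Isserlis identity can be sanity-checked numerically. A small sketch on a two-point grid, where the Gaussian process reduces to a bivariate normal; the covariance S below is an arbitrary illustrative choice:

```python
import numpy as np

# Monte Carlo check of the Isserlis identity on a two-point grid:
# E[f_s f_t f_u f_v] = S_st S_uv + S_su S_tv + S_sv S_tu,
# where the process restricted to two grid points is N(0, S).
rng = np.random.default_rng(0)
S = np.array([[1.0, 0.5], [0.5, 2.0]])
f = rng.multivariate_normal([0.0, 0.0], S, size=500_000)

# take s = t = 1 and u = v = 2: E[f_1^2 f_2^2] = S_11 S_22 + 2 S_12^2
mc_moment = np.mean(f[:, 0] ** 2 * f[:, 1] ** 2)
isserlis_moment = S[0, 0] * S[1, 1] + 2.0 * S[0, 1] ** 2
```

With half a million draws the Monte Carlo moment agrees with the closed-form value to well within its simulation error.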
Lemma 5.1 of Horváth and Kokoszka (2012) gives an explicit form of a covariance operator of Σ in terms of the eigenfunctions of Σ for Gaussian data in the Hilbert-Schmidt setting.
Given λi, the eigenvalues of Σ, the spectrum of Σ⊗2 is \(\{ \lambda _{i}\lambda _{j}\}_{i,j = 1}^{\infty }\). Hence, for any of the p-Schatten norms, \(\lVert {\Sigma }\otimes {\Sigma }\rVert _{p} = \lVert {\Sigma }\rVert _{p}^{2}\). Note that in the above calculations, the weak variance depends on the unknown Σ. In practice, this can be replaced by the empirical estimate \(\hat {\Sigma }\).
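The identity \(\lVert \Sigma \otimes \Sigma \rVert_p = \lVert \Sigma \rVert_p^2\) can be verified on a finite-dimensional discretization, where the tensor product becomes a Kronecker product; a brief sketch with an arbitrary symmetric positive semi-definite matrix standing in for Σ:

```python
import numpy as np

def schatten_norm(A, p):
    """p-Schatten norm: the l^p norm of the singular values of A."""
    s = np.linalg.svd(A, compute_uv=False)
    return (s ** p).sum() ** (1.0 / p)

# finite-dimensional stand-in for a covariance operator
rng = np.random.default_rng(0)
B = rng.standard_normal((4, 4))
Sigma = B @ B.T                 # symmetric, positive semi-definite

# the spectrum of Sigma (x) Sigma is {lam_i lam_j}, so the norm factorizes
lhs_tr = schatten_norm(np.kron(Sigma, Sigma), p=1)   # trace norm
rhs_tr = schatten_norm(Sigma, p=1) ** 2
lhs_hs = schatten_norm(np.kron(Sigma, Sigma), p=2)   # Hilbert-Schmidt norm
rhs_hs = schatten_norm(Sigma, p=2) ** 2
```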
Appendix C: Heavy Tails and Noisy Measurements
As functional data often arises from noisy measurements in practice, consider data of the form Yi = Xi + εi, where Xi is a mean zero Gaussian process with covariance operator Σ and εi is Gaussian white noise with covariance c2I for some c2 > 0. Figure 5 repeats the previous power analysis for the two-sample test, but in moderately noisy settings.
Secondly, heavier tailed data, specifically t-distributed data with 6 degrees of freedom, can also be handled by this method. Figure 6 repeats the earlier two-sample power analysis, but with the heavier tailed distribution in place of the Gaussian. Here, the coefficient of (k + 2)/(k + 3) in Eq. 4.1 was replaced with 1 in order to achieve the correct empirical size. In general, given arbitrary data, one can simulate null data and adjust the tuning parameters to match the desired empirical size of the test.
Lastly, the empirical coverage of the concentration based confidence set is still comparable to the desired coverage in the heavy tailed case. Consider t-distributed data with six degrees of freedom: nine operators were randomly generated, and data was simulated from each. Figure 7 recreates the simulated confidence sets from Fig. 2, but with the t-distributed data. To achieve these empirical coverages, the Gaussian weak variance, previously calculated to be \(\sigma ^{2}= 2\left \lVert {\Sigma }\right \rVert _{p}^{2}\), is scaled by a factor of ν/(ν − 4), where ν is the degrees of freedom.
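The heavy tail adjustment amounts to a one-line scaling of the Gaussian weak variance. A minimal sketch; the helper name is ours, and sigma_p_norm stands for the plug-in estimate \(\lVert \hat{\Sigma} \rVert_p\):

```python
def t_weak_variance(sigma_p_norm, nu):
    """Heavy tailed adjustment of the Gaussian weak variance:
    2 ||Sigma||_p^2 scaled by nu / (nu - 4) for t_nu-distributed data.
    Finite fourth moments of the t distribution require nu > 4.
    """
    if nu <= 4:
        raise ValueError("nu must exceed 4 for finite fourth moments")
    return 2.0 * sigma_p_norm ** 2 * nu / (nu - 4.0)
```

For the six-degrees-of-freedom case considered above, the Gaussian value is inflated by a factor of 6/2 = 3.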
Cite this article
Kashlak, A.B., Aston, J.A.D. & Nickl, R. Inference on Covariance Operators via Concentration Inequalities: k-sample Tests, Classification, and Clustering via Rademacher Complexities. Sankhya A 81, 214–243 (2019). https://doi.org/10.1007/s13171-018-0143-9