Zongming Ma
University of Pennsylvania
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Zongming Ma.
Annals of Statistics | 2013
Zongming Ma
Principal component analysis (PCA) is a classical dimension reduction method which projects data onto the principal subspace spanned by the leading eigenvectors of the covariance matrix. However, it behaves poorly when the number of features p is comparable to, or even much larger than, the sample size n. In this paper, we propose a new iterative thresholding approach for estimating principal subspaces in the setting where the leading eigenvectors are sparse. Under a spiked covariance model, we find that the new approach recovers the principal subspace and leading eigenvectors consistently, and even optimally, in a range of high-dimensional sparse settings. Simulated examples also demonstrate its competitive performance.
Annals of Statistics | 2013
T. Tony Cai; Zongming Ma; Yihong Wu
Principal component analysis (PCA) is one of the most commonly used statistical procedures with a wide range of applications. This paper considers both minimax and adaptive estimation of the principal subspace in the high dimensional setting. Under mild technical conditions, we first establish the optimal rates of convergence for estimating the principal subspace which are sharp with respect to all the parameters, thus providing a complete characterization of the difficulty of the estimation problem in term of the convergence rate. The lower bound is obtained by calculating the local metric entropy and an application of Fanos lemma. The rate optimal estimator is constructed using aggregation, which, however, might not be computationally feasible. We then introduce an adaptive procedure for estimating the principal subspace which is fully data driven and can be computed efficiently. It is shown that the estimator attains the optimal rates of convergence simultaneously over a large collection of the parameter spaces. A key idea in our construction is a reduction scheme which reduces the sparse PCA problem to a high-dimensional multivariate regression problem. This method is potentially also useful for other related problems.
Annals of Statistics | 2015
Zongming Ma; Yihong Wu
This paper studies the minimax detection of a small submatrix of elevated mean in a large matrix contaminated by additive Gaussian noise. To investigate the tradeoff between statistical performance and computational cost from a complexity-theoretic perspective, we consider a sequence of discretized models which are asymptotically equivalent to the Gaussian model. Under the hypothesis that the planted clique detection problem cannot be solved in randomized polynomial time when the clique size is of smaller order than the square root of the graph size, the following phase transition phenomenon is established: when the size of the large matrix
Bernoulli | 2013
T. Tony Cai; Zongming Ma
p\to\infty
Annals of Statistics | 2017
Chao Gao; Zongming Ma; Harrison H. Zhou
, if the submatrix size
Journal of Computational and Graphical Statistics | 2014
Dan Yang; Zongming Ma; Andreas Buja
k=\Theta(p^{\alpha})
Annals of Applied Probability | 2012
Iain M. Johnstone; Zongming Ma
for any
PLOS ONE | 2009
Hongjuan Zhao; Zongming Ma; Robert Tibshirani; John P. Higgins; Börje Ljungberg; James D. Brooks
\alpha\in(0,{2}/{3})
Annals of Statistics | 2015
Chao Gao; Zongming Ma; Zhao Ren; Harrison H. Zhou
, computational complexity constraints can incur a severe penalty on the statistical performance in the sense that any randomized polynomial-time test is minimax suboptimal by a polynomial factor in
Journal of The Royal Statistical Society Series B-statistical Methodology | 2017
Edward H. Kennedy; Zongming Ma; Matthew D. McHugh; Dylan S. Small
p