Anru Zhang | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Anru Zhang is active.

Explore More

Publication

Featured researches published by Anru Zhang.

IEEE Transactions on Information Theory | 2014

Sparse Representation of a Polytope and Recovery of Sparse Signals and Low-Rank Matrices

T. Tony Cai; Anru Zhang

This paper considers compressed sensing and affine rank minimization in both noiseless and noisy cases and establishes sharp restricted isometry conditions for sparse signal and low-rank matrix recovery. The analysis relies on a key technical tool, which represents points in a polytope by convex combinations of sparse vectors. The technique is elementary while yielding sharp results. It is shown that for any given constant t ≥ 4/3, in compressed sensing, δtkA <; √((t-1)/t) guarantees the exact recovery of all k sparse signals in the noiseless case through the constrained l1 minimization, and similarly, in affine rank minimization, δtrM <; √((t-1)/t) ensures the exact reconstruction of all matrices with rank at most r in the noiseless case via the constrained nuclear norm minimization. In addition, for any ε > 0, δtkA <; √(t-1/t) + ε is not sufficient to guarantee the exact recovery of all k-sparse signals for large k. Similar results also hold for matrix recovery. In addition, the conditions δtkA <; √((t-)1/t) and δtrM <; √((t-1)/t) are also shown to be sufficient, respectively, for stable recovery of approximately sparse signals and low-rank matrices in the noisy case.

Journal of the American Statistical Association | 2016

Instrumental Variables Estimation With Some Invalid Instruments and its Application to Mendelian Randomization

Hyunseung Kang; Anru Zhang; T. Tony Cai; Dylan S. Small

Instrumental variables have been widely used for estimating the causal effect between exposure and outcome. Conventional estimation methods require complete knowledge about all the instruments’ validity; a valid instrument must not have a direct effect on the outcome and not be related to unmeasured confounders. Often, this is impractical as highlighted by Mendelian randomization studies where genetic markers are used as instruments and complete knowledge about instruments’ validity is equivalent to complete knowledge about the involved genes’ functions. In this article, we propose a method for estimation of causal effects when this complete knowledge is absent. It is shown that causal effects are identified and can be estimated as long as less than 50% of instruments are invalid, without knowing which of the instruments are invalid. We also introduce conditions for identification when the 50% threshold is violated. A fast penalized ℓ1 estimation method, called sisVIVE, is introduced for estimating the causal effect without knowing which instruments are valid, with theoretical guarantees on its performance. The proposed method is demonstrated on simulated data and a real Mendelian randomization study concerning the effect of body mass index(BMI) on health-related quality of life (HRQL) index. An R package sisVIVE is available on CRAN. Supplementary materials for this article are available online.

Annals of Statistics | 2015

ROP: Matrix recovery via rank-one projections

T. Tony Cai; Anru Zhang

Estimation of low-rank matrices is of significant interest in a range of contemporary applications. In this paper, we introduce a rank-one projection model for low-rank matrix recovery and propose a constrained nuclear norm minimization method for stable recovery of low-rank matrices in the noisy case. The procedure is adaptive to the rank and robust against small perturbations. Both upper and lower bounds for the estimation accuracy under the Frobenius norm loss are obtained. The proposed estimator is shown to be rate-optimal under certain conditions. The estimator is easy to implement via convex programming and performs well numerically. The techniques and main results developed in the paper also have implications to other related statistical problems. An application to estimation of spiked covariance matrices from one-dimensional random projections is considered. The results demonstrate that it is still possible to accurately estimate the covariance matrix of a high-dimensional distribution based only on one-dimensional projections.

The Annals of Applied Statistics | 2016

Regression analysis for microbiome compositional data

Pixu Shi; Anru Zhang; Hongzhe Li

One important problem in microbiome analysis is to identify the bacterial taxa that are associated with a response, where the microbiome data are summarized as the composition of the bacterial taxa at different taxonomic levels. This paper considers regression analysis with such compositional data as covariates. In order to satisfy the subcompositional coherence of the results, linear models with a set of linear constraints on the regression coefficients are introduced. Such models allow regression analysis for subcompositions and include the log-contrast model for compositional covariates as a special case. A penalized estimation procedure for estimating the regression coefficients and for selecting variables under the linear constraints is developed. A method is also proposed to obtain de-biased estimates of the regression coefficients that are asymptotically unbiased and have a joint asymptotic multivariate normal distribution. This provides valid confidence intervals of the regression coefficients and can be used to obtain the

Annals of Statistics | 2018

Rate-optimal perturbation bounds for singular subspaces with applications to high-dimensional statistics

T. Tony Cai; Anru Zhang

Journal of the American Statistical Association | 2016

Structured Matrix Completion with Applications to Genomic Data Integration

Tianxi Cai; T. Tony Cai; Anru Zhang

-values. Simulation results show the validity of the confidence intervals and smaller variances of the de-biased estimates when the linear constraints are imposed. The proposed methods are applied to a gut microbiome data set and identify four bacterial genera that are associated with the body mass index after adjusting for the total fat and caloric intakes.

Journal of Multivariate Analysis | 2016

Inference for high-dimensional differential correlation matrices

T. Tony Cai; Anru Zhang

Perturbation bounds for singular spaces, in particular Wedins

Journal of Multivariate Analysis | 2016

Minimax rate-optimal estimation of high-dimensional covariance matrices with incomplete data

T. Tony Cai; Anru Zhang

\sin \Theta

Applied and Computational Harmonic Analysis | 2013

Sharp RIP bound for sparse signal and low-rank matrix recovery

T. Tony Cai; Anru Zhang

theorem, are a fundamental tool in many fields including high-dimensional statistics, machine learning, and applied mathematics. In this paper, we establish separate perturbation bounds, measured in both spectral and Frobenius

IEEE Transactions on Information Theory | 2018