Guenther Walther
Stanford University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Guenther Walther.
Journal of The Royal Statistical Society Series B-statistical Methodology | 2001
Robert Tibshirani; Guenther Walther; Trevor Hastie
We propose a method (the ‘gap statistic’) for estimating the number of clusters (groups) in a set of data. The technique uses the output of any clustering algorithm (e.g. K-means or hierarchical), comparing the change in within-cluster dispersion with that expected under an appropriate reference null distribution. Some theory is developed for the proposal and a simulation study shows that the gap statistic usually outperforms other methods that have been proposed in the literature.
Journal of Computational and Graphical Statistics | 2005
Robert Tibshirani; Guenther Walther
This article proposes a new quantity for assessing the number of groups or clusters in a dataset. The key idea is to view clustering as a supervised classification problem, in which we must also estimate the “true” class labels. The resulting “prediction strength” measure assesses how many groups can be predicted from the data, and how well. In the process, we develop novel notions of bias and variance for unlabeled data. Prediction strength performs well in simulation studies, and we apply it to clusters of breast cancer samples from a DNA microarray study. Finally, some consistency properties of the method are established.
Electronic Journal of Statistics | 2007
Trevor Hastie; Jonathan Taylor; Robert Tibshirani; Guenther Walther
We consider the least angle regression and forward stagewise algorithms for solving penalized least squares regression problems. In Efron, Hastie, Johnstone & Tibshirani (2004) it is proved that the least angle regression algorithm, with a small modification, solves the lasso regression problem. Here we give an analogous result for incremental forward stagewise regression, showing that it solves a version of the lasso problem that enforces monotonicity. One consequence of this is as follows: while lasso makes optimal progress in terms of reducing the residual sum-of-squares per unit increase in
Statistical Science | 2009
Guenther Walther
L_1
Annals of Statistics | 2010
Guenther Walther
-norm of the coefficient
Journal of the American Statistical Association | 2002
Guenther Walther
\beta
Mathematical Methods in The Applied Sciences | 1999
Guenther Walther
, forward stage-wise is optimal per unit
Computational Statistics & Data Analysis | 2007
George T. Chang; Guenther Walther
L_1
The Astrophysical Journal | 1999
P. A. Sturrock; Jeffrey D. Scargle; Guenther Walther; M. S. Wheatland
arc-length traveled along the coefficient path. We also study a condition under which the coefficient paths of the lasso are monotone, and hence the different algorithms coincide. Finally, we compare the lasso and forward stagewise procedures in a simulation study involving a large number of correlated predictors.
Advances in Bioinformatics | 2009
Guenther Walther; Noah Zimmerman; Wayne A. Moore; David R. Parks; Stephen Meehan; Ilana Belitskaya; Jinhui Pan; Leonore A. Herzenberg
Log-concave distributions are an attractive choice for modeling and inference, for several reasons: The class of log-concave distributions contains most of the commonly used parametric distributions and thus is a rich and flexible nonparametric class of distributions. Further, the MLE exists and can be computed with readily available algorithms. Thus, no tuning parameter, such as a bandwidth, is necessary for estimation. Due to these attractive properties, there has been considerable recent research activity concerning the theory and applications of log-concave distributions. This article gives a review of these results.