Guenther Walther | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Guenther Walther is active.

Explore More

Publication

Featured researches published by Guenther Walther.

Journal of The Royal Statistical Society Series B-statistical Methodology | 2001

Estimating the number of clusters in a data set via the gap statistic

Robert Tibshirani; Guenther Walther; Trevor Hastie

We propose a method (the ‘gap statistic’) for estimating the number of clusters (groups) in a set of data. The technique uses the output of any clustering algorithm (e.g. K-means or hierarchical), comparing the change in within-cluster dispersion with that expected under an appropriate reference null distribution. Some theory is developed for the proposal and a simulation study shows that the gap statistic usually outperforms other methods that have been proposed in the literature.

Journal of Computational and Graphical Statistics | 2005

Cluster Validation by Prediction Strength

Robert Tibshirani; Guenther Walther

This article proposes a new quantity for assessing the number of groups or clusters in a dataset. The key idea is to view clustering as a supervised classification problem, in which we must also estimate the “true” class labels. The resulting “prediction strength” measure assesses how many groups can be predicted from the data, and how well. In the process, we develop novel notions of bias and variance for unlabeled data. Prediction strength performs well in simulation studies, and we apply it to clusters of breast cancer samples from a DNA microarray study. Finally, some consistency properties of the method are established.

Electronic Journal of Statistics | 2007

Forward stagewise regression and the monotone lasso

Trevor Hastie; Jonathan Taylor; Robert Tibshirani; Guenther Walther

We consider the least angle regression and forward stagewise algorithms for solving penalized least squares regression problems. In Efron, Hastie, Johnstone & Tibshirani (2004) it is proved that the least angle regression algorithm, with a small modification, solves the lasso regression problem. Here we give an analogous result for incremental forward stagewise regression, showing that it solves a version of the lasso problem that enforces monotonicity. One consequence of this is as follows: while lasso makes optimal progress in terms of reducing the residual sum-of-squares per unit increase in

Statistical Science | 2009

Inference and Modeling with Log-concave Distributions

Guenther Walther

L_1

Annals of Statistics | 2010

Optimal and fast detection of spatial clusters with scan statistics

Guenther Walther

-norm of the coefficient

Journal of the American Statistical Association | 2002

Detecting the Presence of Mixing with Multiscale Maximum Likelihood

Guenther Walther

\beta

Mathematical Methods in The Applied Sciences | 1999

On a generalization of Blaschke's Rolling Theorem and the smoothing of surfaces

Guenther Walther

, forward stage-wise is optimal per unit

Computational Statistics & Data Analysis | 2007

Clustering with mixtures of log-concave distributions

George T. Chang; Guenther Walther

L_1

The Astrophysical Journal | 1999

Rotational Signature and Possible r-Mode Signature in the GALLEX Solar Neutrino Data

P. A. Sturrock; Jeffrey D. Scargle; Guenther Walther; M. S. Wheatland

arc-length traveled along the coefficient path. We also study a condition under which the coefficient paths of the lasso are monotone, and hence the different algorithms coincide. Finally, we compare the lasso and forward stagewise procedures in a simulation study involving a large number of correlated predictors.

Advances in Bioinformatics | 2009

Automatic Clustering of Flow Cytometry Data with Density-Based Merging

Guenther Walther; Noah Zimmerman; Wayne A. Moore; David R. Parks; Stephen Meehan; Ilana Belitskaya; Jinhui Pan; Leonore A. Herzenberg

Log-concave distributions are an attractive choice for modeling and inference, for several reasons: The class of log-concave distributions contains most of the commonly used parametric distributions and thus is a rich and flexible nonparametric class of distributions. Further, the MLE exists and can be computed with readily available algorithms. Thus, no tuning parameter, such as a bandwidth, is necessary for estimation. Due to these attractive properties, there has been considerable recent research activity concerning the theory and applications of log-concave distributions. This article gives a review of these results.

Explore More