Stefan M. Larson
Stanford University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Stefan M. Larson.
Protein Science | 2009
Stefan M. Larson; Jeremy L. England; John R. Desjarlais; Vijay S. Pande
Modeling the inherent flexibility of the protein backbone as part of computational protein design is necessary to capture the behavior of real proteins and is a prerequisite for the accurate exploration of protein sequence space. We present the results of a broad exploration of sequence space, with backbone flexibility, through a novel approach: large‐scale protein design to structural ensembles. A distributed computing architecture has allowed us to generate hundreds of thousands of diverse sequences for a set of 253 naturally occurring proteins, allowing exciting insights into the nature of protein sequence space. Designing to a structural ensemble produces a much greater diversity of sequences than previous studies have reported, and homology searches using profiles derived from the designed sequences against the Protein Data Bank show that the relevance and quality of the sequences is not diminished. The designed sequences have greater overall diversity than corresponding natural sequence alignments, and no direct correlations are seen between the diversity of natural sequence alignments and the diversity of the corresponding designed sequences. For structures in the same fold, the sequence entropies of the designed sequences cluster together tightly. This tight clustering of sequence entropies within a fold and the separation of sequence entropy distributions for different folds suggest that the diversity of designed sequences is primarily determined by a structures overall fold, and that the designability principle postulated from studies of simple models holds in real proteins. This has important implications for experimental protein design and engineering, as well as providing insight into protein evolution.
Proteins | 2003
Stefan M. Larson; Amit Garg; John R. Desjarlais; Vijay S. Pande
Protein structure prediction by comparative modeling benefits greatly from the use of multiple sequence alignment information to improve the accuracy of structural template identification and the alignment of target sequences to structural templates. Unfortunately, this benefit is limited to those protein sequences for which at least several natural sequence homologues exist. We show here that the use of large diverse alignments of computationally designed protein sequences confers many of the same benefits as natural sequences in identifying structural templates for comparative modeling targets. A large‐scale massively parallelized application of an all‐atom protein design algorithm, including a simple model of peptide backbone flexibility, has allowed us to generate 500 diverse, non‐native, high‐quality sequences for each of 264 protein structures in our test set. PSI‐BLAST searches using the sequence profiles generated from the designed sequences (“reverse” BLAST searches) give near‐perfect accuracy in identifying true structural homologues of the parent structure, with 54% coverage. In 41 of 49 genomes scanned using reverse BLAST searches, at least one novel structural template (not found by the standard method of PSI‐BLAST against PDB) is identified. Further improvements in coverage, through optimizing the scoring function used to design sequences and continued application to new protein structures beyond the test set, will allow this method to mature into a useful strategy for identifying distantly related structural templates. Proteins 2003;51:390–396.
Journal of Molecular Biology | 2003
Stefan M. Larson; Vijay S. Pande
Investigating the relative importance of protein stability, function, and folding kinetics in driving protein evolution has long been hindered by the fact that we can only compare modern natural proteins, the products of the very process we seek to understand, to each other, with no external references or baselines. Through a large-scale all-atom simulation of protein evolution, we have created a large diverse alignment of SH3 domain sequences which have been selected only for native state stability, with no other influencing factors. Although the average pairwise identity between computationally evolved and natural sequences is only 17%, the residue frequency distributions of the computationally evolved sequences are similar to natural SH3 sequences at 86% of the positions in the domain, suggesting that optimization for the native state structure has dominated the evolution of natural SH3 domains. Additionally, the positions which play a consistent role in the transition state of three well-characterized SH3 domains (by phi-value analysis) are structurally optimized for the native state, and vice versa. Indeed, we see a specific and significant correlation between sequence optimization for native state stability and conservation of transition state structure.
Biopolymers | 2003
Vijay S. Pande; Ian Baker; Jarrod Chapman; Sidney P. Elmer; Siraj Khaliq; Stefan M. Larson; Young Min Rhee; Michael R. Shirts; Christopher D. Snow; Eric J. Sorin; Bojan Zagrovic
arXiv: Biological Physics | 2009
Stefan M. Larson; Michael R. Shirts; Vijay S. Pande; Christopher D. Snow
Protein Science | 2000
Stefan M. Larson; Alan R. Davidson
Journal of Molecular Biology | 2000
Stefan M. Larson; Ariel A. Di Nardo; Alan R. Davidson
Current Opinion in Structural Biology | 2005
Arash Zarrine-Afsar; Stefan M. Larson; Alan R. Davidson
Journal of Molecular Biology | 2000
Kevin W. Plaxco; Stefan M. Larson; Ingo Ruczinski; David S. Riddle; Edward C. Thayer; Brian Buchwitz; Alan R. Davidson; David Baker
Journal of Molecular Biology | 2003
Ariel A. Di Nardo; Stefan M. Larson; Alan R. Davidson