Iain Melvin
Princeton University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Iain Melvin.
Bioinformatics | 2009
Iain Melvin; Jason Weston; Christina S. Leslie; William Stafford Noble
Summary: We present a large-scale implementation of the Rankprop protein homology ranking algorithm in the form of an openly accessible web server. We use the NRDB40 PSI-BLAST all-versus-all protein similarity network of 1.1 million proteins to construct the graph for the Rankprop algorithm, whereas previously, results were only reported for a database of 108 000 proteins. We also describe two algorithmic improvements to the original algorithm, including propagation from multiple homologs of the query and better normalization of ranking scores, that lead to higher accuracy and to scores with a probabilistic interpretation. Availability: The Rankprop web server and source code are available at http://rankprop.gs.washington.edu Contact: [email protected]; [email protected]
PLOS Computational Biology | 2011
Iain Melvin; Jason Weston; William Stafford Noble; Christina S. Leslie
Virtually every molecular biologist has searched a protein or DNA sequence database to find sequences that are evolutionarily related to a given query. Pairwise sequence comparison methods—i.e., measures of similarity between query and target sequences—provide the engine for sequence database search and have been the subject of 30 years of computational research. For the difficult problem of detecting remote evolutionary relationships between protein sequences, the most successful pairwise comparison methods involve building local models (e.g., profile hidden Markov models) of protein sequences. However, recent work in massive data domains like web search and natural language processing demonstrate the advantage of exploiting the global structure of the data space. Motivated by this work, we present a large-scale algorithm called ProtEmbed, which learns an embedding of protein sequences into a low-dimensional “semantic space.” Evolutionarily related proteins are embedded in close proximity, and additional pieces of evidence, such as 3D structural similarity or class labels, can be incorporated into the learning process. We find that ProtEmbed achieves superior accuracy to widely used pairwise sequence methods like PSI-BLAST and HHSearch for remote homology detection; it also outperforms our previous RankProp algorithm, which incorporates global structure in the form of a protein similarity network. Finally, the ProtEmbed embedding space can be visualized, both at the global level and local to a given query, yielding intuition about the structure of protein sequence space.
BMC Bioinformatics | 2007
Iain Melvin; Eugene Ie; Rui Kuang; Jason Weston; William Stafford Noble; Christina S. Leslie
Journal of Machine Learning Research | 2007
Iain Melvin; Eugene Ie; Jason Weston; William Stafford Noble; Christina S. Leslie
BMC Bioinformatics | 2008
Iain Melvin; Jason Weston; Christina S. Leslie; William Stafford Noble
Archive | 2011
Iain Melvin; Koray Kavukcuoglu; Akshat Aranya; Bing Bai
neural information processing systems | 2010
David Grangier; Iain Melvin
Archive | 2012
Iain Melvin; Murugan Sankaradas; Yun Chi; Hojjat Jafarpour
computer vision and pattern recognition | 2018
Chih-Yao Ma; Asim Kadav; Iain Melvin; Ghassan AlRegib; Hans Peter Graf
Archive | 2013
Eric Cosatto; Pierre-François Laquerre; Christopher Malon; Hans-Peter Graf; Iain Melvin