Is this you? Create Your Porfile

Vlado Dančík

University of Southern California

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Vlado Dančík is active.

Explore More

Publication

Featured researches published by Vlado Dančík.

Journal of Computational Biology | 1999

De novo peptide sequencing via tandem mass spectrometry.

Vlado Dančík; Theresa A. Addona; Karl R. Clauser; James E. Vath; Pavel A. Pevzner

Peptide sequencing via tandem mass spectrometry (MS/MS) is one of the most powerful tools in proteomics for identifying proteins. Because complete genome sequences are accumulating rapidly, the recent trend in interpretation of MS/MS spectra has been database search. However, de novo MS/MS spectral interpretation remains an open problem typically involving manual interpretation by expert mass spectrometrists. We have developed a new algorithm, SHERENGA, for de novo interpretation that automatically learns fragment ion types and intensity thresholds from a collection of test spectra generated from any type of mass spectrometer. The test data are used to construct optimal path scoring in the graph representations of MS/MS spectra. A ranked list of high scoring paths corresponds to potential peptide sequences. SHERENGA is most useful for interpreting sequences of peptides resulting from unknown proteins and for validating the results of database search algorithms in fully automated, high-throughput peptide sequencing.

research in computational molecular biology | 2000

Mutation-tolerant protein identification by mass-spectrometry

Pavel A. Pevzner; Vlado Dančík; Chris L. Tang

Database search in tandem mass spectrometry is a powerful tool for protein identification. High-throughput spectral acquisition raises the problem of dealing with genetic variation and peptide modifications within a population of related proteins. A method that cross-correlates and clusters related spectra in large collections of uncharacterized spectra (i.e from normal and diseased individuals) would be extremely valuable in functional proteomics. This problem is far from being simple since very similar peptides may have very different spectra. We introduce a new notion of spectral similarity that allows one to identify related spectra even if the corresponding peptides have multiple modifications/mutations. Based on this notion we developed a new algorithm for mutation-tolerant database search as well as a method for cross-correlating related uncharacterized spectra. The paper describes this new approach and its applications in functional proteomics.

Journal of Computational Biology | 1997

Local rules for protein folding on a triangular lattice and generalized hydrophobicity in the HP model.

Richa Agarwala; Serafim Batzoglou; Vlado Dančík; Scott E. Decatur; Sridhar Hannenhalli; Martin Farach; S. Muthukrishnan; Steven Skiena

We consider the problem of determining the three-dimensional folding of a protein given its one-dimensional amino acid sequence. We use the HP model for protein folding proposed by Dill (1985), which models protein as a chain of amino acid residues that are either hydrophobic or polar, and hydrophobic interactions are the dominant initial driving force for the protein folding. Hart and Istrail (1996a) gave approximation algorithms for folding proteins on the cubic lattice under the HP model. In this paper, we examine the choice of a lattice by considering its algorithmic and geometric implications and argue that the triangular lattice is a more reasonable choice. We present a set of folding rules for a triangular lattice and analyze the approximation ratio they achieve. In addition, we introduce a generalization of the HP model to account for residues having different levels of hydrophobicity. After describing the biological foundation for this generalization, we show that in the new model we are able to achieve similar constant factor approximation guarantees on the triangular lattice as were achieved in the standard HP model. While the structures derived from our folding rules are probably still far from biological reality, we hope that having a set of folding rules with different properties will yield more interesting folds when combined.

Journal of Computational Biology | 2000

Mutation-tolerant protein identification by mass spectrometry.

Pavel A. Pevzner; Vlado Dančík; Chris L. Tang

Database search in tandem mass spectrometry is a powerful tool for protein identification. High-throughput spectral acquisition raises the problem of dealing with genetic variation and peptide modifications within a population of related proteins. A method that cross-correlates and clusters related spectra in large collections of uncharacterized spectra (i.e., from normal and diseased individuals) would be very valuable in functional proteomics. This problem is far from being simple since very similar peptides may have very different spectra. We introduce a new notion of spectral similarity that allows one to identify related spectra even if the corresponding peptides have multiple modifications/mutations. Based on this notion, we developed a new algorithm for mutation-tolerant database search as well as a method for cross-correlating related uncharacterized spectra.

research in computational molecular biology | 1998

Estimation for restriction sites observed by optical mapping using reversible-jump Markov chain Monte Carlo

Jae Kyu Lee; Vlado Dančík; Michael S. Waterman

A fundamentally new molecular-biology approach in constructing restriction maps, Optical Mapping, has been developed by Schwartz et al. (1993). Using this method restriction maps are constructed by measuring the relevant fluorescence intensity and length measurements. However, it is difficult to directly estimate the restriction site locations of single DNA molecules based on these optical mapping data because of the precision of length measurements and the unknown number of true restriction sites in the data. We propose the use of a hierarchical Bayes model based on a mixture model with normals and random noise. In this model we explicitly consider the missing observation structure of the data, such as the orientations of molecules, the allocations of cutting sites to restriction sites, and the indicator variables of whether observed cut sites are true or false. Because of the complexity of the model, the large number of missing data, and the unknown number of restriction sites, we use Reversible-Jump Markov Chain Monte Carlo (MCMC) to estimate the number and the locations of the restriction sites. Since there exists a high multimodality due to unknown orientations of molecules, we also use a combination of our MCMC approach and the flipping algorithm suggested by Dancík and Waterman (1997). The study is highly computer-intensive and the development of an efficient algorithm is required.

research in computational molecular biology | 1997

Local rules for protein folding on a triangular lattice and generalized hydrophobicity in the HP model

Richa Agarwala; Serafim Batzoglou; Vlado Dančík; Scott E. Decatur; Martin Farach; Sridhar Hannenhalli; S. Muthukrishnan; Steven Skiena

Journal of Computational Biology | 1998

Estimation for restriction sites observed by optical mapping using reversible-jump Markov Chain Monte Carlo.

Jae Kyu Lee; Vlado Dančík; Michael S. Waterman

A fundamentally new molecular-biology approach in constructing restriction maps, Optical Mapping, has been developed by Schwartz et al. (1993). Using this method restriction maps are constructed by measuring the relevant fluorescence intensity and length measurements. However, it is difficult to directly estimate the restriction site locations of single DNA molecules based on these optical mapping data because of the precision of length measurements and the unknown number of true restriction sites in the data. We propose the use of a hierarchical Bayes model based on a mixture model with normals and random noise. In this model we explicitly consider the missing observation structure of the data, such as the orientations of molecules, the allocations of cutting sites to restriction sites, and the indicator variables of whether observed cut sites are true or false. Because of the complexity of the model, the large number of missing data, and the unknown number of restriction sites, we use Reversible-Jump Markov Chain Monte Carlo (MCMC) to estimate the number and the locations of the restriction sites. Since there exists a high multimodality due to unknown orientations of molecules, we also use a combination of our MCMC approach and the flipping algorithm suggested by Dancik and Waterman (1997). The study is highly computer-intensive and the development of an efficient algorithm is required.

Information Processing Letters | 1996

Complexity of Boolean functions over bases with unbounded fan-in gates

Vlado Dančík

Let Ω be the basis consisting of a negation and logical “and” and “or” operations over any number of inputs. Every Boolean function of n variables can be realised by a Boolean circuit over Ω using at most 2.122 · 2n2 + n + 1 gates (2 · 2n2 + n + 1 for even n). We also show that almost all Boolean functions have circuit complexity at least 1.914 · 2n2 − 4n.

Journal of Computational Biology | 2005