Patrick Argos
Purdue University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Patrick Argos.
Biochimica et Biophysica Acta | 1986
J.K. Mohana Rao; Patrick Argos
Assignments were made for helical regions in several integral membrane proteins using an algorithm devised to delineate the transmembrane helices in bacteriorhodopsin (Eur. J. Biochem. 182 (1982) 565-575). A new conformational preference parameter for membrane-buried helices was obtained. The use of this parameter to predict helices in membrane proteins is discussed. When applied to the L and M subunits of Rhodopseudomonas sphaeroides, five helices were predicted, which is consistent with the three-dimensional X-ray crystal structure. Data on signal sequences and amino acid exchanges in membrane proteins are also analysed and discussed
Journal of Computational Chemistry | 1995
Frank Eisenhaber; Philip Lijnzaad; Patrick Argos; Chris Sander; Michael Scharf
The double cubic lattice method (DCLM) is an accurate and rapid approach for computing numerically molecular surface areas (such as the solvent accessible or van der Waals surface) and the volume and compactness of molecular assemblies and for generating dot surfaces. The algorithm has no special memory requirements and can be easily implemented. The computation speed is extremely high, making interactive calculation of surfaces, volumes, and dot surfaces for systems of 1000 and more atoms possible on single‐processor workstations. The algorithm can be easily parallelized. The DCLM is an algorithmic variant of the approach proposed by Shrake and Rupley (J. Mol. Biol., 79, 351–371, 1973). However, the application of two cubic lattices—one for grouping neighboring atomic centers and the other for grouping neighboring surface dots of an atom—results in a drastic reduction of central processing unit (CPU) time consumption by avoiding redundant distance checks. This is most noticeable for compact conformations. For instance, the calculation of the solvent accessible surface area of the crystal conformation of bovine pancreatic trypsin inhibitor (entry 4PTI of the Brookhaven Protein Data Bank, 362‐point sphere for all 454 nonhydrogen atoms) takes less than 1 second (on a single R3000 processor of an SGI 4D/480, about 5 MFLOP). The DCLM does not depend on the spherical point distribution applied. The quality of unit sphere tesselations is discussed. We propose new ways of subdivision based on the icosahedron and dodecahedron, which achieve constantly low ratios of longest to shortest arcs over the whole frequency range. The DCLM is the method of choice, especially for large molecular complexes and high point densities. Its speed has been compared to the fastest techniques known to the authors, and it was found to be superior, especially when also taking into account the small memory requirement and the flexibility of the algorithm. The program text may be obtained on request.
Methods in Enzymology | 1996
Thure Etzold; Anatoly Ulyanov; Patrick Argos
Publisher Summary This chapter presents a retrieval system called “Sequence Retrieval System (SRS)” that acts on data banks in a flat file or text format. It provides a homogeneous interface to about 80 biological databanks for accessing and querying their contents and for navigating among them. SRS is an integrated system that provides a homogeneous interface to all flat file data banks retained in their original format. It is a retrieval system that allows access to, but not the depositing of, data. Several elements are combined into a system that extends the power of normal retrieval systems and that rivals that of real databases, such as a relational system, without compromising speed. These elements include languages for data bank and syntax definition, a programmable parser, an indexing system, support for subentries, a novel system for exploiting links among data banks, and a query language. The database linking is a unique feature that considerably extends the capability of hypertext links.
European Biophysics Journal | 1983
Paul A. Hargrave; McDowell Jh; Donna R. Curtis; Janet K. Wang; Elizabeth Juszczak; Shao-Ling Fong; J.K. Mohana Rao; Patrick Argos
We have isolated 16 peptides from a cyanogen bromide digest of rhodopsin. These cyanogen bromide peptides account for the complete composition of the protein. Methionine-containing peptides from other chemical and enzymatic digests of rhodopsin have allowed us to place the cyanogen bromide peptides in order, yielding the sequence of the protein. We have completed the sequence of most of the cyanogen bromide peptides. This information, in conjunction with that from other laboratories, forms the basis for our prediction of the secondary structure of the protein and how it may be arranged in the disk membrane.
Proteins | 1997
Dmitrij Frishman; Patrick Argos
In this study we present an accurate secondary structure prediction procedure by using a query and related sequences. The most novel aspect of our approach is its reliance on local pairwise alignment of the sequence to be predicted with each related sequence rather than utilization of a multiple alignment. The residue‐by‐residue accuracy of the method is 75% in three structural states after jack‐knife tests. The gain in prediction accuracy compared with the existing techniques, which are at best 72%, is achieved by secondary structure propensities based on both local and long‐range effects, utilization of similar sequence information in the form of carefully selected pairwise alignment fragments, and reliance on a large collection of known protein primary structures. The method is especially appropriate for large‐scale sequence analysis efforts such as genome characterization, where precise and significant multiple sequence alignments are not available or achievable. Proteins 27:329–335, 1997.
FEBS Letters | 1985
Raymond S. Brown; Christian Sander; Patrick Argos
Analysis of the amino acid sequence of transcription factor TFIIIA from Xenopuslaevis reveals the presence of 12 repeating structures, each about 30 residues in length. These segments have been aligned and their secondary structure predicted. The repeats each contain two invariant cysteines and two invariant histidines, perhaps to coordinate a zinc cation. Possible nucleic acid interaction modes are discussed.
Bioinformatics | 1993
Thure Etzold; Patrick Argos
SRS (Sequence Retrieval System) is an information indexing and retrieval system designed for libraries with a flat file format such as the EMBL nucleotide sequence databank, the SwissProt protein sequence databank or the Prosite library of protein subsequence consensus patterns. SRS supports the data structure of these libraries by providing special indices for implementing lists of subentities (e.g. feature tables) or hierarchically structured data-fields (e.g. taxonomic classification). A language (ODD) has been designed for the convenient specification of library format and organization, representation of individual data-fields within the system (design of indices) and structuring other data needed during retrieval. This ensures flexibility required for coping with different library formats, which are subject to continuous change. Queries and inspection of retrieved entries can be performed from a user interface with pull-down menus and windows. SRS supports various input and output formats but is particularly well adapted to the GCG programs.
Journal of Molecular Biology | 1976
Michael G. Rossmann; Patrick Argos
Abstract A method for systematically comparing the folding of the three-dimensional structures of proteins has been developed. A search function, plotted in terms of three Eulerian angles, represents the number of sequentially equivalenced amino acids. For each orientation one protein structure is rotated about its center of mass with respect to the other and probabilities are calculated which estimate the degree of structural parallelism. The structurally equivalent residues with highest probabilities are then selected for the best common topology. It was observed that, when structures containing about 150 residues were compared, the random background had a mean value of around 14 residues and the standard deviation was approximately nine residues. The method has been shown to be successful in determining the similarity of the NAD binding domains of lactate dehydrogenase and glyceraldehyde-3-phosphate dehydrogenase, and in comparing the heme binding fold of cytochrome b 5 with the globins. Application of the method to compare hen egg white lysozyme and T4 phage lysozyme led to a single significant peak of 62 residues. The structural homology indicated by this peak showed that the substrate, as bound to hen egg white lysozyme, has a corresponding binding site in the large cleft of the phage lysozyme. The predicted binding site of N -acetyl glucosamine at position C compares well with an N -acetyl glucosamine center observed to bind to crystalline phage lysozyme (B. W. Matthews, personal communication). Some results for the comparison of the two Fe-S cage binding domains of ferredoxin are also presented.
Journal of Molecular Biology | 1992
Stefano Pascarella; Patrick Argos
An analysis of insertions and deletions (indels) occurring in a databank of multiple sequence alignments based on protein tertiary structure is reported. Indels prefer to be short (1 to 5 residues). The average intervening sequence length between them versus the percentage of residue identity in pairwise alignments shows an exponential behaviour, suggesting a stochastic process such that nearly every loop in an ancestral structure is a possible target for indels during evolution. The results also suggest a limit to the average size of indels accommodated by protein structures. The preferred indel conformations are reverse turn and coil as are the preferred conformations at the indel edges (N- and C-terminal sides). Interruptions in helices and strands were observed as very rare events.
Journal of Molecular Biology | 1990
Patrick Argos
Fifty-one examples of oligopeptides linking protein domains were extracted from the Brookhaven database of three-dimensional protein structures. In general, the peptides displayed specific characteristics in composition, conformation, hydrogen bonding, flexibility and the like. The entire database was then searched for pentapeptides that would optimize these natural linker properties. The oligopeptides found are suggested as general candidates to link protein molecules or domains through gene fusion.