Michel Eduardo Beleza Yamagishi

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Michel Eduardo Beleza Yamagishi is active.

Explore More

Publication

Featured researches published by Michel Eduardo Beleza Yamagishi.

Nucleic Acids Research | 2003

STING Millennium: a web-based suite of programs for comprehensive and simultaneous analysis of protein structure and sequence

Goran Neshich; Roberto C. Togawa; Adauto L. Mancini; Paula R. Kuser; Michel Eduardo Beleza Yamagishi; Georgios Pappas; Wellington V. Torres; Tharsis Fonseca e Campos; Leonardo L. Ferreira; Fabio M. Luna; Adilton G. Oliveira; Ronald T. Miura; Marcus K. Inoue; Luiz G. Horita; Dimas F. de Souza; Fabiana Dominiquini; Alexandre Alvaro; Cleber S. Lima; Fabio O. Ogawa; Gabriel B. Gomes; Juliana F. Palandrani; Gabriela F. dos Santos; Esther M. de Freitas; Amanda R. Mattiuz; Ivan C. Costa; Celso L. de Almeida; Savio Souza; Christian Baudet; Roberto H. Higa

STING Millennium Suite (SMS) is a new web-based suite of programs and databases providing visualization and a complex analysis of molecular sequence and structure for the data deposited at the Protein Data Bank (PDB). SMS operates with a collection of both publicly available data (PDB, HSSP, Prosite) and its own data (contacts, interface contacts, surface accessibility). Biologists find SMS useful because it provides a variety of algorithms and validated data, wrapped-up in a user friendly web interface. Using SMS it is now possible to analyze sequence to structure relationships, the quality of the structure, nature and volume of atomic contacts of intra and inter chain type, relative conservation of amino acids at the specific sequence position based on multiple sequence alignment, indications of folding essential residue (FER) based on the relationship of the residue conservation to the intra-chain contacts and Calpha-Calpha and Cbeta-Cbeta distance geometry. Specific emphasis in SMS is given to interface forming residues (IFR)-amino acids that define the interactive portion of the protein surfaces. SMS may simultaneously display and analyze previously superimposed structures. PDB updates trigger SMS updates in a synchronized fashion. SMS is freely accessible for public data at http://www.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS and http://trantor.bioc.columbia.edu/SMS.

Nucleic Acids Research | 2005

The Diamond STING server

Goran Neshich; Luiz Borro; Roberto H. Higa; Paula R. Kuser; Michel Eduardo Beleza Yamagishi; Eduardo H. Franco; João N. Krauchenco; Renato Fileto; André A. Ribeiro; George B. P. Bezerra; Thiago M. Velludo; Tomás S. Jimenez; Noboru Furukawa; Hirofumi Teshima; Koji Kitajima; K. Abdulla Bava; Akinori Sarai; Roberto C. Togawa; Adauto L. Mancini

Diamond STING is a new version of the STING suite of programs for a comprehensive analysis of a relationship between protein sequence, structure, function and stability. We have added a number of new functionalities by both providing more structure parameters to the STING Database and by improving/expanding the interface for enhanced data handling. The integration among the STING components has also been improved. A new key feature is the ability of the STING server to handle local files containing protein structures (either modeled or not yet deposited to the Protein Data Bank) so that they can be used by the principal STING components: JavaProtein Dossier (JPD) and STING Report. The current capabilities of the new STING version and a couple of biologically relevant applications are described here. We have provided an example where Diamond STING identifies the active site amino acids and folding essential amino acids (both previously determined by experiments) by filtering out all but those residues by selecting the numerical values/ranges for a set of corresponding parameters. This is the fundamental step toward a more interesting endeavor—the prediction of such residues. Diamond STING is freely accessible at and .

Bioinformatics | 2004

STING Contacts: a web-based application for identification and analysis of amino acid contacts within protein structure and across protein interfaces

Adauto L. Mancini; Roberto H. Higa; Adilton G. Oliveira; Fabiana Dominiquini; Paula R. Kuser; Michel Eduardo Beleza Yamagishi; Roberto C. Togawa; Goran Neshich

UNLABELLED Amino acid contacts in terms of atomic interactions are essential factors to be considered in the analysis of the structure of a protein and its complexes. Consequently, molecular biologists do require specific tools for the identification and visualization of all such contacts. Graphical contacts (GC) and interface forming residue graphical contacts (IFRgc) presented here, calculate atomic contacts among amino acids based on a table of predefined pairs of the atom types and their distances, and then display them using number of different forms. The inventory of currently listed contact types by GC and IFRgc include hydrogen bonds (in nine different flavors), hydrophobic interactions, charge-charge interactions, aromatic stacking and disulfide bonds. Such extensive catalog of the interactions, representing the forces that govern protein folding, stability and binding, is the key feature of these two applications. GC and IFRgc are part of STING Millennium Suite. AVAILABILITY http://sms.cbi.cnptia.embrapa.br/SMS, http://trantor.bioc.columbia.edu/SMS, http://mirrors.rcsb.org//SMS, http://www.es.embnet.org/SMS and http://www.ar.embnet.org/SMS (Options: Graphical Contacts and IFR Graphical Contacts).

Nucleic Acids Research | 2004

STING Report: convenient web-based application for graphic and tabular presentations of protein sequence, structure and function descriptors from the STING database

Goran Neshich; Adauto L. Mancini; Michel Eduardo Beleza Yamagishi; Paula R. Kuser; Renato Fileto; Ivan P. Pinto; Juliana F. Palandrani; João N. Krauchenco; Christian Baudet; Arnaldo J. Montagner; Roberto H. Higa

The Sting Report is a versatile web-based application for extraction and presentation of detailed information about any individual amino acid of a protein structure stored in the STING Database. The extracted information is presented as a series of GIF images and tables, containing the values of up to 125 sequence/structure/function descriptors/parameters. The GIF images are generated by the Gold STING modules. The HTML page resulting from the STING Report query can be printed and, most importantly, it can be composed and visualized on a computer platform with an elementary configuration. Using the STING Report, a user can generate a collection of customized reports for amino acids of specific interest. Such a collection comes as an ideal match for a demand for the rapid and detailed consultation and documentation of data about structure/function. The inclusion of information generated with STING Report in a research report or even a textbook, allows for the increased density of its contents. STING Report is freely accessible within the Gold STING Suite at http://www.cbi.cnptia.embrapa.br, http://www.es.embnet.org/SMS/, http://gibk26.bse.kyutech.ac.jp/SMS/ and http://trantor.bioc.columbia.edu/SMS (option: STING Report).

BMC Bioinformatics | 2004

STING Millennium Suite: integrated software for extensive analyses of 3d structures of proteins and their complexes.

Roberto H. Higa; Roberto C. Togawa; Arnaldo J. Montagner; Juliana F. Palandrani; Igor K. S. Okimoto; Paula R. Kuser; Michel Eduardo Beleza Yamagishi; Adauto L. Mancini; Goran Neshich

BackgroundThe integration of many aspects of protein/DNA structure analysis is an important requirement for software products in general area of structural bioinformatics. In fact, there are too few software packages on the internet which can be described as successful in this respect. We might say that what is still missing is publicly available, web based software for interactive analysis of the sequence/structure/function of proteins and their complexes with DNA and ligands. Some of existing software packages do have certain level of integration and do offer analysis of several structure related parameters, however not to the extent generally demanded by a user.ResultsWe are reporting here about new Sting Millennium Suite (SMS) version which is fully accessible (including for local files at client end), web based software for molecular structure and sequence/structure/function analysis. The new SMS client version is now operational also on Linux boxes and it works with non-public pdb formatted files (structures not deposited at the RCSB/PDB), eliminating earlier requirement for the registration if SMS components were to be used with users local files. At the same time the new SMS offers some important additions and improvements such as link to ProTherm as well as significant re-engineering of SMS component ConSSeq. Also, we have added 3 new SMS mirror sites to existing network of global SMS servers: Argentina, Japan and Spain.ConclusionSMS is already established software package and many key data base and software servers worldwide, do offer either a link to, or host the SMS. SMS (S ting M illennium S uite) is web-based publicly available software developed to aid researches in their quest for translating information about the structures of macromolecules into knowledge. SMS allows to a user to interactively analyze molecular structures, cross-referencing visualized information with a correlated one, available across the internet. SMS is already used as a didactic tool by some universities. SMS analysis is now possible on Linux OS boxes and with no requirement for registration when using local files.

PLOS ONE | 2012

Is a Genome a Codeword of an Error-Correcting Code?

L.C.B. Faria; A.S.L. Rocha; João H. Kleinschmidt; Marcio C. Silva-Filho; Edson Bim; Roberto H. Herai; Michel Eduardo Beleza Yamagishi; Reginaldo Palazzo

Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.

Bioinformatics | 2004

ConSSeq: a web-based application for analysis of amino acid conservation based on HSSP database and within context of structure

Roberto H. Higa; Arnaldo J. Montagner; Roberto C. Togawa; Paula R. Kuser; Michel Eduardo Beleza Yamagishi; Adauto L. Mancini; Georgios J Pappas; Ronald T. Miura; Luiz G. Horita; Goran Neshich

SUMMARY A web-based application to analyze protein amino acids conservation-Consensus Sequence (ConSSeq) is presented. ConSSeq graphically represents information about amino acid conservation based on sequence alignments reported in homology-derived structures of proteins. Beyond the relative entropy for each position in the alignment, ConSSeq also presents the consensus sequence and information about the amino acids, which are predominant at each position of the alignment. ConSSeq is part of the STING Millennium Suite and is implemented as a Java Applet. AVAILABILITY http://sms.cbi.cnptia.embrapa.br/SMS/STINGm/consseq/, http://trantor.bioc.columbia.edu/SMS/STINGm/consseq/, http://mirrors.rcsb.org//SMS/STINGm/consseq/, http://www.es.embnet.org/SMS/STINGm/consseq/ and http://www.ar.embnet.org/SMS/STINGm/consseq/

Computational Biology and Chemistry | 2012

Research article: Relationship between global structural parameters and Enzyme Commission hierarchy: Implications for function prediction

Marcelo Boareto; Michel Eduardo Beleza Yamagishi; Nestor Caticha; Vitor Barbanti Pereira Leite

In protein databases there is a substantial number of proteins structurally determined but without function annotation. Understanding the relationship between function and structure can be useful to predict function on a large scale. We have analyzed the similarities in global physicochemical parameters for a set of enzymes which were classified according to the four Enzyme Commission (EC) hierarchical levels. Using relevance theory we introduced a distance between proteins in the space of physicochemical characteristics. This was done by minimizing a cost function of the metric tensor built to reflect the EC classification system. Using an unsupervised clustering method on a set of 1025 enzymes, we obtained no relevant clustering formation compatible with EC classification. The distance distributions between enzymes from the same EC group and from different EC groups were compared by histograms. Such analysis was also performed using sequence alignment similarity as a distance. Our results suggest that global structure parameters are not sufficient to segregate enzymes according to EC hierarchy. This indicates that features essential for function are rather local than global. Consequently, methods for predicting function based on global attributes should not obtain high accuracy in main EC classes prediction without relying on similarities between enzymes from training and validation datasets. Furthermore, these results are consistent with a substantial number of studies suggesting that function evolves fundamentally by recruitment, i.e., a same protein motif or fold can be used to perform different enzymatic functions and a few specific amino acids (AAs) are actually responsible for enzyme activity. These essential amino acids should belong to active sites and an effective method for predicting function should be able to recognize them.

PLOS ONE | 2017

Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds.

N. B. Stafuzza; Adhemar Zerlotini; Francisco Pereira Lobo; Michel Eduardo Beleza Yamagishi; Tatiane Cristina Seleguim Chud; Alexandre Rodrigues Caetano; Danísio Prado Munari; Dorian J. Garrick; Marco Antonio Machado; Marta Fonseca Martins; M.A.R. Carvalho; J.B. Cole; M. V. G. B. Silva

Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.

Bioinformatics | 2004

Defining 3D residue environment in protein structures using SCORPION and FORMIGA

Roberto H. Higa; Adilton G. Oliveira; Luiz G. Horita; Ronald T. Miura; Marcus K. Inoue; Paula R. Kuser; Adauto L. Mancini; Michel Eduardo Beleza Yamagishi; Roberto C. Togawa; Goran Neshich

SUMMARY Two web-based applications to analyze amino acids three-dimensional (3D) local environment within protein structures-SCORPION and FORMIGA-are presented. SCORPION and FORMIGA produce a graphical presentation for simple statistical data showing the frequency of residue occurrence within a given sphere (defined here as the 3D contacts). The center of that sphere is placed at the Calpha and at the last heavy atom in the side chain of the selected amino acid. Further depth of detail is given in terms of a secondary structure to which the profiled amino acid belongs. Results obtained with those two applications are relevant for estimating the importance of the amino acid 3D local environment for protein folding and stability. Effectively, SCORPION and FORMIGA construct knowledge-based force fields. The difference between SCORPION and FORMIGA is in that the latter operates on protein interfaces, while the former only functions for a single protein chain. Both applications are implemented as stand-alone components of STING Millennium Suite. AVAILABILITY http://sms.cbi.cnptia.embrapa.br/SMS, http://trantor.bioc.columbia.edu/SMS, http://mirrors.rcsb.org/SMS, http://www.es.embnet.org/SMS and http://www.ar.embnet.org/SMS. [options: Scorpion, Formiga]

Explore More