Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Mitsuteru Nakao is active.

Publication


Featured researches published by Mitsuteru Nakao.


DNA Research | 2008

Genome Structure of the Legume, Lotus japonicus

Shusei Sato; Yasukazu Nakamura; Takakazu Kaneko; Erika Asamizu; Tomohiko Kato; Mitsuteru Nakao; Shigemi Sasamoto; Akiko Watanabe; Akiko Ono; Kumiko Kawashima; Tsunakazu Fujishiro; Midori Katoh; Mitsuyo Kohara; Yoshie Kishida; Chiharu Minami; Shinobu Nakayama; Naomi Nakazaki; Yoshimi Shimizu; Sayaka Shinpo; Chika Takahashi; Tsuyuko Wada; Manabu Yamada; Nobuko Ohmido; Makoto Hayashi; Kiichi Fukui; Tomoya Baba; Tomoko Nakamichi; Hirotada Mori; Satoshi Tabata

The legume Lotus japonicus has been widely used as a model system to investigate the genetic background of legume-specific phenomena such as symbiotic nitrogen fixation. Here, we report structural features of the L. japonicus genome. The 315.1-Mb sequences determined in this and previous studies correspond to 67% of the genome (472 Mb), and are likely to cover 91.3% of the gene space. Linkage mapping anchored 130-Mb sequences onto the six linkage groups. A total of 10 951 complete and 19 848 partial structures of protein-encoding genes were assigned to the genome. Comparative analysis of these genes revealed the expansion of several functional domains and gene families that are characteristic of L. japonicus. Synteny analysis detected traces of whole-genome duplication and the presence of synteny blocks with other plant genomes to various degrees. This study provides the first opportunity to look into the complex and unique genetic system of legumes.


Nucleic Acids Research | 2004

DBTBS: database of transcriptional regulation in Bacillus subtilis and its contribution to comparative genomics

Yuko Makita; Mitsuteru Nakao; Naotake Ogasawara; Kenta Nakai

DBTBS (http://dbtbs.hgc.jp) was originally released in 1999 as a reference database of published transcriptional regulation events in Bacillus subtilis, one of the best studied bacteria. It is essentially a compilation of transcription factors with their regulated genes as well as their recognition sequences, which were experimentally characterized and reported in the literature. Here we report its major update, which contains information on 114 transcription factors, including sigma factors, and 633 promoters of 525 genes. The number of references cited in the database has increased from 291 to 378. It also supports a function to find putative transcription factor binding sites within input sequences by using our collection of weight matrices and consensus patterns. Furthermore, though preliminarily, DBTBS now aims to contribute to comparative genomics by showing the presence or absence of potentially orthologous transcription factors and their corresponding cis-elements on the promoters of their potentially orthologously regulated genes in 50 eubacterial genomes.


Bioinformatics | 2010

BioRuby: bioinformatics software for the Ruby programming language

Naohisa Goto; Pjotr Prins; Mitsuteru Nakao; Raoul J. P. Bonnal; Jan Aerts; Toshiaki Katayama

Summary: The BioRuby software toolkit contains a comprehensive set of free development tools and libraries for bioinformatics and molecular biology, written in the Ruby programming language. BioRuby has components for sequence analysis, pathway analysis, protein modelling and phylogenetic analysis; it supports many widely used data formats and provides easy access to databases, external programs and public web services, including BLAST, KEGG, GenBank, MEDLINE and GO. BioRuby comes with a tutorial, documentation and an interactive environment, which can be used in the shell, and in the web browser. Availability: BioRuby is free and open source software, made available under the Ruby license. BioRuby runs on all platforms that support Ruby, including Linux, Mac OS X and Windows. And, with JRuby, BioRuby runs on the Java Virtual Machine. The source code is available from http://www.bioruby.org/. Contact: [email protected]


Nucleic Acids Research | 2010

CyanoBase: the cyanobacteria genome database update 2010

Mitsuteru Nakao; Shinobu Okamoto; Mitsuyo Kohara; Tsunakazu Fujishiro; Takatomo Fujisawa; Shusei Sato; Satoshi Tabata; Takakazu Kaneko; Yasukazu Nakamura

CyanoBase (http://genome.kazusa.or.jp/cyanobase) is the genome database for cyanobacteria, which are model organisms for photosynthesis. The database houses cyanobacteria species information, complete genome sequences, genome-scale experiment data, gene information, gene annotations and mutant information. In this version, we updated these datasets and improved the navigation and the visual display of the data views. In addition, a web service API now enables users to retrieve the data in various formats with other tools, seamlessly.


Bioinformatics | 1999

Modeling and predicting transcriptional units of Escherichia coli genes using hidden Markov models

Tetsushi Yada; Mitsuteru Nakao; Yasushi Totoki; Kenta Nakai

Motivation: The hidden Markov model (HMM) is a valuable technique for gene-finding, especially because its flexibility enables the inclusion of various sequence features. Recent programs for bacterial gene-finding include the information of ribosomal binding site (RBS) to improve the recognition accuracy of the start codon, using this feature. We report here our attempt to extend the model into the total transcriptional unit, enabling the prediction of operon structures. Results: First, we improved the prediction accuracy of coding sequences (CDSs) by employing the models of ‘typical’, ‘atypical’ and ‘negative (false-positive)’ classes as well as the models of RBS and its downstream spacer. The sensitivity of exactly predicting the 204 experimentally confirmed CDSs reached 90.2% in an objective test. Based on the prediction result of CDSs, the positions of the promoters and terminators were predicted. Our model could exactly recognize 60% of 390 known transcriptional units. Thus, the accuracy and significance of this prediction problem is far from trivial. We would like to propose this problem as an open theme in bioinformatics because the ongoing or planned postsequencing projects will produce much data for future improvements. Availability: The table of predicted transcriptional units of Escherichia coli will be distributed upon request.


Nucleic Acids Research | 2014

CyanoBase and RhizoBase: databases of manually curated annotations for cyanobacterial and rhizobial genomes

Takatomo Fujisawa; Shinobu Okamoto; Toshiaki Katayama; Mitsuteru Nakao; Hidehisa Yoshimura; Hiromi Kajiya-Kanegae; Sumiko Yamamoto; Chiyoko Yano; Yuka Yanaka; Hiroko Maita; Takakazu Kaneko; Satoshi Tabata; Yasukazu Nakamura

To understand newly sequenced genomes of closely related species, comprehensively curated reference genome databases are becoming increasingly important. We have extended CyanoBase (http://genome.microbedb.jp/cyanobase), a genome database for cyanobacteria, and newly developed RhizoBase (http://genome.microbedb.jp/rhizobase), a genome database for rhizobia, nitrogen-fixing bacteria associated with leguminous plants. Both databases focus on the representation and reusability of reference genome annotations, which are continuously updated by manual curation. Domain experts have extracted names, products and functions of each gene reported in the literature. To ensure effectiveness of this procedure, we developed the TogoAnnotation system offering a web-based user interface and a uniform storage of annotations for the curators of the CyanoBase and RhizoBase databases. The number of references investigated for CyanoBase increased from 2260 in our previous report to 5285, and for RhizoBase, we perused 1216 references. The results of these intensive annotations are displayed on the GeneView pages of each database. Advanced users can also retrieve this information through the representational state transfer-based web application programming interface in an automated manner.


Nucleic Acids Research | 2006

Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56,419 completely sequenced and manually annotated full-length cDNAs.

Jun-ichi Takeda; Yutaka Suzuki; Mitsuteru Nakao; Roberto A. Barrero; Kanako O. Koyanagi; Lihua Jin; Chie Motono; Hiroko Hata; Takao Isogai; Keiichi Nagai; Tetsuji Otsuki; Vladimir Kuryshev; Masafumi Shionyu; Kei Yura; Mitiko Go; Jean Thierry-Mieg; Danielle Thierry-Mieg; Stefan Wiemann; Nobuo Nomura; Sumio Sugano; Takashi Gojobori; Tadashi Imanishi

We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants.


Nucleic Acids Research | 2007

H-DBAS: Alternative splicing database of completely sequenced and manually annotated full-length cDNAs based on H-Invitational

Jun-ichi Takeda; Yutaka Suzuki; Mitsuteru Nakao; Tsuyoshi Kuroda; Sumio Sugano; Takashi Gojobori; Tadashi Imanishi

The Human-transcriptome DataBase for Alternative Splicing (H-DBAS) is a specialized database of alternatively spliced human transcripts. In this database, each of the alternative splicing (AS) variants corresponds to a completely sequenced and carefully annotated human full-length cDNA, one of those collected for the H-Invitational human-transcriptome annotation meeting. H-DBAS contains 38 664 representative alternative splicing variants (RASVs) in 11 744 loci, in total. The data is retrievable by various features of AS, which were annotated according to manual annotations, such as by patterns of ASs, consequently invoked alternations in the encoded amino acids and affected protein motifs, GO terms, predicted subcellular localization signals and transmembrane domains. The database also records recently identified very complex patterns of AS, in which two distinct genes seemed to be bridged, nested or degenerated (multiple CDS): in all three cases, completely unrelated proteins are encoded by a single locus. By using AS Viewer, each AS event can be analyzed in the context of full-length cDNAs, enabling the users empirical understanding of the relation between AS event and the consequent alternations in the encoded amino acid sequences together with various kinds of affected protein motifs. H-DBAS is accessible at .


Journal of Biomedical Semantics | 2014

BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains

Toshiaki Katayama; Mark D. Wilkinson; Kiyoko F. Aoki-Kinoshita; Shuichi Kawashima; Yasunori Yamamoto; Atsuko Yamaguchi; Shinobu Okamoto; Shin Kawano; Jin Dong Kim; Yue Wang; Hongyan Wu; Yoshinobu Kano; Hiromasa Ono; Hidemasa Bono; Simon Kocbek; Jan Aerts; Yukie Akune; Erick Antezana; Kazuharu Arakawa; Bruno Aranda; Joachim Baran; Jerven T. Bolleman; Raoul J. P. Bonnal; Pier Luigi Buttigieg; Matthew Campbell; Yi An Chen; Hirokazu Chiba; Peter J. A. Cock; K. Bretonnel Cohen; Alexandru Constantin

The application of semantic technologies to the integration of biological data and the interoperability of bioinformatics analysis and visualization tools has been the common theme of a series of annual BioHackathons hosted in Japan for the past five years. Here we provide a review of the activities and outcomes from the BioHackathons held in 2011 in Kyoto and 2012 in Toyama. In order to efficiently implement semantic technologies in the life sciences, participants formed various sub-groups and worked on the following topics: Resource Description Framework (RDF) models for specific domains, text mining of the literature, ontology development, essential metadata for biological databases, platforms to enable efficient Semantic Web technology development and interoperability, and the development of applications for Semantic Web data. In this review, we briefly introduce the themes covered by these sub-groups. The observations made, conclusions drawn, and software development projects that emerged from these activities are discussed.


Nucleic Acids Research | 2005

Large-scale analysis of human alternative protein isoforms: pattern classification and correlation with subcellular localization signals

Mitsuteru Nakao; Roberto A. Barrero; Yuri Mukai; Chie Motono; Makiko Suwa; Kenta Nakai

We investigated human alternative protein isoforms of >2600 genes based on full-length cDNA clones and SwissProt. We classified the isoforms and examined their co-occurrence for each gene. Further, we investigated potential relationships between these changes and differential subcellular localization. The two most abundant patterns were the one with different C-terminal regions and the one with an internal insertion, which together account for 43% of the total. Although changes of the N-terminal region are less common than those of the C-terminal region, extension of the C-terminal region is much less common than that of the N-terminal region, probably because of the difficulty of removing stop codons in one isoform. We also found that there are some frequently used combinations of co-occurrence in alternative isoforms. We interpret this as evidence that there is some structural relationship which produces a repertoire of isoformal patterns. Finally, many terminal changes are predicted to cause differential subcellular localization, especially in targeting either peroxisomes or mitochondria. Our study sheds new light on the enrichment of the human proteome through alternative splicing and related events. Our database of alternative protein isoforms is available through the internet.

Collaboration


Dive into the Mitsuteru Nakao's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jan Aerts

Katholieke Universiteit Leuven

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge