Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Takashi Gojobori is active.

Publication


Featured researches published by Takashi Gojobori.


Nucleic Acids Research | 2000

Codon usage tabulated from international DNA sequence databases: status for the year 2000

Yasukazu Nakamura; Takashi Gojobori; Toshimichi Ikemura

The frequencies of each of the 257 468 complete protein coding sequences (CDSs) have been compiled from the taxonomical divisions of the GenBank DNA sequence database. The sum of the codons used by 8792 organisms has also been calculated. The data files can be obtained from the anonymous ftp sites of DDBJ, Kazusa and EBI. A list of the codon usage of genes and the sum of the codons used by each organism can be obtained through the web site http://www.kazusa.or.jp/codon/. The present study also reports recent developments on the WWW site. The new web interface provides data in the CodonFrequency-compatible format as well as in the traditional table format. The use of the database is facilitated by keyword based search analysis and the availability of codon usage tables for selected genes from each species. These new tools will provide users with the ability to further analyze for variations in codon usage among different genomes.


Nature | 2001

Functional annotation of a full-length mouse cDNA collection

Jun Kawai; Akira Shinagawa; Kazuhiro Shibata; Masataka Yoshino; Masayoshi Itoh; Yoshiyuki Ishii; Takahiro Arakawa; Ayako Hara; Yoshifumi Fukunishi; Hideaki Konno; Jun Adachi; Shiro Fukuda; Katsunori Aizawa; Masaki Izawa; Kenichiro Nishi; Hidenori Kiyosawa; Shinji Kondo; Itaru Yamanaka; Tsuyoshi Saito; Yasushi Okazaki; Takashi Gojobori; Hidemasa Bono; Takeya Kasukawa; R. Saito; Koji Kadota; Hideo Matsuda; Michael Ashburner; Serge Batalov; Tom L. Casavant; W. Fleischmann

The RIKEN Mouse Gene Encyclopaedia Project, a systematic approach to determining the full coding potential of the mouse genome, involves collection and sequencing of full-length complementary DNAs and physical mapping of the corresponding genes to the mouse genome. We organized an international functional annotation meeting (FANTOM) to annotate the first 21,076 cDNAs to be analysed in this project. Here we describe the first RIKEN clone collection, which is one of the largest described for any organism. Analysis of these cDNAs extends known gene families and identifies new ones.The RIKEN Mouse Gene Encyclopaedia Project, a systematic approach to determining the full coding potential of the mouse genome, involves collection and sequencing of full-length complementary DNAs and physical mapping of the corresponding genes to the mouse genome. We organized an international functional annotation meeting (FANTOM) to annotate the first 21,076 cDNAs to be analysed in this project. Here we describe the first RIKEN clone collection, which is one of the largest described for any organism. Analysis of these cDNAs extends known gene families and identifies new ones.


Nature | 2002

The genome sequence and structure of rice chromosome 1

Takuji Sasaki; Takashi Matsumoto; Kimiko Yamamoto; Katsumi Sakata; Tomoya Baba; Yuichi Katayose; Jianzhong Wu; Yoshihito Niimura; Zhukuan Cheng; Yoshiaki Nagamura; Baltazar A. Antonio; Hiroyuki Kanamori; Satomi Hosokawa; Masatoshi Masukawa; Koji Arikawa; Yoshino Chiden; Mika Hayashi; Masako Okamoto; Tsuyu Ando; Hiroyoshi Aoki; Kohei Arita; Masao Hamada; Chizuko Harada; Saori Hijishita; Mikiko Honda; Yoko Ichikawa; Atsuko Idonuma; Masumi Iijima; Michiko Ikeda; Maiko Ikeno

The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops. Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% (2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence indicates the importance of a high-quality finished sequence.


Nature | 2008

Big data: The future of biocuration.

Doug Howe; Maria Costanzo; Petra Fey; Takashi Gojobori; Linda Hannick; Winston Hide; David P. Hill; Renate Kania; Mary Schaeffer; Susan St Pierre; Simon N. Twigger; Owen R. White; Seung Y. Rhee

To thrive, the field that links biologists and their data urgently needs structure, recognition and support.


Nature | 2010

The dynamic genome of Hydra

Jarrod Chapman; Ewen F. Kirkness; Oleg Simakov; Steven E. Hampson; Therese Mitros; Therese Weinmaier; Thomas Rattei; Prakash G. Balasubramanian; Jon Borman; Dana Busam; Kathryn Disbennett; Cynthia Pfannkoch; Nadezhda Sumin; Granger Sutton; Lakshmi Viswanathan; Brian Walenz; David Goodstein; Uffe Hellsten; Takeshi Kawashima; Simon Prochnik; Nicholas H. Putnam; Shengquiang Shu; Bruce Blumberg; Catherine E. Dana; Lydia Gee; Dennis F. Kibler; Lee Law; Dirk Lindgens; Daniel E. Martínez; Jisong Peng

The freshwater cnidarian Hydra was first described in 1702 and has been the object of study for 300 years. Experimental studies of Hydra between 1736 and 1744 culminated in the discovery of asexual reproduction of an animal by budding, the first description of regeneration in an animal, and successful transplantation of tissue between animals. Today, Hydra is an important model for studies of axial patterning, stem cell biology and regeneration. Here we report the genome of Hydra magnipapillata and compare it to the genomes of the anthozoan Nematostella vectensis and other animals. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer, trans-splicing, and simplification of gene structure and gene content that parallel simplification of the Hydra life cycle. We also report the sequence of the genome of a novel bacterium stably associated with H. magnipapillata. Comparisons of the Hydra genome to the genomes of other animals shed light on the evolution of epithelia, contractile tissues, developmentally regulated transcription factors, the Spemann–Mangold organizer, pluripotency genes and the neuromuscular junction.


Archives of Virology | 1998

Classification, nomenclature, and database development for hepatitis C virus (HCV) and related viruses: proposals for standardization

Betty H. Robertson; G. Myers; C. Howard; T. Brettin; Jens Bukh; B. Gaschen; Takashi Gojobori; Geert Maertens; Masashi Mizokami; O. Nainan; S. Netesov; K. Nishioka; T. Shin-i; Peter Simmonds; Donald B. Smith; Lieven Stuyver; Amy J. Weiner

SummaryThis paper presents a summary of the recommendations that were formulated for the purposes of unifying the nomenclature for hepatitis C virus (HCV), based upon guidelines of the International Committee on Virus Taxonomy (ICTV), and provides guidelines of the incorporation of sequence data into an HCV database that will be available to researchers through the internet. Based upon the available data, the genus Hepacivirus should be regarded as comprising as comprising a single species with HCV-1 as the prototype. All currently known isolates of HCV can be divided into six phylogenetically distinct groups, and we recommend that these groups are described as clades 1 to 6. Whether or not these should be regarded as different species within the Hepacivirus genus requires additional clinical, virological, and immunological information. Clades 1, 2, 4, and 5 would correspond to genotype 1, 2, 4, and 5 while clade 3 would comprise genotype 3 and genotype 10, and clade 6 comprise genotypes 6, 7, 8, 9, and 11. We propose that existing subtype designations are reassigned within these clades based upon publication priority, the existence of a complete genome sequence and prevalence. The assignment of isolates to new clades and subtypes should be confined to isolates characterized from epidemiologically unlinked individuals. Comparisons should be based on nucleotide sequence of at least two coding regions and preferably of complete genome sequences, and should be based on phylogenetic analysis rather than percent identity. A forum for discussion and contributions to these recommendations will be made available at the international HCV database at http://s2as02.genes.nig.ac.jp.


Proceedings of the National Academy of Sciences of the United States of America | 2002

Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis.

Hiroshi Akashi; Takashi Gojobori

Biosynthesis of an Escherichia coli cell, with organic compounds as sources of energy and carbon, requires approximately 20 to 60 billion high-energy phosphate bonds [Stouthamer, A. H. (1973) Antonie van Leeuwenhoek 39, 545–565]. A substantial fraction of this energy budget is devoted to biosynthesis of amino acids, the building blocks of proteins. The fueling reactions of central metabolism provide precursor metabolites for synthesis of the 20 amino acids incorporated into proteins. Thus, synthesis of an amino acid entails a dual cost: energy is lost by diverting chemical intermediates from fueling reactions and additional energy is required to convert precursor metabolites to amino acids. Among amino acids, costs of synthesis vary from 12 to 74 high-energy phosphate bonds per molecule. The energetic advantage to encoding a less costly amino acid in a highly expressed gene can be greater than 0.025% of the total energy budget. Here, we provide evidence that amino acid composition in the proteomes of E. coli and Bacillus subtilis reflects the action of natural selection to enhance metabolic efficiency. We employ synonymous codon usage bias as a measure of translation rates and show increases in the abundance of less energetically costly amino acids in highly expressed proteins.


Science | 2009

Mapping Human Genetic Diversity in Asia

Mahmood Ameen Abdulla; Ikhlak Ahmed; Anunchai Assawamakin; Jong Bhak; Samir K. Brahmachari; Gayvelline C. Calacal; Amit Chaurasia; Chien-Hsiun Chen; Jieming Chen; Yuan-Tsong Chen; Jiayou Chu; Eva Maria Cutiongco-de la Paz; Maria Corazon A. De Ungria; Frederick C. Delfin; Juli Edo; Suthat Fuchareon; Ho Ghang; Takashi Gojobori; Junsong Han; Sheng Feng Ho; Boon Peng Hoh; Wei Huang; Hidetoshi Inoko; Pankaj Jha; Timothy A. Jinam; Li Jin; Jongsun Jung; Daoroong Kangwanpong; Jatupol Kampuansai; Giulia C. Kennedy

Patterns of Early Migration In order to gain insight into various migrations that must have happened during movement of early humans into Asia and the subsequent populating of the largest continent on Earth, the HUGO Pan-Asian SNP Consortium (p. 1541) analyzed genetic variation in almost 2000 individuals representing 73 Asian and two non-Asian populations. The results suggest that there may have been a single major migration of people into Asia and a subsequent south-to-north migration across the continent. While most populations from the same linguistic group tend to cluster together in terms of relatedness, several do not, clustering instead with their geographic neighbors, suggesting either substantial recent mixing among the populations or language replacement. Furthermore, data from indigenous Taiwanese populations appear to be inconsistent with the idea of a Taiwan homeland for Austronesian populations. Genetic analyses of Asian peoples suggest that the continent was populated through a single migration event. Asia harbors substantial cultural and linguistic diversity, but the geographic structure of genetic variation across the continent remains enigmatic. Here we report a large-scale survey of autosomal variation from a broad geographic sample of Asian human populations. Our results show that genetic ancestry is strongly correlated with linguistic affiliations as well as geography. Most populations show relatedness within ethnic/linguistic groups, despite prevalent gene flow among populations. More than 90% of East Asian (EA) haplotypes could be found in either Southeast Asian (SEA) or Central-South Asian (CSA) populations and show clinal structure with haplotype diversity decreasing from south to north. Furthermore, 50% of EA haplotypes were found in SEA only and 5% were found in CSA only, indicating that SEA was a major geographic source of EA populations.


Nature Genetics | 2004

Biased biological functions of horizontally transferred genes in prokaryotic genomes.

Yoji Nakamura; Takeshi Itoh; Hideo Matsuda; Takashi Gojobori

Horizontal gene transfer is one of the main mechanisms contributing to microbial genome diversification. To clarify the overall picture of interspecific gene flow among prokaryotes, we developed a new method for detecting horizontally transferred genes and their possible donors by Bayesian inference with training models for nucleotide composition. Our method gives the average posterior probability (horizontal transfer index) for each gene sequence, with a low horizontal transfer index indicating recent horizontal transfer. We found that 14% of open reading frames in 116 prokaryotic complete genomes were subjected to recent horizontal transfer. Based on this data set, we quantitatively determined that the biological functions of horizontally transferred genes, except mobile element genes, are biased to three categories: cell surface, DNA binding and pathogenicity-related functions. Thus, the transferability of genes seems to depend heavily on their functions.


Journal of Molecular Evolution | 1982

Patterns of Nucleotide Substitution in Pseudogenes and Functional Genes

Takashi Gojobori; Wen-Hsiung Li; Dan Graur

SummaryThe pattern of point mutations is inferred from nucleotide substitutions in pseudogenes. The pattern obtained suggests that transition mutations occur somewhat more frequently than transversion mutations and that mutations result more often in A or T than in G or C. Our results are discussed with respect to the predictions from Topal and Frescos model for the molecular basis of point (substitution) mutations (Nature 263:285–289, 1976). The pattern of nucleotide substitution at the first and second positions of codons in functional genes is quite similar to that in pseudogenes, but the relative frequency of the transition C→T in the sense strand is drastically reduced and those of the transversions C→G and G→C are doubled. The differences between the two patterns can be explained by the observation that in the protein evolution amino acid substitutions occur mainly between amino acids with similar biochemical properties (Grantham, Science 185:862–864, 1974). Our results for the patterns of nucleotide substitutions in pseudogenes and in functional genes lead to the prediction that both the coding and non-coding regions of protein coding genes should have high frequencies of A and T. Available data show that the non-coding regions are indeed high in A and T but the coding regions are low in T, though high in A.

Collaboration


Dive into the Takashi Gojobori's collaboration.

Top Co-Authors

Avatar

Kazuho Ikeo

National Institute of Genetics

View shared research outputs
Top Co-Authors

Avatar

Tadashi Imanishi

National Institute of Genetics

View shared research outputs
Top Co-Authors

Avatar

Yoshio Tateno

National Institute of Genetics

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Katsuhiko Mineta

King Abdullah University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Hideaki Sugawara

National Institute of Genetics

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Takeshi Itoh

National Institute of Advanced Industrial Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Yoji Nakamura

National Institute of Genetics

View shared research outputs
Researchain Logo
Decentralizing Knowledge