Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where James Ostell is active.

Publication


Featured researches published by James Ostell.


Nucleic Acids Research | 2004

Entrez Gene: gene-centered information at NCBI

Donna Maglott; James Ostell; Kim D. Pruitt; Tatiana Tatusova

Entrez Gene () is NCBIs database for gene-specific information. Entrez Gene includes records from genomes that have been completely sequenced, that have an active research community to contribute gene-specific information or that are scheduled for intense sequence analysis. The content of Entrez Gene represents the result of both curation and automated integration of data from NCBIs Reference Sequence project (RefSeq), from collaborating model organism databases and from other databases within NCBI. Records in Entrez Gene are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, map location, gene products and their attributes, markers, phenotypes and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is provided via interactive browsing through NCBIs Entrez system, via NCBIs Entrez programing utilities (E-Utilities), and for bulk transfer by ftp.


American Journal of Human Genetics | 2010

Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies.

David T. Miller; Adam Mp; Swaroop Aradhya; Leslie G. Biesecker; Arthur R. Brothman; Nigel P. Carter; Deanna M. Church; John A. Crolla; Evan E. Eichler; Charles J. Epstein; W. Andrew Faucett; Lars Feuk; Jan M. Friedman; Ada Hamosh; Laird G. Jackson; Erin B. Kaminsky; Klaas Kok; Ian D. Krantz; Robert M. Kuhn; Charles Lee; James Ostell; Carla Rosenberg; Stephen W. Scherer; Nancy B. Spinner; Dimitri J. Stavropoulos; James Tepperberg; Erik C. Thorland; Joris Vermeesch; Darrel Waggoner; Michael S. Watson

Chromosomal microarray (CMA) is increasingly utilized for genetic testing of individuals with unexplained developmental delay/intellectual disability (DD/ID), autism spectrum disorders (ASD), or multiple congenital anomalies (MCA). Performing CMA and G-banded karyotyping on every patient substantially increases the total cost of genetic testing. The International Standard Cytogenomic Array (ISCA) Consortium held two international workshops and conducted a literature review of 33 studies, including 21,698 patients tested by CMA. We provide an evidence-based summary of clinical cytogenetic testing comparing CMA to G-banded karyotyping with respect to technical advantages and limitations, diagnostic yield for various types of chromosomal aberrations, and issues that affect test interpretation. CMA offers a much higher diagnostic yield (15%-20%) for genetic testing of individuals with unexplained DD/ID, ASD, or MCA than a G-banded karyotype ( approximately 3%, excluding Down syndrome and other recognizable chromosomal syndromes), primarily because of its higher sensitivity for submicroscopic deletions and duplications. Truly balanced rearrangements and low-level mosaicism are generally not detectable by arrays, but these are relatively infrequent causes of abnormal phenotypes in this population (<1%). Available evidence strongly supports the use of CMA in place of G-banded karyotyping as the first-tier cytogenetic diagnostic test for patients with DD/ID, ASD, or MCA. G-banded karyotype analysis should be reserved for patients with obvious chromosomal syndromes (e.g., Down syndrome), a family history of chromosomal rearrangement, or a history of multiple miscarriages.


Nature Genetics | 2007

The NCBI dbGaP database of genotypes and phenotypes

Matthew D. Mailman; Michael Feolo; Yumi Jin; Masato Kimura; Kimberly A Tryka; Rinat Bagoutdinov; Luning Hao; Anne Kiang; Justin Paschall; Lon Phan; Natalia Popova; Stephanie Pretel; Lora Ziyabari; Moira Lee; Yu Shao; Zhen Y Wang; Karl Sirotkin; Minghong Ward; Michael Kholodov; Kerry Zbicz; Jeff Beck; Michael Kimelman; Sergey Shevelev; Don Preuss; Eugene Yaschenko; Alan S. Graeff; James Ostell; Stephen T. Sherry

The National Center for Biotechnology Information has created the dbGaP public repository for individual-level phenotype, exposure, genotype and sequence data and the associations between them. dbGaP assigns stable, unique identifiers to studies and subsets of information from those studies, including documents, individual phenotypic variables, tables of trait data, sets of genotype data, computed phenotype-genotype associations, and groups of study subjects who have given similar consents for use of their data.


Nucleic Acids Research | 2014

RefSeq: an update on mammalian reference sequences

Kim D. Pruitt; Garth Brown; Susan M. Hiatt; Françoise Thibaud-Nissen; Alexander Astashyn; Olga Ermolaeva; Catherine M. Farrell; Jennifer Hart; Melissa J. Landrum; Kelly M. McGarvey; Michael R. Murphy; Nuala A. O’Leary; Shashikant Pujar; Bhanu Rajput; Sanjida H. Rangwala; Lillian D. Riddick; Andrei Shkeda; Hanzhen Sun; Pamela Tamez; Raymond E. Tully; Craig Wallin; David Webb; Janet Weber; Wendy Wu; Michael DiCuccio; Paul Kitts; Donna Maglott; Terence Murphy; James Ostell

The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of annotated genomic, transcript and protein sequence records derived from data in public sequence archives and from computation, curation and collaboration (http://www.ncbi.nlm.nih.gov/refseq/). We report here on growth of the mammalian and human subsets, changes to NCBI’s eukaryotic annotation pipeline and modifications affecting transcript and protein records. Recent changes to NCBI’s eukaryotic genome annotation pipeline provide higher throughput, and the addition of RNAseq data to the pipeline results in a significant expansion of the number of transcripts and novel exons annotated on mammalian RefSeq genomes. Recent annotation changes include reporting supporting evidence for transcript records, modification of exon feature annotation and the addition of a structured report of gene and sequence attributes of biological interest. We also describe a revised protein annotation policy for alternatively spliced transcripts with more divergent predicted proteins and we summarize the current status of the RefSeqGene project.


Genome Research | 2009

The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes

Kim D. Pruitt; Jennifer Harrow; Rachel A. Harte; Craig Wallin; Mark Diekhans; Donna Maglott; Steve Searle; Catherine M. Farrell; Jane Loveland; Barbara J. Ruef; Elizabeth Hart; Marie-Marthe Suner; Melissa J. Landrum; Bronwen Aken; Sarah Ayling; Robert Baertsch; Julio Fernandez-Banet; Joshua L. Cherry; Val Curwen; Michael DiCuccio; Manolis Kellis; Jennifer M. Lee; Michael F. Lin; Michael Schuster; Andrew Shkeda; Clara Amid; Garth Brown; Oksana Dukhanina; Adam Frankish; Jennifer Hart

Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.


Nucleic Acids Research | 2016

NCBI prokaryotic genome annotation pipeline.

Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P. Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D. Pruitt; Mark Borodovsky; James Ostell

Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBIs Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/.


Nature Genetics | 2007

New models of collaboration in genome-wide association studies: the Genetic Association Information Network.

Teri A. Manolio; Laura Lyman Rodriguez; Lisa D. Brooks; Gonçalo R. Abecasis; Dennis G. Ballinger; Mark J. Daly; Peter Donnelly; Stephen V. Faraone; Kelly A. Frazer; Stacey Gabriel; Pablo V. Gejman; Alan E. Guttmacher; Emily L. Harris; Thomas R. Insel; John R. Kelsoe; Eric S. Lander; Norma McCowin; Matthew D. Mailman; Elizabeth G. Nabel; James Ostell; Elizabeth W. Pugh; Stephen T. Sherry; Patrick F. Sullivan; John F. Thompson; James H. Warram; David Wholley; Patrice M. Milos; Francis S. Collins

The Genetic Association Information Network (GAIN) is a public-private partnership established to investigate the genetic basis of common diseases through a series of collaborative genome-wide association studies. GAIN has used new approaches for project selection, data deposition and distribution, collaborative analysis, publication and protection from premature intellectual property claims. These demonstrate a new commitment to shared scientific knowledge that should facilitate rapid advances in understanding the genetics of complex diseases.


Nucleic Acids Research | 2013

IgBLAST: an immunoglobulin variable domain sequence analysis tool

Jian Ye; Ning Ma; Thomas L. Madden; James Ostell

The variable domain of an immunoglobulin (IG) sequence is encoded by multiple genes, including the variable (V) gene, the diversity (D) gene and the joining (J) gene. Analysis of IG sequences typically requires identification of each gene, as well as a comparison of sequence variations in the context of defined regions. General purpose tools, such as the BLAST program, have only limited use for such tasks, as the rearranged nature of an IG sequence and the variable length of each gene requires multiple rounds of BLAST searches for a single IG sequence. Additionally, manual assembly of different genes is difficult and error-prone. To address these issues and to facilitate other common tasks in analysing IG sequences, we have developed the sequence analysis tool IgBLAST (http://www.ncbi.nlm.nih.gov/igblast/). With this tool, users can view the matches to the germline V, D and J genes, details at rearrangement junctions, the delineation of IG V domain framework regions and complementarity determining regions. IgBLAST has the capability to analyse nucleotide and protein sequences and can process sequences in batches. Furthermore, IgBLAST allows searches against the germline gene databases and other sequence databases simultaneously to minimize the chance of missing possibly the best matching germline V gene.


Nature | 2009

Prepublication data sharing.

Ewan Birney; Thomas J. Hudson; Eric D. Green; Chris Gunter; Sean R. Eddy; John A. Rogers; Jennifer R. Harris; S D Ehrlich; Rolf Apweiler; C P Austin; L Berglund; Martin Bobrow; C. Bountra; Anthony J. Brookes; Anne Cambon-Thomsen; Nigel P. Carter; Rex L. Chisholm; Jorge L. Contreras; R M Cooke; William L. Crosby; Ken Dewar; Richard Durbin; Dyke Som.; Joseph R. Ecker; K El Emam; Lars Feuk; Stacey Gabriel; John Gallacher; William M. Gelbart; Antonio Granell

Rapid release of prepublication data has served the field of genomics well. Attendees at a workshop in Toronto recommend extending the practice to other biological data sets.


Genomics | 1990

The National Center for Biotechnology Information

Dennis Benson; Mark S. Boguski; David J. Lipman; James Ostell

You may not have heard of the National Center for Biotechnology Information (NCBI), but chances are good that you’ve used one of our resources, such as the PubMed database of biomedical literature or the BLAST DNA and protein sequence similarity search tool. And you’re not alone: Each day, over 2 million people access NCBI databases and download a total of more than 3 terabytes (trillion bytes) of data.

Collaboration


Dive into the James Ostell's collaboration.

Top Co-Authors

Avatar

Ilene Karsch-Mizrachi

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Kim D. Pruitt

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Tatiana Tatusova

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Michael DiCuccio

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Webb Miller

Pennsylvania State University

View shared research outputs
Top Co-Authors

Avatar

Craig Wallin

University of California

View shared research outputs
Top Co-Authors

Avatar

Donna Maglott

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Catherine M. Farrell

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Deanna M. Church

National Institutes of Health

View shared research outputs
Researchain Logo
Decentralizing Knowledge