Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Joshua Orvis is active.

Publication


Featured researches published by Joshua Orvis.


Science | 2007

Genome sequence of Aedes aegypti, a major arbovirus vector

Vishvanath Nene; Jennifer R. Wortman; Daniel John Lawson; Brian J. Haas; Chinnappa D. Kodira; Zhijian Jake Tu; Brendan J. Loftus; Zhiyong Xi; Karyn Megy; Manfred Grabherr; Quinghu Ren; Evgeny M. Zdobnov; Neil F. Lobo; Kathryn S. Campbell; Susan E. Brown; Maria F. Bonaldo; Jingsong Zhu; Steven P. Sinkins; David G. Hogenkamp; Paolo Amedeo; Peter Arensburger; Peter W. Atkinson; Shelby Bidwell; Jim Biedler; Ewan Birney; Robert V. Bruggner; Javier Costas; Monique R. Coy; Jonathan Crabtree; Matt Crawford

We present a draft sequence of the genome of Aedes aegypti, the primary vector for yellow fever and dengue fever, which at ∼1376 million base pairs is about 5 times the size of the genome of the malaria vector Anopheles gambiae. Nearly 50% of the Ae. aegypti genome consists of transposable elements. These contribute to a factor of ∼4 to 6 increase in average gene length and in sizes of intergenic regions relative to An. gambiae and Drosophila melanogaster. Nonetheless, chromosomal synteny is generally maintained among all three insects, although conservation of orthologous gene order is higher (by a factor of ∼2) between the mosquito species than between either of them and the fruit fly. An increase in genes encoding odorant binding, cytochrome P450, and cuticle domains relative to An. gambiae suggests that members of these protein families underpin some of the biological differences between the two mosquito species.


Nucleic Acids Research | 2007

The TIGR Rice Genome Annotation Resource: improvements and new features

Shu Ouyang; Wei Zhu; John A. Hamilton; Haining Lin; Matthew Campbell; Kevin L. Childs; Françoise Thibaud-Nissen; Renae L. Malek; Yuandan Lee; Li Zheng; Joshua Orvis; Brian J. Haas; Jennifer R. Wortman; C. Robin Buell

In The Institute for Genomic Research Rice Genome Annotation project (), we have continued to update the rice genome sequence with new data and improve the quality of the annotation. In our current release of annotation (Release 4.0; January 12, 2006), we have identified 42 653 non-transposable element-related genes encoding 49 472 gene models as a result of the detection of alternative splicing. We have refined our identification methods for transposable element-related genes resulting in 13 237 genes that are related to transposable elements. Through incorporation of multiple transcript and proteomic expression data sets, we have been able to annotate 24 799 genes (31 739 gene models), representing ∼50% of the total gene models, as expressed in the rice genome. All structural and functional annotation is viewable through our Rice Genome Browser which currently supports 59 tracks. Enhanced data access is available through web interfaces, FTP downloads and a Data Extractor tool developed in order to support discrete dataset downloads.


Science | 2010

A catalog of reference genomes from the human microbiome.

Karen E. Nelson; George M. Weinstock; Sarah K. Highlander; Kim C. Worley; Heather Huot Creasy; Jennifer R. Wortman; Douglas B. Rusch; Makedonka Mitreva; Erica Sodergren; Asif T. Chinwalla; Michael Feldgarden; Dirk Gevers; Brian J. Haas; Ramana Madupu; Doyle V. Ward; Bruce Birren; Richard A. Gibbs; Barbara A. Methé; Joseph F. Petrosino; Robert L. Strausberg; Granger Sutton; Owen White; Richard Wilson; Scott Durkin; Michelle G. Giglio; Sharvari Gujja; Clint Howarth; Chinnappa D. Kodira; Nikos C. Kyrpides; Teena Mehta

News from the Inner Tube of Life A major initiative by the U.S. National Institutes of Health to sequence 900 genomes of microorganisms that live on the surfaces and orifices of the human body has established standardized protocols and methods for such large-scale reference sequencing. By combining previously accumulated data with new data, Nelson et al. (p. 994) present an initial analysis of 178 bacterial genomes. The sampling so far barely scratches the surface of the microbial diversity found on humans, but the work provides an important baseline for future analyses. Standardized protocols and methods are being established for large-scale sequencing of the microorganisms living on humans. The human microbiome refers to the community of microorganisms, including prokaryotes, viruses, and microbial eukaryotes, that populate the human body. The National Institutes of Health launched an initiative that focuses on describing the diversity of microbial species that are associated with health and disease. The first phase of this initiative includes the sequencing of hundreds of microbial reference genomes, coupled to metagenomic sequencing from multiple body sites. Here we present results from an initial reference genome sequencing of 178 microbial genomes. From 547,968 predicted polypeptides that correspond to the gene complement of these strains, previously unidentified (“novel”) polypeptides that had both unmasked sequence length greater than 100 amino acids and no BLASTP match to any nonreference entry in the nonredundant subset were defined. This analysis resulted in a set of 30,867 polypeptides, of which 29,987 (~97%) were unique. In addition, this set of microbial genomes allows for ~40% of random sequences from the microbiome of the gastrointestinal tract to be associated with organisms based on the match criteria used. Insights into pan-genome analysis suggest that we are still far from saturating microbial species genetic data sets. In addition, the associated metrics and standards used by our group for quality assurance are presented.


Genome Biology | 2008

Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments

Brian J. Haas; Wei Zhu; Mihaela Pertea; Jonathan E. Allen; Joshua Orvis; Owen White; C. Robin Buell; Jennifer R. Wortman

EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.


PLOS Genetics | 2008

Genomic Islands in the Pathogenic Filamentous Fungus Aspergillus fumigatus

Natalie D. Fedorova; Nora Khaldi; Vinita Joardar; Rama Maiti; Paolo Amedeo; Michael J. Anderson; Jonathan Crabtree; Joana C. Silva; Jonathan H. Badger; Ahmed Abdulrahman Albarraq; Sam Angiuoli; Howard Bussey; Paul Bowyer; Peter J. Cotty; Paul S. Dyer; Amy Egan; Kevin Galens; Claire M. Fraser-Liggett; Brian J. Haas; Jason M. Inman; Richard Kent; Sébastien Lemieux; Iran Malavazi; Joshua Orvis; Terry Roemer; Catherine M. Ronning; Jaideep Sundaram; Granger Sutton; Geoff Turner; J. Craig Venter

We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility (het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer (HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated “gene dumps” and, perhaps, simultaneously, as “gene factories”.


Nature Biotechnology | 2010

Draft genome sequence of the oilseed species Ricinus communis

Agnes P. Chan; Jonathan Crabtree; Qi Zhao; Hernan Lorenzi; Joshua Orvis; Daniela Puiu; Admasu Melake-Berhan; Kristine M Jones; Julia C. Redman; Grace Q. Chen; Edgar B. Cahoon; Melaku Gedil; Mario Stanke; Brian J. Haas; Jennifer R. Wortman; Claire M. Fraser-Liggett; Jacques Ravel; Pablo D. Rabinowicz

Castor bean (Ricinus communis) is an oilseed crop that belongs to the spurge (Euphorbiaceae) family, which comprises ∼6,300 species that include cassava (Manihot esculenta), rubber tree (Hevea brasiliensis) and physic nut (Jatropha curcas). It is primarily of economic interest as a source of castor oil, used for the production of high-quality lubricants because of its high proportion of the unusual fatty acid ricinoleic acid. However, castor bean genomics is also relevant to biosecurity as the seeds contain high levels of ricin, a highly toxic, ribosome-inactivating protein. Here we report the draft genome sequence of castor bean (4.6-fold coverage), the first for a member of the Euphorbiaceae. Whereas most of the key genes involved in oil synthesis and turnover are single copy, the number of members of the ricin gene family is larger than previously thought. Comparative genomics analysis suggests the presence of an ancient hexaploidization event that is conserved across the dicotyledonous lineage.Castor bean (Ricinus communis) is an oil crop that belongs to the spurge (Euphorbiaceae) family. Its seeds are the source of castor oil, used for the production of high-quality lubricants due to its high proportion of the unusual fatty acid ricinoleic acid. Castor bean seeds also produce ricin, a highly toxic ribosome inactivating protein, making castor bean relevant for biosafety. We report here the 4.6X draft genome sequence of castor bean, representing the first reported Euphorbiaceae genome sequence. Our analysis shows that most key castor oil metabolism genes are single-copy while the ricin gene family is larger than previously thought. Comparative genomics analysis suggests the presence of an ancient hexaploidization event that is conserved across the dicotyledonous lineage.


Nucleic Acids Research | 2014

The Aspergillus Genome Database: multispecies curation and incorporation of RNA-Seq data to improve structural gene annotations

Gustavo C. Cerqueira; Martha B. Arnaud; Diane O. Inglis; Marek S. Skrzypek; Gail Binkley; Matt Simison; Stuart R. Miyasato; Jonathan Binkley; Joshua Orvis; Prachi Shah; Farrell Wymore; Gavin Sherlock; Jennifer R. Wortman

The Aspergillus Genome Database (AspGD; http://www.aspgd.org) is a freely available web-based resource that was designed for Aspergillus researchers and is also a valuable source of information for the entire fungal research community. In addition to being a repository and central point of access to genome, transcriptome and polymorphism data, AspGD hosts a comprehensive comparative genomics toolbox that facilitates the exploration of precomputed orthologs among the 20 currently available Aspergillus genomes. AspGD curators perform gene product annotation based on review of the literature for four key Aspergillus species: Aspergillus nidulans, Aspergillus oryzae, Aspergillus fumigatus and Aspergillus niger. We have iteratively improved the structural annotation of Aspergillus genomes through the analysis of publicly available transcription data, mostly expressed sequenced tags, as described in a previous NAR Database article (Arnaud et al. 2012). In this update, we report substantive structural annotation improvements for A. nidulans, A. oryzae and A. fumigatus genomes based on recently available RNA-Seq data. Over 26 000 loci were updated across these species; although those primarily comprise the addition and extension of untranslated regions (UTRs), the new analysis also enabled over 1000 modifications affecting the coding sequence of genes in each target genome.


Nucleic Acids Research | 2012

The Aspergillus Genome Database (AspGD): recent developments in comprehensive multispecies curation, comparative genomics and community resources

Martha B. Arnaud; Gustavo C. Cerqueira; Diane O. Inglis; Marek S. Skrzypek; Jonathan Binkley; Marcus C. Chibucos; Jonathan Crabtree; Clinton Howarth; Joshua Orvis; Prachi Shah; Farrell Wymore; Gail Binkley; Stuart R. Miyasato; Matt Simison; Gavin Sherlock; Jennifer R. Wortman

The Aspergillus Genome Database (AspGD; http://www.aspgd.org) is a freely available, web-based resource for researchers studying fungi of the genus Aspergillus, which includes organisms of clinical, agricultural and industrial importance. AspGD curators have now completed comprehensive review of the entire published literature about Aspergillus nidulans and Aspergillus fumigatus, and this annotation is provided with streamlined, ortholog-based navigation of the multispecies information. AspGD facilitates comparative genomics by providing a full-featured genomics viewer, as well as matched and standardized sets of genomic information for the sequenced aspergilli. AspGD also provides resources to foster interaction and dissemination of community information and resources. We welcome and encourage feedback at [email protected].


Nucleic Acids Research | 2010

The Aspergillus Genome Database, a curated comparative genomics resource for gene, protein and sequence information for the Aspergillus research community

Martha B. Arnaud; Marcus C. Chibucos; Maria C. Costanzo; Jonathan Crabtree; Diane O. Inglis; Adil Lotia; Joshua Orvis; Prachi Shah; Marek S. Skrzypek; Gail Binkley; Stuart R. Miyasato; Jennifer R. Wortman; Gavin Sherlock

The Aspergillus Genome Database (AspGD) is an online genomics resource for researchers studying the genetics and molecular biology of the Aspergilli. AspGD combines high-quality manual curation of the experimental scientific literature examining the genetics and molecular biology of Aspergilli, cutting-edge comparative genomics approaches to iteratively refine and improve structural gene annotations across multiple Aspergillus species, and web-based research tools for accessing and exploring the data. All of these data are freely available at http://www.aspgd.org. We welcome feedback from users and the research community at [email protected].


Standards in Genomic Sciences | 2011

The IGS Standard Operating Procedure for Automated Prokaryotic Annotation

Kevin Galens; Joshua Orvis; Sean J. Daugherty; Heather Huot Creasy; Sam Angiuoli; Owen White; Jennifer R. Wortman; Anup Mahurkar; Michelle G. Giglio

The Institute for Genome Sciences (IGS) has developed a prokaryotic annotation pipeline that is used for coding gene/RNA prediction and functional annotation of Bacteria and Archaea. The fully automated pipeline accepts one or many genomic sequences as input and produces output in a variety of standard formats. Functional annotation is primarily based on similarity searches and motif finding combined with a hierarchical rule based annotation system. The output annotations can also be loaded into a relational database and accessed through visualization tools.

Collaboration


Dive into the Joshua Orvis's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Owen White

J. Craig Venter Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge