Jinhui Shen
University of Texas Southwestern Medical Center
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jinhui Shen.
Scientific Reports | 2016
Qian Cong; Jinhui Shen; Dominika Borek; Robert K. Robbins; Zbyszek Otwinowski; Nick V. Grishin
Comparison of complete genomes of closely related species enables research on speciation and how phenotype is determined by genotype. Lepidoptera, an insect order of 150,000 species with diverse phenotypes, is well-suited for such comparative genomics studies if new genomes, which cover additional Lepidoptera families are acquired. We report a 729 Mbp genome assembly of the Calycopis cecrops, the first genome from the family Lycaenidae and the largest available Lepidoptera genome. As detritivore, Calycopis shows expansion in detoxification and digestion enzymes. We further obtained complete genomes of 8 Calycopis specimens: 3 C. cecrops and 5 C. isobeon, including a dry specimen stored in the museum for 30 years. The two species differ subtly in phenotype and cannot be differentiated by mitochondrial DNA. However, nuclear genomes revealed a deep split between them. Genes that can clearly separate the two species (speciation hotspots) mostly pertain to circadian clock, mating behavior, transcription regulation, development and cytoskeleton. The speciation hotspots and their function significantly overlap with those we previously found in Pterourus, suggesting common speciation mechanisms in these butterflies.
Genome Biology and Evolution | 2016
Qian Cong; Jinhui Shen; Andrew D. Warren; Dominika Borek; Zbyszek Otwinowski; Nick V. Grishin
For 200 years, zoologists have relied on phenotypes to learn about the evolution of animals. A glance at the genotype, even through several gene markers, revolutionized our understanding of animal phylogeny. Recent advances in sequencing techniques allow researchers to study speciation mechanisms and the link between genotype and phenotype using complete genomes. We sequenced and assembled a complete genome of the Cloudless Sulphur (Phoebis sennae) from a single wild-caught specimen. This genome was used as reference to compare genomes of six specimens, three from the eastern populations (Oklahoma and north Texas), referred to as a subspecies Phoebis sennae eubule, and three from the southwestern populations (south Texas) known as a subspecies Phoebis sennae marcellina. While the two subspecies differ only subtly in phenotype and mitochondrial DNA, comparison of their complete genomes revealed consistent and significant differences, which are more prominent than those between tiger swallowtails Pterourus canadensis and Pterourus glaucus. The two sulphur taxa differed in histone methylation regulators, chromatin-associated proteins, circadian clock, and early development proteins. Despite being well separated on the whole-genome level, the two taxa show introgression, with gene flow mainly from P. s. marcellina to P. s. eubule. Functional analysis of introgressed genes reveals enrichment in transmembrane transporters. Many transporters are responsible for nutrient uptake, and their introgression may be of selective advantage for caterpillars to feed on more diverse food resources. Phylogenetically, complete genomes place family Pieridae away from Papilionidae, which is consistent with previous analyses based on several gene markers.
Meta Gene | 2015
Jinhui Shen; Qian Cong; Nick V. Grishin
Due to the intriguing morphology, lifecycle, and diversity of butterflies and moths, Lepidoptera are emerging as model organisms for the study of genetics, evolution and speciation. The progress of these studies relies on decoding Lepidoptera genomes, both nuclear and mitochondrial. Here we describe a protocol to obtain mitogenomes from Next Generation Sequencing reads performed for whole-genome sequencing and report the complete mitogenome of Papilio (Pterourus) glaucus. The circular mitogenome is 15,306 bp in length and rich in A and T. It contains 13 protein-coding genes (PCGs), 22 transfer-RNA-coding genes (tRNA), and 2 ribosomal-RNA-coding genes (rRNA), with a gene order typical for mitogenomes of Lepidoptera. We performed phylogenetic analyses based on PCG and RNA-coding genes or protein sequences using Bayesian Inference and Maximum Likelihood methods. The phylogenetic trees consistently show that among species with available mitogenomes Papilio glaucus is the closest to Papilio (Agehana) maraho from Asia.
F1000Research | 2016
Jinhui Shen; Qian Cong; Lisa N. Kinch; Dominika Borek; Zbyszek Otwinowski; Nick V. Grishin
The Small Cabbage White ( Pieris rapae) is originally a Eurasian butterfly. Being accidentally introduced into North America, Australia, and New Zealand a century or more ago, it spread throughout the continents and rapidly established as one of the most abundant butterfly species. Although it is a serious pest of cabbage and other mustard family plants with its caterpillars reducing crops to stems, it is also a source of pierisin, a protein unique to the Whites that shows cytotoxicity to cancer cells. To better understand the unusual biology of this omnipresent agriculturally and medically important butterfly, we sequenced and annotated the complete genome from USA specimens. At 246 Mbp, it is among the smallest Lepidoptera genomes reported to date. While 1.5% positions in the genome are heterozygous, they are distributed highly non-randomly along the scaffolds, and nearly 20% of longer than 1000 base-pair segments are SNP-free (median length: 38000 bp). Computational simulations of population evolutionary history suggest that American populations started from a very small number of introduced individuals, possibly a single fertilized female, which is in agreement with historical literature. Comparison to other Lepidoptera genomes reveals several unique families of proteins that may contribute to the unusual resilience of Pieris. The nitrile-specifier proteins divert the plant defense chemicals to non-toxic products. The apoptosis-inducing pierisins could offer a defense mechanism against parasitic wasps. While only two pierisins from Pieris rapae were characterized before, the genome sequence revealed eight, offering additional candidates as anti-cancer drugs. The reference genome we obtained lays the foundation for future studies of the Cabbage White and other Pieridae species.
Proceedings of the Royal Society B: Biological Sciences | 2017
Qian Cong; Jinhui Shen; Dominika Borek; Robert K. Robbins; Paul A. Opler; Zbyszek Otwinowski; Nick V. Grishin
Two species of hairstreak butterflies from the genus Calycopis are known in the United States: C. cecrops and C. isobeon. Analysis of mitochondrial COI barcodes of Calycopis revealed cecrops-like specimens from the eastern US with atypical barcodes that were 2.6% different from either USA species, but similar to Central American Calycopis species. To address the possibility that the specimens with atypical barcodes represent an undescribed cryptic species, we sequenced complete genomes of 27 Calycopis specimens of four species: C. cecrops, C. isobeon, C. quintana and C. bactra. Some of these specimens were collected up to 60 years ago and preserved dry in museum collections, but nonetheless produced genomes as complete as fresh samples. Phylogenetic trees reconstructed using the whole mitochondrial and nuclear genomes were incongruent. While USA Calycopis with atypical barcodes grouped with Central American species C. quintana by mitochondria, nuclear genome trees placed them within typical USA C. cecrops in agreement with morphology, suggesting mitochondrial introgression. Nuclear genomes also show introgression, especially between C. cecrops and C. isobeon. About 2.3% of each C. cecrops genome has probably (p-value < 0.01, FDR < 0.1) introgressed from C. isobeon and about 3.4% of each C. isobeon genome may have come from C. cecrops. The introgressed regions are enriched in genes encoding transmembrane proteins, mitochondria-targeting proteins and components of the larval cuticle. This study provides the first example of mitochondrial introgression in Lepidoptera supported by complete genome sequencing. Our results caution about relying solely on COI barcodes and mitochondrial DNA for species identification or discovery.
Mitochondrial DNA Part B | 2017
Jing Zhang; Qian Cong; Jinhui Shen; Xiao-Ling Fan; Min Wang; Nick V. Grishin
Abstract We assembled a complete mitochondrial genome of a unique Australian skipper butterfly Euschemon rafflesia (Hesperiidae) from next generation sequencing reads. The 15,447 bp mitogenome covers 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), 2 ribosomal RNA genes (rRNAs), and an A + T-rich region. Its gene order is typical for mitogenomes of Lepidoptera. Phylogenetic analysis places Euschemon rafflesia as a sister to the rest of Hesperiidae except Coeliadinae.
Mitochondrial DNA Part B | 2016
Jinhui Shen; Qian Cong; Nick V. Grishin
Abstract We obtained a complete mitochondrial genome of a skipper butterfly Achalarus lyciades (Hesperiidae, Eudaminae) from next generation sequencing reads. The 15,612 bp mitogenome covers 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), 2 ribosomal RNA genes (rRNAs) and an A + T rich region. Its gene order is typical for mitogenomes of Lepidoptera. Phylogenetic analysis placed A. lyciades as a sister to Lobocla bifasciatus, the only other Eudaminae with available mitogenome.
Genomics | 2017
Qian Cong; Jinhui Shen; Wenlin Li; Dominika Borek; Zbyszek Otwinowski; Nick V. Grishin
Sequencing complete genomes of all major phylogenetic groups of organisms opens unprecedented opportunities to study evolution and genetics. We report draft genomes of Calephelis nemesis and Calephelis virginiensis, representatives of the family Riodinidae. They complete the genomic coverage of butterflies at the family level. At 809 and 855 Mbp, respectively, they become the largest available Lepidoptera genomes. Comparison of butterfly genomes shows that the divergence between Riodinidae and Lycaenidae dates to the time when other families started to diverge into subfamilies. Thus, Riodinidae may be considered a subfamily of Lycaenidae. Calephelis species exhibit unique gene expansions in actin-disassembling factor, cofilin, and chitinase. The functional implications of these gene expansions are not clear, but they may aid molting of caterpillars covered in extensive setae. The two Calephelis species diverged about 5 million years ago and they differ in proteins involved in metabolism, circadian clock, regulation of development, and immune responses.
Current Genomics | 2017
Jinhui Shen; Qian Cong; Dominika Borek; Zbyszek Otwinowski; Nick V. Grishin
Background: The Hoary Edge Skipper (Achalarus lyciades) is an eastern North America endemic butterfly from the Eudaminae subfamily of skippers named for an underside whitish patch near the hindwing edge. Its caterpillars feed on legumes, in contrast to Grass skippers (subfamily Hesperiinae) which feed exclusively on monocots. Results: To better understand the evolution and phenotypic diversification of Skippers (family Hesperiidae), we sequenced, assembled and annotated a complete genome draft and transcriptome of a wild-caught specimen of A. lyciades and compared it with the available genome of the Clouded Skipper (Lerema accius) from the Grass skipper subfamily. The genome of A. lyciades is nearly twice the size of L. accius (567 Mbp vs. 298 Mbp), however it encodes a smaller number of proteins (15881 vs. 17411). Gene expansions we identified previously in L. accius apparently did not occur in the genome of A. lyciades. For instance, a family of hypothetical cellulases that diverged from an endochitinase (possibly associated with feeding of L. accius caterpillars on nutrient-poor grasses) is absent in A. lyciades. While L. accius underwent gene expansion in pheromone binding proteins, A. lyciades has more opsins. This difference may be related to the mate recognition mechanisms of the two species: visual cues might be more important for the Eudaminae skippers (which have more variable wing patterns), whereas odor might be more important for Grass skippers (that are hardly distinguishable by their wings). Phylogenetically, A. lyciades is a sister species of L. accius, the only other Hesperiidae with a complete genome. Conclusions: A new reference genome of a dicot-feeding skippers, the first from the Eudaminae subfamily, reveals its larger size and suggests hypotheses about phenotypic traits and differences from monocot-feeding skippers.
Mitochondrial DNA Part B | 2017
Jing Zhang; Qian Cong; Jinhui Shen; Rongjiang Wang; Nick V. Grishin
Abstract We assembled a complete mitogenome of an Asian skipper butterfly Burara striata (Hesperiidae, Coeliadinae), the first representative of the genus Burara, from next generation sequencing reads. The 15327 bp mitogenome covers 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), 2 ribosomal RNA genes (rRNAs), and an A + T rich region. Its gene order is typical for mitogenomes of Lepidoptera. Phylogenetic analysis places Burara striata as a sister to Hasora, and Choaspes as a sister to both of these genera.