Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Hongseok Tae is active.

Publication


Featured researches published by Hongseok Tae.


Nature Communications | 2012

Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana

Randor Radakovits; Robert E. Jinkerson; Susan I. Fuerstenberg; Hongseok Tae; Robert E. Settlage; Jeffrey L. Boore; Matthew C. Posewitz

The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga.


PLOS ONE | 2012

Identification of Early Zygotic Genes in the Yellow Fever Mosquito Aedes aegypti and Discovery of a Motif Involved in Early Zygotic Genome Activation

James K. Biedler; Wanqi Hu; Hongseok Tae; Zhijian Tu

During early embryogenesis the zygotic genome is transcriptionally silent and all mRNAs present are of maternal origin. The maternal-zygotic transition marks the time over which embryogenesis changes its dependence from maternal RNAs to zygotically transcribed RNAs. Here we present the first systematic investigation of early zygotic genes (EZGs) in a mosquito species and focus on genes involved in the onset of transcription during 2–4 hr. We used transcriptome sequencing to identify the “pure” (without maternal expression) EZGs by analyzing transcripts from four embryonic time ranges of 0–2, 2–4, 4–8, and 8–12 hr, which includes the time of cellular blastoderm formation and up to the start of gastrulation. Blast of 16,789 annotated transcripts vs. the transcriptome reads revealed evidence for 63 (P<0.001) and 143 (P<0.05) nonmaternally derived transcripts having a significant increase in expression at 2–4 hr. One third of the 63 EZG transcripts do not have predicted introns compared to 10% of all Ae. aegypti genes. We have confirmed by RT-PCR that zygotic transcription starts as early as 2–3 hours. A degenerate motif VBRGGTA was found to be overrepresented in the upstream sequences of the identified EZGs using a motif identification software called SCOPE. We find evidence for homology between this motif and the TAGteam motif found in Drosophila that has been implicated in EZG activation. A 38 bp sequence in the proximal upstream sequence of a kinesin light chain EZG (KLC2.1) contains two copies of the mosquito motif. This sequence was shown to support EZG transcription by luciferase reporter assays performed on injected early embryos, and confers early zygotic activity to a heterologous promoter from a divergent mosquito species. The results of these studies are consistent with the model of early zygotic genome activation via transcriptional activators, similar to what has been found recently in Drosophila.


PLOS ONE | 2013

Characterizing the Genetic Basis for Nicotine Induced Cancer Development: A Transcriptome Sequencing Study.

Jasmin H. Bavarva; Hongseok Tae; Robert E. Settlage; Harold R. Garner

Nicotine is a known risk factor for cancer development and has been shown to alter gene expression in cells and tissue upon exposure. We used Illumina® Next Generation Sequencing (NGS) technology to gain unbiased biological insight into the transcriptome of normal epithelial cells (MCF-10A) to nicotine exposure. We generated expression data from 54,699 transcripts using triplicates of control and nicotine stressed cells. As a result, we identified 138 differentially expressed transcripts, including 39 uncharacterized genes. Additionally, 173 transcripts that are primarily associated with DNA replication, recombination, and repair showed evidence for alternative splicing. We discovered the greatest nicotine stress response by HPCAL4 (up-regulated by 4.71 fold) and NPAS3 (down-regulated by -2.73 fold); both are genes that have not been previously implicated in nicotine exposure but are linked to cancer. We also discovered significant down-regulation (-2.3 fold) and alternative splicing of NEAT1 (lncRNA) that may have an important, yet undiscovered regulatory role. Gene ontology analysis revealed nicotine exposure influenced genes involved in cellular and metabolic processes. This study reveals previously unknown consequences of nicotine stress on the transcriptome of normal breast epithelial cells and provides insight into the underlying biological influence of nicotine on normal cells, marking the foundation for future studies.


Journal of Bacteriology | 2011

Revised Genome Sequence of Brucella suis 1330

Hongseok Tae; Shamira Shallom; Robert E. Settlage; Dale Preston; L. Garry Adams; Harold R. Garner

Brucella suis is a causative agent of porcine brucellosis. We report the resequencing of the original sample upon which the published sequence of Brucella suis 1330 is based and describe the differences between the published assembly and our assembly at 12 loci.


Bioinformatics | 2014

Discretized Gaussian mixture for genotyping of microsatellite loci containing homopolymer runs.

Hongseok Tae; Dong-Yun Kim; John K. McCormick; Robert E. Settlage; Harold R. Garner

MOTIVATION Inferring lengths of inherited microsatellite alleles with single base pair resolution from short sequence reads is challenging due to several sources of noise caused by the repetitive nature of microsatellites and the technologies used to generate raw sequence data. RESULTS We have developed a program, GenoTan, using a discretized Gaussian mixture model combined with a rules-based approach to identify inherited variation of microsatellite loci from short sequence reads without paired-end information. It effectively distinguishes length variants from noise including insertion/deletion errors in homopolymer runs by addressing the bidirectional aspect of insertion and deletion errors in sequence reads. Here we first introduce a homopolymer decomposition method which estimates error bias toward insertion or deletion in homopolymer sequence runs. Combining these approaches, GenoTan was able to genotype 94.9% of microsatellite loci accurately from simulated data with 40x sequence coverage quickly while the other programs showed <90% correct calls for the same data and required 5∼30× more computational time than GenoTan. It also showed the highest true-positive rate for real data using mixed sequence data of two Drosophila inbred lines, which was a novel validation approach for genotyping. AVAILABILITY GenoTan is open-source software available at http://genotan.sourceforge.net.


Genome Biology and Evolution | 2016

Genomic Signatures of Speciation in Sympatric and Allopatric Hawaiian Picture-Winged Drosophila.

Lin Kang; Robert E. Settlage; Wyatt McMahon; Katarzyna Michalak; Hongseok Tae; Harold R. Garner; Elizabeth A. Stacy; Donald K. Price; Pawel Michalak

The Hawaiian archipelago provides a natural arena for understanding adaptive radiation and speciation. The Hawaiian Drosophila are one of the most diverse endemic groups in Hawaiì with up to 1,000 species. We sequenced and analyzed entire genomes of recently diverged species of Hawaiian picture-winged Drosophila, Drosophila silvestris and Drosophila heteroneura from Hawaiì Island, in comparison with Drosophila planitibia, their sister species from Maui, a neighboring island where a common ancestor of all three had likely occurred. Genome-wide single nucleotide polymorphism patterns suggest the more recent origin of D. silvestris and D. heteroneura, as well as a pervasive influence of positive selection on divergence of the three species, with the signatures of positive selection more prominent in sympatry than allopatry. Positively selected genes were significantly enriched for functional terms related to sensory detection and mating, suggesting that sexual selection played an important role in speciation of these species. In particular, sequence variation in Olfactory receptor and Gustatory receptor genes seems to play a major role in adaptive radiation in Hawaiian pictured-winged Drosophila.


Journal of Bacteriology | 2012

Complete Genome Sequence of Brucella suis VBI22, Isolated from Bovine Milk

Hongseok Tae; S. Shallom; Robert E. Settlage; G. N. Hawkins; L. G. Adams; Harold R. Garner

Brucella suis is the causative agent of swine brucellosis and is known to be able to infect several different hosts, including cattle, dogs, and horses, without causing disease symptoms. Here we report the complete genome sequence of Brucella suis VBI22, which was isolated from raw milk from an infected cow.


PLOS ONE | 2014

Exome-Wide Somatic Microsatellite Variation Is Altered in Cells with DNA Repair Deficiencies

Zalman Vaksman; Natalie C. Fonville; Hongseok Tae; Harold R. Garner

Microsatellites (MST), tandem repeats of 1–6 nucleotide motifs, are mutational hot-spots with a bias for insertions and deletions (INDELs) rather than single nucleotide polymorphisms (SNPs). The majority of MST instability studies are limited to a small number of loci, the Bethesda markers, which are only informative for a subset of colorectal cancers. In this paper we evaluate non-haplotype alleles present within next-gen sequencing data to evaluate somatic MST variation (SMV) within DNA repair proficient and DNA repair defective cell lines. We confirm that alleles present within next-gen data that do not contribute to the haplotype can be reliably quantified and utilized to evaluate the SMV without requiring comparisons of matched samples. We observed that SMV patterns found in DNA repair proficient cell lines without DNA repair defects, MCF10A, HEK293 and PD20 RV:D2, had consistent patterns among samples. Further, we were able to confirm that changes in SMV patterns in cell lines lacking functional BRCA2, FANCD2 and mismatch repair were consistent with the different pathways perturbed. Using this new exome sequencing analysis approach we show that DNA instability can be identified in a sample and that patterns of instability vary depending on the impaired DNA repair mechanism, and that genes harboring minor alleles are strongly associated with cancer pathways. The MST Minor Allele Caller used for this study is available at https://github.com/zalmanv/MST_minor_allele_caller.


Nature Communications | 2013

Corrigendum: Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropsis gaditana.

Randor Radakovits; Robert E. Jinkerson; Susan I. Fuerstenberg; Hongseok Tae; Robert E. Settlage; Jeffrey L. Boore; Matthew C. Posewitz

Corrigendum: Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropsis gaditana


Genomics | 2012

Improved variation calling via an iterative backbone remapping and local assembly method for bacterial genomes.

Hongseok Tae; Robert E. Settlage; Shamira Shallom; Jasmin H. Bavarva; Dale Preston; Gregory N. Hawkins; L. Garry Adams; Harold R. Garner

Sequencing data analysis remains limiting and problematic, especially for low complexity repeat sequences and transposon elements due to inherent sequencing errors and short sequence read lengths. We have developed a program, ReviSeq, which uses a hybrid method composed of iterative remapping and local assembly upon a bacterial sequence backbone. Application of this method to six Brucella suis field isolates compared to the newly revised B. suis 1330 reference genome identified on average 13, 15, 19 and 9 more variants per sample than STAMPY/SAMtools, BWA/SAMtools, iCORN and BWA/PINDEL pipelines, and excluded on average 4, 2, 3 and 19 variants per sample, respectively. In total, using this iterative approach, we identified on average 87 variants including SNVs, short INDELs and long INDELs per strain when compared to the reference. Our program outperforms other methods especially for long INDEL calling. The program is available at http://reviseq.sourceforge.net.

Collaboration


Dive into the Hongseok Tae's collaboration.

Top Co-Authors

Avatar

Harold R. Garner

National Institutes of Health

View shared research outputs
Top Co-Authors

Avatar

Robert E. Settlage

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Jasmin H. Bavarva

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Lauren J. McIver

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Shamira Shallom

Virginia Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Natalie C. Fonville

Virginia Bioinformatics Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge