Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Kin Fai Au is active.

Publication


Featured researches published by Kin Fai Au.


Genomics, Proteomics & Bioinformatics | 2015

PacBio Sequencing and Its Applications.

Anthony Rhoads; Kin Fai Au

Single-molecule, real-time sequencing developed by Pacific BioSciences offers longer read lengths than the second-generation sequencing (SGS) technologies, making it well-suited for unsolved problems in genome, transcriptome, and epigenetics research. The highly-contiguous de novo assemblies using PacBio sequencing can close gaps in current reference assemblies and characterize structural variation (SV) in personal genomes. With longer reads, we can sequence through extended repetitive regions and detect mutations, many of which are associated with diseases. Moreover, PacBio transcriptome sequencing is advantageous for the identification of gene isoforms and facilitates reliable discoveries of novel genes and novel isoforms of annotated genes, due to its ability to sequence full-length transcripts or fragments with significant lengths. Additionally, PacBio’s sequencing technique provides information that is useful for the direct detection of base modifications, such as methylation. In addition to using PacBio sequencing alone, many hybrid sequencing strategies have been developed to make use of more accurate short reads in conjunction with PacBio long reads. In general, hybrid sequencing strategies are more affordable and scalable especially for small-size laboratories than using PacBio Sequencing alone. The advent of PacBio sequencing has made available much information that could not be obtained via SGS alone.


Nucleic Acids Research | 2010

Detection of splice junctions from paired-end RNA-seq data by SpliceMap

Kin Fai Au; Hui Jiang; Lan Lin; Yi Xing; Wing Hung Wong

Alternative splicing is a prevalent post-transcriptional process, which is not only important to normal cellular function but is also involved in human diseases. The newly developed second generation sequencing technique provides high-throughput data (RNA-seq data) to study alternative splicing events in different types of cells. Here, we present a computational method, SpliceMap, to detect splice junctions from RNA-seq data. This method does not depend on any existing annotation of gene structures and is capable of finding novel splice junctions with high sensitivity and specificity. It can handle long reads (50–100 nt) and can exploit paired-read information to improve mapping accuracy. Several parameters are included in the output to indicate the reliability of the predicted junction and help filter out false predictions. We applied SpliceMap to analyze 23 million paired 50-nt reads from human brain tissue. The results show at this depth of sequencing, RNA-seq can support reliable detection of splice junctions except for those that are present at very low level. Compared to current methods, SpliceMap can achieve 12% higher sensitivity without sacrificing specificity.


Cell | 2012

Activation of Innate Immunity is Required for Efficient Nuclear Reprogramming

Jieun Lee; Nazish Sayed; Arwen L. Hunter; Kin Fai Au; Wing Hung Wong; Edward S. Mocarski; Renee A. Reijo Pera; Eduard Yakubov; John P. Cooke

Retroviral overexpression of reprogramming factors (Oct4, Sox2, Klf4, c-Myc) generates induced pluripotent stem cells (iPSCs). However, the integration of foreign DNA could induce genomic dysregulation. Cell-permeant proteins (CPPs) could overcome this limitation. To date, this approach has proved exceedingly inefficient. We discovered a striking difference in the pattern of gene expression induced by viral versus CPP-based delivery of the reprogramming factors, suggesting that a signaling pathway required for efficient nuclear reprogramming was activated by the retroviral, but not CPP approach. In gain- and loss-of-function studies, we find that the toll-like receptor 3 (TLR3) pathway enables efficient induction of pluripotency by viral or mmRNA approaches. Stimulation of TLR3 causes rapid and global changes in the expression of epigenetic modifiers to enhance chromatin remodeling and nuclear reprogramming. Activation of inflammatory pathways are required for efficient nuclear reprogramming in the induction of pluripotency.


Structure | 2006

Lysine Methylation as a Routine Rescue Strategy for Protein Crystallization

Thomas S. Walter; Christoph Meier; René Assenberg; Kin Fai Au; Jingshan Ren; Anil Verma; Joanne E. Nettleship; Raymond J. Owens; David I. Stuart; Jonathan M. Grimes

Summary Crystallization remains a critical step in X-ray structure determination. Because it is not generally possible to rationally predict crystallization conditions, commercial screens have been developed which sample a wide range of crystallization space. While this approach has proved successful in many cases, a significant number of proteins fail to crystallize despite being soluble and monodispersed. It is established that chemical modification can facilitate the crystallization of otherwise intractable proteins. Here we describe a method for the reductive methylation of lysine residues which is simple, inexpensive, and efficient, and report on its application to ten proteins. We describe the effect of methylation on the physico-chemical properties of these proteins, and show that it led to diffraction-quality crystals from four proteins and structures for three that had hitherto proved refractory to crystallization. The method is suited to both low- and high-throughput laboratories.


PLOS ONE | 2012

Improving PacBio Long Read Accuracy by Short Read Alignment

Kin Fai Au; Jason G. Underwood; Lawrence Lee; Wing Hung Wong

The recent development of third generation sequencing (TGS) generates much longer reads than second generation sequencing (SGS) and thus provides a chance to solve problems that are difficult to study through SGS alone. However, higher raw read error rates are an intrinsic drawback in most TGS technologies. Here we present a computational method, LSC, to perform error correction of TGS long reads (LR) by SGS short reads (SR). Aiming to reduce the error rate in homopolymer runs in the main TGS platform, the PacBio® RS, LSC applies a homopolymer compression (HC) transformation strategy to increase the sensitivity of SR-LR alignment without scarifying alignment accuracy. We applied LSC to 100,000 PacBio long reads from human brain cerebellum RNA-seq data and 64 million single-end 75 bp reads from human brain RNA-seq data. The results show LSC can correct PacBio long reads to reduce the error rate by more than 3 folds. The improved accuracy greatly benefits many downstream analyses, such as directional gene isoform detection in RNA-seq study. Compared with another hybrid correction tool, LSC can achieve over double the sensitivity and similar specificity.


Journal of Clinical Investigation | 2014

ALS-associated mutation FUS-R521C causes DNA damage and RNA splicing defects

Haiyan Qiu; Sebum Lee; Yulei Shang; Wen Yuan Wang; Kin Fai Au; Sherry Kamiya; Sami J. Barmada; Steven Finkbeiner; Hansen Lui; Caitlin E. Carlton; Amy A. Tang; Michael C. Oldham; Hejia Wang; James Shorter; Anthony J. Filiano; Erik D. Roberson; Warren G. Tourtellotte; Bin Chen; Li-Huei Tsai; Eric J. Huang

Autosomal dominant mutations of the RNA/DNA binding protein FUS are linked to familial amyotrophic lateral sclerosis (FALS); however, it is not clear how FUS mutations cause neurodegeneration. Using transgenic mice expressing a common FALS-associated FUS mutation (FUS-R521C mice), we found that mutant FUS proteins formed a stable complex with WT FUS proteins and interfered with the normal interactions between FUS and histone deacetylase 1 (HDAC1). Consequently, FUS-R521C mice exhibited evidence of DNA damage as well as profound dendritic and synaptic phenotypes in brain and spinal cord. To provide insights into these defects, we screened neural genes for nucleotide oxidation and identified brain-derived neurotrophic factor (Bdnf) as a target of FUS-R521C-associated DNA damage and RNA splicing defects in mice. Compared with WT FUS, mutant FUS-R521C proteins formed a more stable complex with Bdnf RNA in electrophoretic mobility shift assays. Stabilization of the FUS/Bdnf RNA complex contributed to Bdnf splicing defects and impaired BDNF signaling through receptor TrkB. Exogenous BDNF only partially restored dendrite phenotype in FUS-R521C neurons, suggesting that BDNF-independent mechanisms may contribute to the defects in these neurons. Indeed, RNA-seq analyses of FUS-R521C spinal cords revealed additional transcription and splicing defects in genes that regulate dendritic growth and synaptic functions. Together, our results provide insight into how gain-of-function FUS mutations affect critical neuronal functions.


Proceedings of the National Academy of Sciences of the United States of America | 2013

Characterization of the human ESC transcriptome by hybrid sequencing

Kin Fai Au; Vittorio Sebastiano; Pegah Tootoonchi Afshar; Jens Durruthy Durruthy; Lawrence Lee; Brian A. Williams; Harm van Bakel; Eric E. Schadt; Renee Reijo-Pera; Jason G. Underwood; Wing Hung Wong

Significance Isoform identification and discovery are an important goal for transcriptome analysis because the majority of human genes express multiple isoforms with context- and tissue-specific functions. Better annotation of isoforms will also benefit downstream analysis such as expression quantification. Current RNA-Seq methods based on short-read sequencing are not reliable for isoform discovery. In this study we developed a new method based on the combined analysis of short reads and long reads generated, respectively, by second- and third-generation sequencing and applied this method to obtain a comprehensive characterization of the transcriptome of the human embryonic stem cell. The results showed that large gain in sensitivity and specificity can be achieved with this strategy. Although transcriptional and posttranscriptional events are detected in RNA-Seq data from second-generation sequencing, full-length mRNA isoforms are not captured. On the other hand, third-generation sequencing, which yields much longer reads, has current limitations of lower raw accuracy and throughput. Here, we combine second-generation sequencing and third-generation sequencing with a custom-designed method for isoform identification and quantification to generate a high-confidence isoform dataset for human embryonic stem cells (hESCs). We report 8,084 RefSeq-annotated isoforms detected as full-length and an additional 5,459 isoforms predicted through statistical inference. Over one-third of these are novel isoforms, including 273 RNAs from gene loci that have not previously been identified. Further characterization of the novel loci indicates that a subset is expressed in pluripotent cells but not in diverse fetal and adult tissues; moreover, their reduced expression perturbs the network of pluripotency-associated genes. Results suggest that gene identification, even in well-characterized human cell lines and tissues, is likely far from complete.


Genome Research | 2013

RNA sequencing reveals a diverse and dynamic repertoire of the Xenopus tropicalis transcriptome over development

Meng How Tan; Kin Fai Au; Arielle L. Yablonovitch; Andrea E. Wills; Jason Chuang; Julie C. Baker; Wing Hung Wong; Jin Billy Li

The Xenopus embryo has provided key insights into fate specification, the cell cycle, and other fundamental developmental and cellular processes, yet a comprehensive understanding of its transcriptome is lacking. Here, we used paired end RNA sequencing (RNA-seq) to explore the transcriptome of Xenopus tropicalis in 23 distinct developmental stages. We determined expression levels of all genes annotated in RefSeq and Ensembl and showed for the first time on a genome-wide scale that, despite a general state of transcriptional silence in the earliest stages of development, approximately 150 genes are transcribed prior to the midblastula transition. In addition, our splicing analysis uncovered more than 10,000 novel splice junctions at each stage and revealed that many known genes have additional unannotated isoforms. Furthermore, we used Cufflinks to reconstruct transcripts from our RNA-seq data and found that ∼13.5% of the final contigs are derived from novel transcribed regions, both within introns and in intergenic regions. We then developed a filtering pipeline to separate protein-coding transcripts from noncoding RNAs and identified a confident set of 6686 noncoding transcripts in 3859 genomic loci. Since the current reference genome, XenTro3, consists of hundreds of scaffolds instead of full chromosomes, we also performed de novo reconstruction of the transcriptome using Trinity and uncovered hundreds of transcripts that are missing from the genome. Collectively, our data will not only aid in completing the assembly of the Xenopus tropicalis genome but will also serve as a valuable resource for gene discovery and for unraveling the fundamental mechanisms of vertebrate embryogenesis.


Nature Genetics | 2016

The primate-specific noncoding RNA HPAT5 regulates pluripotency during human preimplantation development and nuclear reprogramming

Jens Durruthy-Durruthy; Vittorio Sebastiano; Mark Wossidlo; Diana Cepeda; Jun Cui; Edward J. Grow; Jonathan Davila; Moritz Mall; Wing Hung Wong; Joanna Wysocka; Kin Fai Au; Renee A. Reijo Pera

Long intergenic noncoding RNAs (lincRNAs) are derived from thousands of loci in mammalian genomes and are frequently enriched in transposable elements (TEs). Although families of TE-derived lincRNAs have recently been implicated in the regulation of pluripotency, little is known of the specific functions of individual family members. Here we characterize three new individual TE-derived human lincRNAs, human pluripotency-associated transcripts 2, 3 and 5 (HPAT2, HPAT3 and HPAT5). Loss-of-function experiments indicate that HPAT2, HPAT3 and HPAT5 function in preimplantation embryo development to modulate the acquisition of pluripotency and the formation of the inner cell mass. CRISPR-mediated disruption of the genes for these lincRNAs in pluripotent stem cells, followed by whole-transcriptome analysis, identifies HPAT5 as a key component of the pluripotency network. Protein binding and reporter-based assays further demonstrate that HPAT5 interacts with the let-7 microRNA family. Our results indicate that unique individual members of large primate-specific lincRNA families modulate gene expression during development and differentiation to reinforce cell fate.


Science | 2014

Human tRNA Synthetase Catalytic Nulls with Diverse Functions

Wing-Sze Lo; Elisabeth Gardiner; Zhiwen Xu; Ching-Fun Lau; Feng Wang; Jie J. Zhou; John D. Mendlein; Leslie A. Nangle; Kyle P. Chiang; Xiang-Lei Yang; Kin Fai Au; Wing Hung Wong; Min Guo; Mingjie Zhang; Paul Schimmel

Evolving from an enzyme and into a regulator Proteins, the work-horses of the cell, are made on a messenger RNA (mRNA) template. An enzyme called aminoacyl tRNA synthetases (AARSs) attaches the correct amino acid to a transfer RNA so that mRNA is accurately translated. Over evolution, additional sequences have been added to AARSs. Lo et al. found a large number of AARS variants in which the domain responsible for enzyme function was deleted. Ninety-four such variants had diverse signaling activities. Thus, AARSs are used both as enzymes and alternately as regulators of signaling pathways. Science, this issue p. 328 Alternative splicing of aminoacyl transfer RNA synthetases ablates the catalytic domain to yield diverse alternative functions. Genetic efficiency in higher organisms depends on mechanisms to create multiple functions from single genes. To investigate this question for an enzyme family, we chose aminoacyl tRNA synthetases (AARSs). They are exceptional in their progressive and accretive proliferation of noncatalytic domains as the Tree of Life is ascended. Here we report discovery of a large number of natural catalytic nulls (CNs) for each human AARS. Splicing events retain noncatalytic domains while ablating the catalytic domain to create CNs with diverse functions. Each synthetase is converted into several new signaling proteins with biological activities “orthogonal” to that of the catalytic parent. We suggest that splice variants with nonenzymatic functions may be more general, as evidenced by recent findings of other catalytically inactive splice-variant enzymes.

Collaboration


Dive into the Kin Fai Au's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

John P. Cooke

Houston Methodist Hospital

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Joanne E. Nettleship

Wellcome Trust Centre for Human Genetics

View shared research outputs
Researchain Logo
Decentralizing Knowledge