Mihai Albu
University of Toronto
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Mihai Albu.
Nature | 2013
Debashish Ray; Hilal Kazan; Kate B. Cook; Matthew T. Weirauch; Hamed Shateri Najafabadi; Xiao Li; Serge Gueroussov; Mihai Albu; Hong Zheng; Ally Yang; Hong Na; Manuel Irimia; Leah H. Matzat; Ryan K. Dale; Sarah A. Smith; Christopher A. Yarosh; Seth M. Kelly; Behnam Nabet; D. Mecenas; Weimin Li; Rakesh S. Laishram; Mei Qiao; Howard D. Lipshitz; Fabio Piano; Anita H. Corbett; Russ P. Carstens; Brendan J. Frey; Richard A. Anderson; Kristen W. Lynch; Luiz O. F. Penalva
RNA-binding proteins are key regulators of gene expression, yet only a small fraction have been functionally characterized. Here we report a systematic analysis of the RNA motifs recognized by RNA-binding proteins, encompassing 205 distinct genes from 24 diverse eukaryotes. The sequence specificities of RNA-binding proteins display deep evolutionary conservation, and the recognition preferences for a large fraction of metazoan RNA-binding proteins can thus be inferred from their RNA-binding domain sequence. The motifs that we identify in vitro correlate well with in vivo RNA-binding data. Moreover, we can associate them with distinct functional roles in diverse types of post-transcriptional regulation, enabling new insights into the functions of RNA-binding proteins both in normal physiology and in human disease. These data provide an unprecedented overview of RNA-binding proteins and their targets, and constitute an invaluable resource for determining post-transcriptional regulatory mechanisms in eukaryotes.
Nature Biotechnology | 2015
Hamed Shateri Najafabadi; Sanie Mnaimneh; Frank W. Schmitges; Michael Garton; Kathy N. Lam; Ally Yang; Mihai Albu; Matthew T. Weirauch; Ernest Radovani; Philip M. Kim; Jack Greenblatt; Brendan J. Frey; Timothy R. Hughes
Cys2-His2 zinc finger (C2H2-ZF) proteins represent the largest class of putative human transcription factors. However, for most C2H2-ZF proteins it is unknown whether they even bind DNA or, if they do, to which sequences. Here, by combining data from a modified bacterial one-hybrid system with protein-binding microarray and chromatin immunoprecipitation analyses, we show that natural C2H2-ZFs encoded in the human genome bind DNA both in vitro and in vivo, and we infer the DNA recognition code using DNA-binding data for thousands of natural C2H2-ZF domains. In vivo binding data are generally consistent with our recognition code and indicate that C2H2-ZF proteins recognize more motifs than all other human transcription factors combined. We provide direct evidence that most KRAB-containing C2H2-ZF proteins bind specific endogenous retroelements (EREs), ranging from currently active to ancient families. The majority of C2H2-ZF proteins, including KRAB proteins, also show widespread binding to regulatory regions, indicating that the human genome contains an extensive and largely unstudied adaptive C2H2-ZF regulatory network that targets a diverse range of genes and pathways.
Cell | 2018
Samuel A. Lambert; Arttu Jolma; Laura F. Campitelli; Pratyush K. Das; Yimeng Yin; Mihai Albu; Xiaoting Chen; Jussi Taipale; Timothy R. Hughes; Matthew T. Weirauch
Transcription factors (TFs) recognize specific DNA sequences to control chromatin and transcription, forming a complex system that guides expression of the genome. Despite keen interest in understanding how TFs control gene expression, it remains challenging to determine how the precise genomic binding sites of TFs are specified and how TF binding ultimately relates to regulation of transcription. This review considers how TFs are identified and functionally characterized, principally through the lens of a catalog of over 1,600 likely human TFs and binding motifs for two-thirds of them. Major classes of human TFs differ markedly in their evolutionary trajectories and expression patterns, underscoring distinct functions. TFs likewise underlie many different aspects of human physiology, disease, and variation, highlighting the importance of continued effort to understand TF-mediated gene regulation.
eLife | 2015
Kamesh Narasimhan; Samuel A. Lambert; Ally Yang; Jeremy Riddell; Sanie Mnaimneh; Hong Zheng; Mihai Albu; Hamed Shateri Najafabadi; John S. Reece-Hoyes; Juan I. Fuxman Bass; Albertha J. M. Walhout; Matthew T. Weirauch; Timothy R. Hughes
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001
Bioinformatics | 2015
Hamed Shateri Najafabadi; Mihai Albu; Timothy R. Hughes
Summary: Current methods for motif discovery from chromatin immunoprecipitation followed by sequencing (ChIP-seq) data often identify non-targeted transcription factor (TF) motifs, and are even further limited when peak sequences are similar due to common ancestry rather than common binding factors. The latter aspect particularly affects a large number of proteins from the Cys2His2 zinc finger (C2H2-ZF) class of TFs, as their binding sites are often dominated by endogenous retroelements that have highly similar sequences. Here, we present recognition code-assisted discovery of regulatory elements (RCADE) for motif discovery from C2H2-ZF ChIP-seq data. RCADE combines predictions from a DNA recognition code of C2H2-ZFs with ChIP-seq data to identify models that represent the genuine DNA binding preferences of C2H2-ZF proteins. We show that RCADE is able to identify generalizable binding models even from peaks that are exclusively located within the repeat regions of the genome, where state-of-the-art motif finding approaches largely fail. Availability and implementation: RCADE is available as a webserver and also for download at http://rcade.ccbr.utoronto.ca/. Supplementary information: Supplementary data are available at Bioinformatics online. Contact: [email protected]
Bioinformatics | 2016
Samuel A. Lambert; Mihai Albu; Timothy R. Hughes; Hamed Shateri Najafabadi
Measuring motif similarity is essential for identifying functionally related transcription factors (TFs) and RNA-binding proteins, and for annotating de novo motifs. Here, we describe Motif Similarity Based on Affinity of Targets (MoSBAT), an approach for measuring the similarity of motifs by computing their affinity profiles across a large number of random sequences. We show that MoSBAT successfully associates de novo ChIP-seq motifs with their respective TFs, accurately identifies motifs that are obtained from the same TF in different in vitro assays, and quantitatively reflects the similarity of in vitro binding preferences for pairs of TFs. AVAILABILITY AND IMPLEMENTATION MoSBAT is available as a webserver at mosbat.ccbr.utoronto.ca, and for download at github.com/csglab/MoSBAT. CONTACT [email protected] or [email protected] information: Supplementary data are available at Bioinformatics online.
G3: Genes, Genomes, Genetics | 2018
Marjan Barazandeh; Samuel A. Lambert; Mihai Albu; Timothy R. Hughes
KRAB C2H2 zinc finger proteins (KZNFs) are the largest and most diverse family of human transcription factors, likely due to diversifying selection driven by novel endogenous retroelements (EREs), but the vast majority lack binding motifs or functional data. Two recent studies analyzed a majority of the human KZNFs using either ChIP-seq (60 proteins) or ChIP-exo (221 proteins) in the same cell type (HEK293). The ChIP-exo paper did not describe binding motifs, however. Thirty-nine proteins are represented in both studies, enabling the systematic comparison of the data sets presented here. Typically, only a minority of peaks overlap, but the two studies nonetheless display significant similarity in ERE binding for 32/39, and yield highly similar DNA binding motifs for 23 and related motifs for 34 (MoSBAT similarity score >0.5 and >0.2, respectively). Thus, there is overall (albeit imperfect) agreement between the two studies. For the 242 proteins represented in at least one study, we selected a highest-confidence motif for each protein, utilizing several motif-derivation approaches, and evaluating motifs within and across data sets. Peaks for the majority (158) are enriched (96% with AUC >0.6 predicting peak vs. nonpeak) for a motif that is supported by the C2H2 “recognition code,” consistent with intrinsic sequence specificity driving DNA binding in cells. An additional 63 yield motifs enriched in peaks, but not supported by the recognition code, which could reflect indirect binding. Altogether, these analyses validate both data sets, and provide a reference motif set with associated quality metrics.
Cell | 2014
Matthew T. Weirauch; Ally Yang; Mihai Albu; Alejandro Montenegro-Montero; Philipp Drewe; Hamed Shateri Najafabadi; Samuel A. Lambert; Ishminder Mann; Kate B. Cook; Hong Zheng; Alejandra Goity; Harm van Bakel; Jean-Claude Lozano; Mary Galli; Mathew G. Lewsey; Eryong Huang; Tuhin Mukherjee; Xiaoting Chen; John S. Reece-Hoyes; Sridhar Govindarajan; Gad Shaulsky; Albertha J. M. Walhout; François-Yves Bouget; Gunnar Rätsch; Luis F. Larrondo; Joseph R. Ecker; Timothy R. Hughes
Archive | 2015
Kamesh Narasimhan; Samuel A. Lambert; Ally Yang; Jeremy Riddell; Sanie Mnaimneh; Hong Zheng; Mihai Albu; Hamed Shateri Najafabadi; John S. Reece Hoyes; Juan I. Fuxman Bass; Albertha J. M. Walhout; Matthew T. Weirauch; Timothy R. Hughes
Archive | 2015
Kamesh Narasimhan; Samuel A. Lambert; Ally Yang; Jeremy Riddell; Sanie Mnaimneh; Hong Zheng; Mihai Albu; Hamed Shateri Najafabadi; John S. Reece Hoyes; Juan I. Fuxman Bass; Albertha J. M. Walhout; Matthew T. Weirauch; Timothy R. Hughes