Yamile Marquez
Medical University of Vienna
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Yamile Marquez.
Genome Research | 2012
Yamile Marquez; John W. S. Brown; Craig G. Simpson; Andrea Barta; Maria Kalyna
Alternative splicing (AS) is a key regulatory mechanism that contributes to transcriptome and proteome diversity. As very few genome-wide studies analyzing AS in plants are available, we have performed high-throughput sequencing of a normalized cDNA library which resulted in a high coverage transcriptome map of Arabidopsis. We detect ∼150,000 splice junctions derived mostly from typical plant introns, including an eightfold increase in the number of U12 introns (2069). Around 61% of multiexonic genes are alternatively spliced under normal growth conditions. Moreover, we provide experimental validation of 540 AS transcripts (from 256 genes coding for important regulatory factors) using high-resolution RT-PCR and Sanger sequencing. Intron retention (IR) is the most frequent AS event (∼40%), but many IRs have relatively low read coverage and are less well-represented in assembled transcripts. Additionally, ∼51% of Arabidopsis genes produce AS transcripts which do not involve IR. Therefore, the significance of IR in generating transcript diversity was generally overestimated in previous assessments. IR analysis allowed the identification of a large set of cryptic introns inside annotated coding exons. Importantly, a significant fraction of these cryptic introns are spliced out in frame, indicating a role in protein diversity. Furthermore, we show extensive AS coupled to nonsense-mediated decay in AFC2, encoding a highly conserved LAMMER kinase which phosphorylates splicing factors, thus establishing a complex loop in AS regulation. We provide the most comprehensive analysis of AS to date which will serve as a valuable resource for the plant community to study transcriptome complexity and gene regulation.
The Plant Cell | 2013
Yamile Marquez; Maria Kalyna; Andrea Barta
Alternative splicing (AS) of precursor mRNAs (pre-mRNAs) from multiexon genes allows organisms to increase their coding potential and regulate gene expression through multiple mechanisms. Recent transcriptome-wide analysis of AS using RNA sequencing has revealed that AS is highly pervasive in plants. Pre-mRNAs from over 60% of intron-containing genes undergo AS to produce a vast repertoire of mRNA isoforms. The functions of most splice variants are unknown. However, emerging evidence indicates that splice variants increase the functional diversity of proteins. Furthermore, AS is coupled to transcript stability and translation through nonsense-mediated decay and microRNA-mediated gene regulation. Widespread changes in AS in response to developmental cues and stresses suggest a role for regulated splicing in plant development and stress responses. Here, we review recent progress in uncovering the extent and complexity of the AS landscape in plants, its regulation, and the roles of AS in gene regulation. The prevalence of AS in plants has raised many new questions that require additional studies. New tools based on recent technological advances are allowing genome-wide analysis of RNA elements in transcripts and of chromatin modifications that regulate AS. Application of these tools in plants will provide significant new insights into AS regulation and crosstalk between AS and other layers of gene regulation.
Nucleic Acids Research | 2012
Maria Kalyna; Craig G. Simpson; Naeem H. Syed; Dominika Lewandowska; Yamile Marquez; Branislav Kusenda; Jacqueline Marshall; John Fuller; Linda Cardle; James W. McNicol; Huy Q. Dinh; Andrea Barta; John W. S. Brown
Alternative splicing (AS) coupled to nonsense-mediated decay (NMD) is a post-transcriptional mechanism for regulating gene expression. We have used a high-resolution AS RT–PCR panel to identify endogenous AS isoforms which increase in abundance when NMD is impaired in the Arabidopsis NMD factor mutants, upf1-5 and upf3-1. Of 270 AS genes (950 transcripts) on the panel, 102 transcripts from 97 genes (32%) were identified as NMD targets. Extrapolating from these data around 13% of intron-containing genes in the Arabidopsis genome are potentially regulated by AS/NMD. This cohort of naturally occurring NMD-sensitive AS transcripts also allowed the analysis of the signals for NMD in plants. We show the importance of AS in introns in 5′ or 3′UTRs in modulating NMD-sensitivity of mRNA transcripts. In particular, we identified upstream open reading frames overlapping the main start codon as a new trigger for NMD in plants and determined that NMD is induced if 3′-UTRs were >350 nt. Unexpectedly, although many intron retention transcripts possess NMD features, they are not sensitive to NMD. Finally, we have shown that AS/NMD regulates the abundance of transcripts of many genes important for plant development and adaptation including transcription factors, RNA processing factors and stress response genes.
Trends in Plant Science | 2012
Naeem H. Syed; Maria Kalyna; Yamile Marquez; Andrea Barta; John W. S. Brown
More than 60% of intron-containing genes undergo alternative splicing (AS) in plants. This number will increase when AS in different tissues, developmental stages, and environmental conditions are explored. Although the functional impact of AS on protein complexity is still understudied in plants, recent examples demonstrate its importance in regulating plant processes. AS also regulates transcript levels and the link with nonsense-mediated decay and generation of unproductive mRNAs illustrate the need for both transcriptional and AS data in gene expression analyses. AS has influenced the evolution of the complex networks of regulation of gene expression and variation in AS contributed to adaptation of plants to their environment and therefore will impact strategies for improving plant and crop phenotypes.
Nucleic Acids Research | 2013
Olga Bannikova; Marek Zywicki; Yamile Marquez; Tatsiana Skrahina; Maria Kalyna; Andrea Barta
AtCyp59 is a multidomain cyclophilin containing a peptidyl-prolyl cis/trans isomerase (PPIase) domain and an evolutionarily highly conserved RRM domain. Deregulation of this class of cyclophilins has been shown to affect transcription and to influence phosphorylation of the C-terminal repeat domain of the largest subunit of the RNA polymerase II. We used a genomic SELEX method for identifying RNA targets of AtCyp59. Analysis of the selected RNAs revealed an RNA-binding motif (G[U/C]N[G/A]CC[A/G]) and we show that it is evolutionarily conserved. Binding to this motif was verified by gel shift assays in vitro and by RNA immunopreciptation assays of AtCyp59 in vivo. Most importantly, we show that binding also occurs on unprocessed transcripts in vivo and that binding of specific RNAs inhibits the PPIase activity of AtCyp59 in vitro. Surprisingly, genome-wide analysis showed that the RNA motif is present in about 70% of the annotated transcripts preferentially in exons. Taken together, the available data suggest that these cyclophilins might have an important function in transcription regulation.
Nucleic Acids Research | 2017
Runxuan Zhang; Cristiane P. G. Calixto; Yamile Marquez; Peter Venhuizen; Nikoleta A. Tzioutziou; Wenbin Guo; Mark Spensley; Juan Carlos Entizne; Dominika Lewandowska; Sara ten Have; Nicolas Frei dit Frey; Heribert Hirt; Allan B. James; Hugh G. Nimmo; Andrea Barta; Maria Kalyna; John W. S. Brown
Abstract Alternative splicing generates multiple transcript and protein isoforms from the same gene and thus is important in gene expression regulation. To date, RNA-sequencing (RNA-seq) is the standard method for quantifying changes in alternative splicing on a genome-wide scale. Understanding the current limitations of RNA-seq is crucial for reliable analysis and the lack of high quality, comprehensive transcriptomes for most species, including model organisms such as Arabidopsis, is a major constraint in accurate quantification of transcript isoforms. To address this, we designed a novel pipeline with stringent filters and assembled a comprehensive Reference Transcript Dataset for Arabidopsis (AtRTD2) containing 82,190 non-redundant transcripts from 34 212 genes. Extensive experimental validation showed that AtRTD2 and its modified version, AtRTD2-QUASI, for use in Quantification of Alternatively Spliced Isoforms, outperform other available transcriptomes in RNA-seq analysis. This strategy can be implemented in other species to build a pipeline for transcript-level expression and alternative splicing analyses.
New Phytologist | 2015
Runxuan Zhang; Cristiane P. G. Calixto; Nikoleta A. Tzioutziou; Allan B. James; Craig G. Simpson; Wenbin Guo; Yamile Marquez; Maria Kalyna; Rob Patro; Eduardo Eyras; Andrea Barta; Hugh G. Nimmo; John W. S. Brown
Summary RNA‐sequencing (RNA‐seq) allows global gene expression analysis at the individual transcript level. Accurate quantification of transcript variants generated by alternative splicing (AS) remains a challenge. We have developed a comprehensive, nonredundant Arabidopsis reference transcript dataset (AtRTD) containing over 74 000 transcripts for use with algorithms to quantify AS transcript isoforms in RNA‐seq. The AtRTD was formed by merging transcripts from TAIR10 and novel transcripts identified in an AS discovery project. We have estimated transcript abundance in RNA‐seq data using the transcriptome‐based alignment‐free programmes Sailfish and Salmon and have validated quantification of splicing ratios from RNA‐seq by high resolution reverse transcription polymerase chain reaction (HR RT‐PCR). Good correlations between splicing ratios from RNA‐seq and HR RT‐PCR were obtained demonstrating the accuracy of abundances calculated for individual transcripts in RNA‐seq. The AtRTD is a resource that will have immediate utility in analysing Arabidopsis RNA‐seq data to quantify differential transcript abundance and expression.
The Plant Cell | 2015
John W. S. Brown; Craig G. Simpson; Yamile Marquez; Geoffrey M. Gadd; Andrea Barta; Maria Kalyna
Transcript annotation in plant databases is incomplete and often inaccurate, leading to misinterpretation. As more and more RNA-seq data are generated, plant scientists need to be aware of potential pitfalls and understand the nature and impact of specific alternative splicing transcripts on protein production. A primary area of concern and the topic of this article is the (mis)annotation of open reading frames and premature termination codons. The basic message is that to adequately address expression and functions of transcript isoforms, it is necessary to be able to predict their fate in terms of whether protein isoforms are generated or specific transcripts are unproductive or degraded.
bioRxiv | 2016
Runxuan Zhang; Cristiane P. G. Calixto; Yamile Marquez; Peter Venhuizen; Nikoleta A. Tzioutziou; Wenbin Guo; Mark Spensley; Nicolas Frei dit Frey; Heribert Hirt; Allan B. James; Hugh G. Nimmo; Andrea Barta; Maria Kalyna; John W. S. Brown
Background Alternative splicing is the major post-transcriptional mechanism by which gene expression is regulated and affects a wide range of processes and responses in most eukaryotic organisms. RNA-sequencing (RNA-seq) can generate genome-wide quantification of individual transcript isoforms to identify changes in expression and alternative splicing. RNA-seq is an essential modern tool but its ability to accurately quantify transcript isoforms depends on the diversity, completeness and quality of the transcript information. Results We have developed a new Reference Transcript Dataset for Arabidopsis (AtRTD2) for RNA-seq analysis containing over 82k non-redundant transcripts, whereby 74,194 transcripts originate from 27,667 protein-coding genes. A total of 13,524 protein-coding genes have at least one alternatively spliced transcript in AtRTD2 such that about 60% of the 22,453 protein-coding, intron-containing genes in Arabidopsis undergo alternative splicing. More than 600 putative U12 introns were identified in more than 2,000 transcripts. AtRTD2 was generated from transcript assemblies of ca. 8.5 billion pairs of reads from 285 RNA-seq data sets obtained from 129 RNA-seq libraries and merged along with the previous version, AtRTD, and Araport11 transcript assemblies. AtRTD2 increases the diversity of transcripts and through application of stringent filters represents the most extensive and accurate transcript collection for Arabidopsis to date. We have demonstrated a generally good correlation of alternative splicing ratios from RNA-seq data analysed by Salmon and experimental data from high resolution RT-PCR. However, we have observed inaccurate quantification of transcript isoforms for genes with multiple transcripts which have variation in the lengths of their UTRs. This variation is not effectively corrected in RNA-seq analysis programmes and will therefore impact RNA-seq analyses generally. To address this, we have tested different genome-wide modifications of AtRTD2 to improve transcript quantification and alternative splicing analysis. As a result, we release AtRTD2-QUASI specifically for use in Quantification of Alternatively Spliced Isoforms and demonstrate that it out-performs other available transcriptomes for RNA-seq analysis. Conclusions We have generated a new transcriptome resource for RNA-seq analyses in Arabidopsis (AtRTD2) designed to address quantification of different isoforms and alternative splicing in gene expression studies. Experimental validation of alternative splicing changes identified inaccuracies in transcript quantification due to UTR length variation. To solve this problem, we also release a modified reference transcriptome, AtRTD2-QUASI for quantification of transcript isoforms, which shows high correlation with experimental data.
Genome Research | 2015
Yamile Marquez; Markus Höpfler; Zahra Ayatollahi; Andrea Barta; Maria Kalyna