Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sam Griffiths-Jones is active.

Publication


Featured researches published by Sam Griffiths-Jones.


Nucleic Acids Research | 2007

miRBase: tools for microRNA genomics

Sam Griffiths-Jones; Harpreet K Saini; Stijn van Dongen; Anton J. Enright

miRBase is the central online repository for microRNA (miRNA) nomenclature, sequence data, annotation and target prediction. The current release (10.0) contains 5071 miRNA loci from 58 species, expressing 5922 distinct mature miRNA sequences: a growth of over 2000 sequences in the past 2 years. miRBase provides a range of data to facilitate studies of miRNA genomics: all miRNAs are mapped to their genomic coordinates. Clusters of miRNA sequences in the genome are highlighted, and can be defined and retrieved with any inter-miRNA distance. The overlap of miRNA sequences with annotated transcripts, both protein- and non-coding, are described. Finally, graphical views of the locations of a wide range of genomic features in model organisms allow for the first time the prediction of the likely boundaries of many miRNA primary transcripts. miRBase is available at http://microrna.sanger.ac.uk/.


Nucleic Acids Research | 2006

miRBase: microRNA sequences, targets and gene nomenclature

Sam Griffiths-Jones; Russell Grocock; Stijn van Dongen; Alex Bateman; Anton J. Enright

The miRBase database aims to provide integrated interfaces to comprehensive microRNA sequence data, annotation and predicted gene targets. miRBase takes over functionality from the microRNA Registry and fulfils three main roles: the miRBase Registry acts as an independent arbiter of microRNA gene nomenclature, assigning names prior to publication of novel miRNA sequences. miRBase Sequences is the primary online repository for miRNA sequence data and annotation. miRBase Targets is a comprehensive new database of predicted miRNA target genes. miRBase is available at .


Nucleic Acids Research | 2011

miRBase: integrating microRNA annotation and deep-sequencing data

Ana Kozomara; Sam Griffiths-Jones

miRBase is the primary online repository for all microRNA sequences and annotation. The current release (miRBase 16) contains over 15u2009000 microRNA gene loci in over 140 species, and over 17u2009000 distinct mature microRNA sequences. Deep-sequencing technologies have delivered a sharp rise in the rate of novel microRNA discovery. We have mapped reads from short RNA deep-sequencing experiments to microRNAs in miRBase and developed web interfaces to view these mappings. The user can view all read data associated with a given microRNA annotation, filter reads by experiment and count, and search for microRNAs by tissue- and stage-specific expression. These data can be used as a proxy for relative expression levels of microRNA sequences, provide detailed evidence for microRNA annotations and alternative isoforms of mature microRNAs, and allow us to revisit previous annotations. miRBase is available online at: http://www.mirbase.org/.


Nature | 2005

Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae

James E. Galagan; Sarah E. Calvo; Christina A. Cuomo; Li-Jun Ma; Jennifer R. Wortman; Serafim Batzoglou; Su-In Lee; Meray Baştürkmen; Christina C. Spevak; John Clutterbuck; Vladimir V. Kapitonov; Jerzy Jurka; Claudio Scazzocchio; Mark L. Farman; Jonathan Butler; Seth Purcell; Steve Harris; Gerhard H. Braus; Oliver W. Draht; Silke Busch; Christophe d'Enfert; Christiane Bouchier; Gustavo H. Goldman; Deborah Bell-Pedersen; Sam Griffiths-Jones; John H. Doonan; Jae-Hyuk Yu; Kay Vienken; Arnab Pain; Michael Freitag

The aspergilli comprise a diverse group of filamentous fungi spanning over 200 million years of evolution. Here we report the genome sequence of the model organism Aspergillus nidulans, and a comparative study with Aspergillus fumigatus, a serious human pathogen, and Aspergillus oryzae, used in the production of sake, miso and soy sauce. Our analysis of genome structure provided a quantitative evaluation of forces driving long-term eukaryotic genome evolution. It also led to an experimentally validated model of mating-type locus evolution, suggesting the potential for sexual reproduction in A. fumigatus and A. oryzae. Our analysis of sequence conservation revealed over 5,000 non-coding regions actively conserved across all three species. Within these regions, we identified potential functional elements including a previously uncharacterized TPP riboswitch and motifs suggesting regulation in filamentous fungi by Puf family genes. We further obtained comparative and experimental evidence indicating widespread translational regulation by upstream open reading frames. These results enhance our understanding of these widely studied fungi as well as provide new insight into eukaryotic genome evolution and gene regulation.


Nucleic Acids Research | 2004

The microRNA Registry

Sam Griffiths-Jones

The miRNA Registry provides a service for the assignment of miRNA gene names prior to publication. A comprehensive and searchable database of published miRNA sequences is accessible via a web interface (http://www.sanger.ac.uk/Software/Rfam/mirna/), and all sequence and annotation data are freely available for download. Release 2.0 of the database contains 506 miRNA entries from six organisms.


The Plant Cell | 2008

Criteria for Annotation of Plant MicroRNAs

Blake C. Meyers; Michael J. Axtell; Bonnie Bartel; David P. Bartel; David C. Baulcombe; John L. Bowman; Xiaofeng Cao; James C. Carrington; Xuemei Chen; Pamela J. Green; Sam Griffiths-Jones; Steven E. Jacobsen; Allison C. Mallory; Robert A. Martienssen; R. Scott Poethig; Yijun Qi; Hervé Vaucheret; Olivier Voinnet; Yuichiro Watanabe; Detlef Weigel; Jian-Kang Zhu

MicroRNAs (miRNAs) are ∼21 nucleotide noncoding RNAs produced by Dicer-catalyzed excision from stem-loop precursors. Many plant miRNAs play critical roles in development, nutrient homeostasis, abiotic stress responses, and pathogen responses via interactions with specific target mRNAs. miRNAs are not the only Dicer-derived small RNAs produced by plants: A substantial amount of the total small RNA abundance and an overwhelming amount of small RNA sequence diversity is contributed by distinct classes of 21- to 24-nucleotide short interfering RNAs. This fact, coupled with the rapidly increasing rate of plant small RNA discovery, demands an increased rigor in miRNA annotations. Herein, we update the specific criteria required for the annotation of plant miRNAs, including experimental and computational data, as well as refinements to standard nomenclature.


PLOS Biology | 2003

The genome sequence of Caenorhabditis briggsae: A platform for comparative genomics

Lincoln Stein; Zhirong Bao; Darin Blasiar; Thomas Blumenthal; Michael R. Brent; Nansheng Chen; Asif T. Chinwalla; Laura Clarke; Chris Clee; Avril Coghlan; Alan Coulson; Peter D'Eustachio; David H. A. Fitch; Lucinda A. Fulton; Robert Fulton; Sam Griffiths-Jones; Todd W. Harris; LaDeana W. Hillier; Ravi S. Kamath; Patricia E. Kuwabara; Elaine R. Mardis; Marco A. Marra; Tracie L. Miner; Patrick Minx; James C. Mullikin; Robert W. Plumb; Jane Rogers; Jacqueline E. Schein; Marc Sohrmann; John Spieth

The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C. briggsae, we found strong evidence for 1,300 new C. elegans genes. In addition, comparisons of the two genomes will help to understand the evolutionary forces that mold nematode genomes.


Bioinformatics | 2005

RALEE---RNA ALignment Editor in Emacs

Sam Griffiths-Jones

UNLABELLEDnProduction of high quality multiple sequence alignments of structured RNAs relies on an iterative combination of manual editing and structure prediction. An essential feature of an RNA alignment editor is the facility to mark-up the alignment based on how it matches a given secondary structure prediction, but few available alignment editors offer such a feature. The RALEE (RNA ALignment Editor in Emacs) tool provides a simple environment for RNA multiple sequence alignment editing, including structure-specific colour schemes, utilizing helper applications for structure prediction and many more conventional editing functions. This is accomplished by extending the commonly used text editor, Emacs, which is available for Linux, most UNIX systems, Windows and Mac OS.nnnAVAILABILITYnThe ELISP source code for RALEE is freely available from http://www.sanger.ac.uk/Users/sgj/ralee/ along with documentation and [email protected]


Biochemical Society Transactions | 2001

Plant protein families and their relationships to food allergy

Peter R. Shewry; Frédéric Beaudoin; John Jenkins; Sam Griffiths-Jones; E. N. C. Mills

The analysis of plant proteins has a long and distinguished history, with work dating back over 250 years. Much of the work has focused on seed proteins, which are important in animal nutrition and food processing. Early studies classified plant proteins into groups based on solubility (Osborne fractions) or protein function. More recently, families have been defined based on stuctural and evolutionary relationships. One of the most widespread groups of plant proteins is the prolaminin superfamily, which comprises cereal seed storage proteins, a range of low-molecular-mass sulphur-rich proteins (many of which are located in seeds) and some cell wall glycoproteins. This superfamily includes several major types of plant allergen: non-specific lipid transfer proteins, cereal seed inhibitors of alpha-amylase and/or trypsin, and 2 S albumin storage proteins of dicotyledonous seeds.


Current protocols in human genetics | 2003

Identifying Protein Domains with the Pfam Database

Robert D. Finn; Sam Griffiths-Jones; Alex Bateman

Pfam is a database of such protein domain families, with each family represented by multiple sequence alignments and profile hidden Markov models (HMMs). In addition, each family has associated annotation, literature references and links to other databases. The entries in Pfam are available via the worldwide web and in flatfile format. This unit contains detailed information on how to access and utilise the information present in the Pfam database, namely the families, multiple alignments and annotation. Details on running Pfam, both remotely and locally are presented.

Collaboration


Dive into the Sam Griffiths-Jones's collaboration.

Top Co-Authors

Avatar

Alex Bateman

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Anton J. Enright

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Stijn van Dongen

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

David P. Bartel

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

James C. Carrington

Donald Danforth Plant Science Center

View shared research outputs
Top Co-Authors

Avatar

Xuemei Chen

University of California

View shared research outputs
Researchain Logo
Decentralizing Knowledge