
Publication


Featured research published by Peter J. A. Cock.


Bioinformatics | 2009

Biopython: freely available Python tools for computational molecular biology and bioinformatics.

Peter J. A. Cock; Tiago Antao; Jeffrey T. Chang; Brad Chapman; Cymon J. Cox; Andrew Dalke; Iddo Friedberg; Thomas Hamelryck; Frank Kauff; Bartosz Wilczyński; Michiel J. L. de Hoon

Summary: The Biopython project is a mature open source international collaboration of volunteer developers, providing Python libraries for a wide range of bioinformatics problems. Biopython includes modules for reading and writing different sequence file formats and multiple sequence alignments, dealing with 3D macromolecular structures, interacting with common tools such as BLAST, ClustalW and EMBOSS, accessing key online databases, as well as providing numerical methods for statistical learning. Availability: Biopython is freely available, with documentation and source code at www.biopython.org under the Biopython license. Contact: All queries should be directed to the Biopython mailing lists, see www.biopython.org/wiki/[email protected].


Nucleic Acids Research | 2010

The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants

Peter J. A. Cock; Christopher J. Fields; Naohisa Goto; Michael L. Heuer; Peter M. Rice

FASTQ has emerged as a common file format for sharing sequencing read data combining both the sequence and an associated per base quality score, despite lacking any formal definition to date, and existing in at least three incompatible variants. This article defines the FASTQ format, covering the original Sanger standard, the Solexa/Illumina variants and conversion between them, based on publicly available information such as the MAQ documentation and conventions recently agreed by the Open Bioinformatics Foundation projects Biopython, BioPerl, BioRuby, BioJava and EMBOSS. Being an open access publication, it is hoped that this description, with the example files provided as Supplementary Data, will serve in future as a reference for this important file format.
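The conversion between variants mentioned above can be illustrated with a minimal sketch. It assumes the commonly described encodings, which the abstract itself does not spell out: Sanger FASTQ stores PHRED scores at ASCII offset 33, Solexa FASTQ stores Solexa scores at offset 64, and Illumina 1.3+ FASTQ stores PHRED scores at offset 64. The function names are illustrative only and are not taken from any of the libraries named in the abstract.

```python
import math

# Assumed ASCII offsets for the three FASTQ variants.
SANGER_OFFSET = 33    # Sanger: PHRED scores
SOLEXA_OFFSET = 64    # Solexa: Solexa (log-odds) scores
ILLUMINA_OFFSET = 64  # Illumina 1.3+: PHRED scores


def solexa_to_phred(q_solexa):
    """Map a Solexa score to the equivalent PHRED score (float)."""
    return 10 * math.log10(10 ** (q_solexa / 10) + 1)


def phred_to_solexa(q_phred):
    """Map a PHRED score to a Solexa score; undefined for q_phred <= 0."""
    return 10 * math.log10(10 ** (q_phred / 10) - 1)


def solexa_string_to_sanger(qual):
    """Re-encode a Solexa FASTQ quality string as a Sanger quality string."""
    out = []
    for ch in qual:
        q_solexa = ord(ch) - SOLEXA_OFFSET
        q_phred = round(solexa_to_phred(q_solexa))
        out.append(chr(q_phred + SANGER_OFFSET))
    return "".join(out)


def illumina_string_to_sanger(qual):
    """Re-encode an Illumina 1.3+ quality string as Sanger (offset shift only)."""
    return "".join(chr(ord(ch) - ILLUMINA_OFFSET + SANGER_OFFSET) for ch in qual)


if __name__ == "__main__":
    print(illumina_string_to_sanger("hhh@"))
    print(solexa_string_to_sanger("hh@;"))
```

The Illumina 1.3+ to Sanger case is a pure offset shift because both use PHRED scores, whereas Solexa scores use a different log-odds scale and need the numerical mapping before re-encoding.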


Briefings in Bioinformatics | 2013

Using Tablet for visual exploration of second-generation sequencing data

Iain Milne; Gordon Stephen; Micha Bayer; Peter J. A. Cock; Leighton Pritchard; Linda Cardle; Paul D. Shaw; A. David Marshall

The advent of second-generation sequencing (2GS) has provided a range of significant new challenges for the visualization of sequence assemblies. These include the large volume of data being generated, short-read lengths and different data types and data formats associated with the diversity of new sequencing technologies. This article illustrates how Tablet, a high-performance graphical viewer for visualization of 2GS assemblies and read mappings, plays an important role in the analysis of these data. We present Tablet, and through a selection of use cases, demonstrate its value in quality assurance and scientific discovery, through features such as whole-reference coverage overviews, variant highlighting, paired-end read mark-up, GFF3-based feature tracks and protein translations. We discuss the computing and visualization techniques utilized to provide a rich and responsive graphical environment that enables users to view a range of file formats with ease. Tablet installers can be freely downloaded from http://bioinf.hutton.ac.uk/tablet in 32- or 64-bit versions for Windows, OS X, Linux or Solaris. For further details on Tablet, contact [email protected].


PLOS Pathogens | 2011

Genomic insights into the origin of parasitism in the emerging plant pathogen Bursaphelenchus xylophilus.

Taisei Kikuchi; James A. Cotton; Jonathan J. Dalzell; Koichi Hasegawa; Natsumi Kanzaki; Paul McVeigh; Takuma Takanashi; Isheng J. Tsai; Samuel A. Assefa; Peter J. A. Cock; Thomas D. Otto; Martin Hunt; Adam J. Reid; Alejandro Sanchez-Flores; Kazuko Tsuchihara; Toshiro Yokoi; Mattias C. Larsson; Johji Miwa; Aaron G. Maule; Norio Sahashi; John T. Jones; Matthew Berriman

Bursaphelenchus xylophilus is the nematode responsible for a devastating epidemic of pine wilt disease in Asia and Europe, and represents a recent, independent origin of plant parasitism in nematodes, ecologically and taxonomically distinct from other nematodes for which genomic data is available. As well as being an important pathogen, the B. xylophilus genome thus provides a unique opportunity to study the evolution and mechanism of plant parasitism. Here, we present a high-quality draft genome sequence from an inbred line of B. xylophilus, and use this to investigate the biological basis of its complex ecology which combines fungal feeding, plant parasitic and insect-associated stages. We focus particularly on putative parasitism genes as well as those linked to other key biological processes and demonstrate that B. xylophilus is well endowed with RNA interference effectors, peptidergic neurotransmitters (including the first description of ins genes in a parasite), stress response and developmental genes and has a contracted set of chemosensory receptors. B. xylophilus has the largest number of digestive proteases known for any nematode and displays expanded families of lysosome pathway genes, ABC transporters and cytochrome P450 pathway genes. This expansion in digestive and detoxification proteins may reflect the unusual diversity in foods it exploits and environments it encounters during its life cycle. In addition, B. xylophilus possesses a unique complement of plant cell wall modifying proteins acquired by horizontal gene transfer, underscoring the impact of this process on the evolution of plant parasitism by nematodes. Together with the lack of proteins homologous to effectors from other plant parasitic nematodes, this confirms the distinctive molecular basis of plant parasitism in the Bursaphelenchus lineage. The genome sequence of B. xylophilus adds to the diversity of genomic data for nematodes, and will be an important resource in understanding the biology of this unusual parasite.


BMC Genomics | 2012

Identification and localisation of the NB-LRR gene family within the potato genome.

Florian Jupe; Leighton Pritchard; Graham J. Etherington; Katrin MacKenzie; Peter J. A. Cock; Frank Wright; Sanjeev Kumar Sharma; Dan M. Bolser; Glenn J. Bryan; Jonathan D. G. Jones; Ingo Hein

Background: The potato genome sequence derived from the Solanum tuberosum Group Phureja clone DM1-3 516 R44 provides unparalleled insight into the genome composition and organisation of this important crop. A key class of genes that comprises the vast majority of plant resistance (R) genes contains a nucleotide-binding and leucine-rich repeat domain, and is collectively known as NB-LRRs. Results: As part of an effort to accelerate the process of functional R gene isolation, we performed an amino acid motif based search of the annotated potato genome and identified 438 NB-LRR type genes among the ~39,000 potato gene models. Of the predicted genes, 77 contain an N-terminal toll/interleukin 1 receptor (TIR)-like domain, and 107 of the remaining 361 non-TIR genes contain an N-terminal coiled-coil (CC) domain. Physical map positions were established for 370 predicted NB-LRR genes across all 12 potato chromosomes. The majority of NB-LRRs are physically organised within 63 identified clusters, of which 50 are homogeneous in that they contain NB-LRRs derived from a recent common ancestor. Conclusions: By establishing the phylogenetic and positional relationship of potato NB-LRRs, our analysis offers significant insight into the evolution of potato R genes. Furthermore, the data provide a blueprint for future efforts to identify and more rapidly clone functional NB-LRR genes from Solanum species.


Plant Journal | 2013

Resistance gene enrichment sequencing (RenSeq) enables reannotation of the NB‐LRR gene family from sequenced plant genomes and rapid mapping of resistance loci in segregating populations

Florian Jupe; Kamil Witek; Walter Verweij; Jadwiga Śliwka; Leighton Pritchard; Graham J. Etherington; Daniel MacLean; Peter J. A. Cock; Richard M. Leggett; Glenn J. Bryan; Linda Cardle; Ingo Hein; Jonathan D. G. Jones

Summary: RenSeq is an NB-LRR (nucleotide binding-site leucine-rich repeat) gene-targeted, Resistance gene enrichment and sequencing method that enables discovery and annotation of pathogen resistance gene family members in plant genome sequences. We successfully applied RenSeq to the sequenced potato Solanum tuberosum clone DM, and increased the number of identified NB-LRRs from 438 to 755. The majority of these identified R gene loci reside in poorly or previously unannotated regions of the genome. Sequence and positional details on the 12 chromosomes have been established for 704 NB-LRRs and can be accessed through a genome browser that we provide. We compared these NB-LRR genes and the corresponding oligonucleotide baits with the highest sequence similarity and demonstrated that ∼80% sequence identity is sufficient for enrichment. Analysis of the sequenced tomato S. lycopersicum ‘Heinz 1706’ extended the NB-LRR complement to 394 loci. We further describe a methodology that applies RenSeq to rapidly identify molecular markers that co-segregate with a pathogen resistance trait of interest. In two independent segregating populations involving the wild Solanum species S. berthaultii (Rpi-ber2) and S. ruiz-ceballosii (Rpi-rzc1), we were able to apply RenSeq successfully to identify markers that co-segregate with resistance towards the late blight pathogen Phytophthora infestans. These SNP identification workflows were designed as easy-to-adapt Galaxy pipelines.


Genome Biology | 2014

The genome and life-stage specific transcriptomes of Globodera pallida elucidate key aspects of plant parasitism by a cyst nematode

James A. Cotton; Catherine J. Lilley; Laura M. Jones; Taisei Kikuchi; Adam J. Reid; Peter Thorpe; Isheng J. Tsai; Helen Beasley; Vivian C. Blok; Peter J. A. Cock; Sebastian Eves-van den Akker; Nancy Holroyd; Martin Hunt; Sophie Mantelin; Hardeep Naghra; Arnab Pain; Juan E. Palomares-Rius; Magdalena Zarowiecki; Matthew Berriman; John T. Jones; Peter E. Urwin

Background: Globodera pallida is a devastating pathogen of potato crops, making it one of the most economically important plant parasitic nematodes. It is also an important model for the biology of cyst nematodes. Cyst nematodes and root-knot nematodes are the two most important plant parasitic nematode groups and together represent a global threat to food security. Results: We present the complete genome sequence of G. pallida, together with transcriptomic data from most of the nematode life cycle, particularly focusing on the life cycle stages involved in root invasion and establishment of the biotrophic feeding site. Despite the relatively close phylogenetic relationship with root-knot nematodes, we describe a very different gene family content between the two groups and in particular extensive differences in the repertoire of effectors, including an enormous expansion of the SPRY domain protein family in G. pallida, which includes the SPRYSEC family of effectors. This highlights the distinct biology of cyst nematodes compared to the root-knot nematodes that were, until now, the only sedentary plant parasitic nematodes for which genome information was available. We also present in-depth descriptions of the repertoires of other genes likely to be important in understanding the unique biology of cyst nematodes and of potential drug targets and other targets for their control. Conclusions: The data and analyses we present will be central in exploiting post-genomic approaches in the development of much-needed novel strategies for the control of G. pallida and related pathogens.


Molecular Plant Pathology | 2009

Identification and functional characterization of effectors in expressed sequence tags from various life cycle stages of the potato cyst nematode Globodera pallida

John T. Jones; Amar Kumar; Liliya A. Pylypenko; Amarnath Thirugnanasambandam; Lydia Castelli; Sean Chapman; Peter J. A. Cock; Eric Grenier; Catherine J. Lilley; Mark S. Phillips; Vivian C. Blok

In this article, we describe the analysis of over 9000 expressed sequence tags (ESTs) from cDNA libraries obtained from various life cycle stages of Globodera pallida. We have identified over 50 G. pallida effectors from this dataset using bioinformatics analysis, by screening clones in order to identify secreted proteins up-regulated after the onset of parasitism and using in situ hybridization to confirm the expression in pharyngeal gland cells. A substantial gene family encoding G. pallida SPRYSEC proteins has been identified. The expression of these genes is restricted to the dorsal pharyngeal gland cell. Different members of the SPRYSEC family of proteins from G. pallida show different subcellular localization patterns in plants, with some localized to the cytoplasm and others to the nucleus and nucleolus. Differences in subcellular localization may reflect diverse functional roles for each individual protein or, more likely, variety in the compartmentalization of plant proteins targeted by the nematode. Our data are therefore consistent with the suggestion that the SPRYSEC proteins suppress host defences, as suggested previously, and that they achieve this through interaction with a range of host targets.


Amino Acids | 2009

Evolution of prokaryotic two-component systems: insights from comparative genomics

David E. Whitworth; Peter J. A. Cock

Two-component systems (TCSs) are diverse and abundant signal transduction pathways found predominantly in prokaryotes. This review focuses on insights into TCS evolution made possible by the sequencing of whole prokaryotic genomes. Typical TCSs comprise an autophosphorylating protein (a histidine kinase), which transfers a phosphoryl group onto an effector protein (a response regulator), thus modulating its activity. Histidine kinases and response regulators are usually found encoded as pairs of adjacent genes within a genome, with multiple examples in most prokaryotes. Recent studies have shed light on major themes of TCS evolution, including gene duplication, gene gain/loss, gene fusion/fission, domain gain/loss, domain shuffling and the emergence of complexity. Coupled with an understanding of the structural and biophysical properties of many TCS proteins, it has become increasingly possible to draw inferences regarding the functional consequences of such evolutionary changes. In turn, this increase in understanding has the potential to enhance both our ability to rationally engineer TCSs, and also allow us to more powerfully correlate TCS evolution with behavioural phenotypes and ecological niche occupancy.


Molecular Plant Pathology | 2012

Genetic characterization of the HrpL regulon of the fire blight pathogen Erwinia amylovora reveals novel virulence factors

R. Ryan McNally; Ian K. Toth; Peter J. A. Cock; Leighton Pritchard; Peter E. Hedley; Jenny Morris; Youfu Zhao; George W. Sundin

The bacterial pathogen Erwinia amylovora is the causal agent of fire blight, an economically significant disease of apple and pear. Disease initiation by E. amylovora requires the translocation of effector proteins into host cells via the hypersensitive response and pathogenicity (hrp) type III secretion system (T3SS). The alternative sigma factor HrpL positively regulates the transcription of structural and translocated components of the T3SS via hrp promoter elements. To characterize genome-wide HrpL-dependent gene expression in E. amylovora Ea1189, wild-type and Ea1189ΔhrpL strains were cultured in hrp-inducing minimal medium, and total RNA was compared using a custom microarray designed to represent the annotated genes of E. amylovora ATCC 49946. The results revealed 24 genes differentially regulated in Ea1189ΔhrpL relative to Ea1189 with fold-change expression ratios greater than 1.5; of these, 19 genes exhibited decreased transcript abundance and five genes showed increased transcript abundance relative to Ea1189. To expand our understanding of the HrpL regulon and to elucidate direct versus indirect HrpL-mediated effects on gene expression, the genome of E. amylovora ATCC 49946 was examined in silico using a hidden Markov model assembled from known Erwinia spp. hrp promoters. This technique identified 15 putative type III novel hrp promoters, seven of which were validated with quantitative polymerase chain reaction based on expression analyses. It was found that HrpL-regulated genes encode all known components of the hrp T3SS, as well as five putative type III effectors. Eight genes displayed apparent indirect HrpL regulation, suggesting that the HrpL regulon is connected to downstream signalling networks. The construction of deletion mutants of three novel HrpL-regulated genes resulted in the identification of additional virulence factors as well as mutants displaying abnormal motility and biofilm phenotypes.
