Is this you? Create Your Porfile

Stéphanie Bocs

Centre national de la recherche scientifique

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Stéphanie Bocs is active.

Explore More

Publication

Featured researches published by Stéphanie Bocs.

Nature | 2012

The banana (Musa acuminata) genome and the evolution of monocotyledonous plants.

Angélique D’Hont; Jean-Marc Aury; Franc-Christophe Baurens; Françoise Carreel; Olivier Garsmeur; Benjamin Noel; Stéphanie Bocs; Gaëtan Droc; Mathieu Rouard; Corinne Da Silva; Kamel Jabbari; Céline Cardi; Julie Poulain; Marlène Souquet; Karine Labadie; Cyril Jourda; Juliette Lengellé; Marguerite Rodier-Goud; Adriana Alberti; Maria Bernard; Margot Corréa; Saravanaraj Ayyampalayam; Michael R. McKain; Jim Leebens-Mack; Diane Burgess; Michael Freeling; Didier Mbéguié-A-Mbéguié; Matthieu Chabannes; Thomas Wicker; Olivier Panaud

Bananas (Musa spp.), including dessert and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister group to the well-studied Poales, which include cereals. Bananas are vital for food security in many tropical and subtropical countries and the most popular fruit in industrialized countries. The Musa domestication process started some 7,000 years ago in Southeast Asia. It involved hybridizations between diverse species and subspecies, fostered by human migrations, and selection of diploid and triploid seedless, parthenocarpic hybrids thereafter widely dispersed by vegetative propagation. Half of the current production relies on somaclones derived from a single triploid genotype (Cavendish). Pests and diseases have gradually become adapted, representing an imminent danger for global banana production. Here we describe the draft sequence of the 523-megabase genome of a Musa acuminata doubled-haploid genotype, providing a crucial stepping-stone for genetic improvement of banana. We detected three rounds of whole-genome duplications in the Musa lineage, independently of those previously described in the Poales lineage and the one we detected in the Arecales lineage. This first monocotyledon high-continuity whole-genome sequence reported outside Poales represents an essential bridge for comparative genome analysis in plants. As such, it clarifies commelinid-monocotyledon phylogenetic relationships, reveals Poaceae-specific features and has led to the discovery of conserved non-coding sequences predating monocotyledon–eudicotyledon divergence.

Nature Biotechnology | 2003

The genome sequence of the entomopathogenic bacterium Photorhabdus luminescens.

Eric Duchaud; Christophe Rusniok; Lionel Frangeul; Carmen Buchrieser; Alain Givaudan; Sead Taourit; Stéphanie Bocs; Caroline Boursaux-Eude; Michael Chandler; Jean-François Charles; Elie Dassa; Richard Derose; Sylviane Derzelle; Georges Freyssinet; Claudine Médigue; Anne Lanois; Kerrie Powell; Patricia Siguier; Rachel Vincent; Vincent Paul Mary Wingate; Mohamed Zouine; Philippe Glaser; Noël Boemare; Antoine Danchin; Frank Kunst

Photorhabdus luminescens is a symbiont of nematodes and a broad-spectrum insect pathogen. The complete genome sequence of strain TT01 is 5,688,987 base pairs (bp) long and contains 4,839 predicted protein-coding genes. Strikingly, it encodes a large number of adhesins, toxins, hemolysins, proteases and lipases, and contains a wide array of antibiotic synthesizing genes. These proteins are likely to play a role in the elimination of competitors, host colonization, invasion and bioconversion of the insect cadaver, making P. luminescens a promising model for the study of symbiosis and host-pathogen interactions. Comparison with the genomes of related bacteria reveals the acquisition of virulence factors by extensive horizontal transfer and provides clues about the evolution of an insect pathogen. Moreover, newly identified insecticidal proteins may be effective alternatives for the control of insect pests.

Nature Genetics | 2011

The genome of Theobroma cacao

Xavier Argout; Jérôme Salse; Jean-Marc Aury; Mark J. Guiltinan; Gaëtan Droc; Jérôme Gouzy; Mathilde Allègre; Cristian Chaparro; Thierry Legavre; Siela N. Maximova; Michael Abrouk; Florent Murat; Olivier Fouet; Julie Poulain; Manuel Ruiz; Yolande Roguet; Maguy Rodier-Goud; Jose Fernandes Barbosa-Neto; François Sabot; Dave Kudrna; Jetty S. S. Ammiraju; Stephan C. Schuster; John E. Carlson; Erika Sallet; Thomas Schiex; Anne Dievart; Melissa Kramer; Laura Gelley; Zi Shi; Aurélie Bérard

We sequenced and assembled the draft genome of Theobroma cacao, an economically important tropical-fruit tree crop that is the source of chocolate. This assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of these genes anchored on the 10 T. cacao chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example, flavonoid-related genes. It also provides a major source of candidate genes for T. cacao improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten T. cacao chromosomes were shaped from an ancestor through eleven chromosome fusions.

Nucleic Acids Research | 2006

MaGe: a microbial genome annotation system supported by synteny results

David Vallenet; Laurent Labarre; Zoé Rouy; Valérie Barbe; Stéphanie Bocs; Stéphane Cruveiller; Aurélie Lajus; Géraldine Pascal; Claude Scarpelli; Claudine Médigue

Magnifying Genomes (MaGe) is a microbial genome annotation system based on a relational database containing information on bacterial genomes, as well as a web interface to achieve genome annotation projects. Our system allows one to initiate the annotation of a genome at the early stage of the finishing phase. MaGes main features are (i) integration of annotation data from bacterial genomes enhanced by a gene coding re-annotation process using accurate gene models, (ii) integration of results obtained with a wide range of bioinformatics methods, among which exploration of gene context by searching for conserved synteny and reconstruction of metabolic pathways, (iii) an advanced web interface allowing multiple users to refine the automatic assignment of gene product functions. MaGe is also linked to numerous well-known biological databases and systems. Our system has been thoroughly tested during the annotation of complete bacterial genomes (Acinetobacter baylyi ADP1, Pseudoalteromonas haloplanktis, Frankia alni) and is currently used in the context of several new microbial genome annotation projects. In addition, MaGe allows for annotation curation and exploration of already published genomes from various genera (e.g. Yersinia, Bacillus and Neisseria). MaGe can be accessed at .

Science | 2014

The coffee genome provides insight into the convergent evolution of caffeine biosynthesis

Lorenzo Carretero-Paulet; Alexis Dereeper; Gaëtan Droc; Romain Guyot; Marco Pietrella; Chunfang Zheng; Adriana Alberti; François Anthony; Giuseppe Aprea; Jean-Marc Aury; Pascal Bento; Maria Bernard; Stéphanie Bocs; Claudine Campa; Alberto Cenci; Marie Christine Combes; Dominique Crouzillat; Corinne Da Silva; Loretta Daddiego; Fabien De Bellis; Stéphane Dussert; Olivier Garsmeur; Thomas Gayraud; Valentin Guignon; Katharina Jahn; Véronique Jamilloux; Thierry Joët; Karine Labadie; Tianying Lan; Julie Leclercq

Coffee, tea, and chocolate converge Caffeine has evolved multiple times among plant species, but no one knows whether these events involved similar genes. Denoeud et al. sequenced the Coffea canephora (coffee) genome and identified a conserved gene order (see the Perspective by Zamir). Although this species underwent fewer genome duplications than related species, the relevant caffeine genes experienced tandem duplications that expanded their numbers within this species. Scientists have seen similar but independent expansions in distantly related species of tea and cacao, suggesting that caffeine might have played an adaptive role in coffee evolution. Science, this issue p. 1181; see also p. 1124 The genetic origins of coffee’s constituents reveal intriguing links to cacao and tea. Coffee is a valuable beverage crop due to its characteristic flavor, aroma, and the stimulating effects of caffeine. We generated a high-quality draft genome of the species Coffea canephora, which displays a conserved chromosomal gene order among asterid angiosperms. Although it shows no sign of the whole-genome triplication identified in Solanaceae species such as tomato, the genome includes several species-specific gene family expansions, among them N-methyltransferases (NMTs) involved in caffeine production, defense-related genes, and alkaloid and flavonoid enzymes involved in secondary compound synthesis. Comparative analyses of caffeine NMTs demonstrate that these genes expanded through sequential tandem duplications independently of genes from cacao and tea, suggesting that caffeine in eudicots is of polyphyletic origin.

Journal of Bacteriology | 2008

“Candidatus Cloacamonas Acidaminovorans”: Genome Sequence Reconstruction Provides a First Glimpse of a New Bacterial Division

Eric Pelletier; Annett Kreimeyer; Stéphanie Bocs; Zoé Rouy; Gabor Gyapay; Rakia Chouari; Delphine Rivière; Akila Ganesan; Patrick Daegelen; Abdelghani Sghir; Georges N. Cohen; Claudine Médigue; Jean Weissenbach; Denis Le Paslier

Many microorganisms live in anaerobic environments. Most of these microorganisms have not yet been cultivated. Here, we present, from a metagenomic analysis of an anaerobic digester of a municipal wastewater treatment plant, a reconstruction of the complete genome of a bacterium belonging to the WWE1 candidate division. In silico proteome analysis indicated that this bacterium might derive most of its carbon and energy from the fermentation of amino acids, and hence, it was provisionally classified as Candidatus Cloacamonas acidaminovorans. Candidatus Cloacamonas acidaminovorans is probably a syntrophic bacterium that is present in many anaerobic digesters. This report highlights how environmental sequence data might provide genomic and functional information about a new bacterial clade whose members are involved in anaerobic digestion.

Nucleic Acids Research | 2003

AMIGene: Annotation of MIcrobial Genes

Stéphanie Bocs; Stéphane Cruveiller; David Vallenet; Gregory Nuel; Claudine Médigue

UNLABELLEDnAMIGene (Annotation of MIcrobial Genes) is an application for automatically identifying the most likely coding sequences (CDSs) in a large contig or a complete bacterial genome sequence. The first step in AMIGene is dedicated to the construction of Markov models that fit the input genomic data (i.e. the gene model), followed by the combination of well-known gene-finding methods and an heuristic approach for the selection of the most likely CDSs. The web interface allows the user to select one or several gene models applied to the analysis of the input sequence by the AMIGene program and to visualize the list of predicted CDSs graphically and in a downloadable text format. The AMIGene web site is accessible at the following address: http://www.genoscope.cns.fr/agc/tools/amigene/index.html ([email protected]).

Nucleic Acids Research | 2005

MICheck: a web tool for fast checking of syntactic annotations of bacterial genomes

Stéphane Cruveiller; Jérôme Le Saux; David Vallenet; Aurélie Lajus; Stéphanie Bocs; Claudine Médigue

The annotation of newly sequenced bacterial genomes begins with running several automatic analysis methods, with major emphasis on the identification of protein-coding genes. DNA sequences are heterogeneous in local nucleotide composition and this leads sometimes to sequences being annotated as authentic genes when they are not protein-coding genes or are true but uncharacterized protein-coding genes. This first annotation step is generally followed by an expert manual annotation of the predicted genes. The genomic data (sequence and annotations) organized in an appropriate databank file format is subsequently submitted to an entry point of the International Nucleotide Sequence Database. These procedures are inevitably subject to mistakes, and this can lead to unintentional syntactic annotation errors being stored in public databanks. Here, we present a new web program, MICheck (MIcrobial genome Checker), that enables rapid verification of sets of annotated genes and frameshifts in previously published bacterial genomes. The web interface allows one easily to investigate the MICheck results, i.e. inaccurate or missed gene annotations: a graphical representation is drawn, in which the genomic context of a unique coding DNA sequence annotation or a predicted frameshift is given, using information on the coding potential (curves) and annotation of the neighbouring genes. We illustrate some capabilities of the MICheck site through the analysis of 20 bacterial genomes, 9 of which were selected for their ‘Reviewed’ status in the National Center for Biotechnology Information (NCBI) Reference Sequence Project (RefSeq). In the context of the numerous re-annotation projects for microbial genomes, this tool can be seen as a preliminary step before the functional re-annotation step to check quickly for missing or wrongly annotated genes. The MICheck website is accessible at the following address: .

BMC Bioinformatics | 2002

Re-annotation of genome microbial CoDing-Sequences: finding new genes and inaccurately annotated genes

Stéphanie Bocs; Antoine Danchin; Claudine Médigue

BackgroundAnalysis of any newly sequenced bacterial genome starts with the identification of protein-coding genes. Despite the accumulation of multiple complete genome sequences, which provide useful comparisons with close relatives among other organisms during the annotation process, accurate gene prediction remains quite difficult. A major reason for this situation is that genes are tightly packed in prokaryotes, resulting in frequent overlap. Thus, detection of translation initiation sites and/or selection of the correct coding regions remain difficult unless appropriate biological knowledge (about the structure of a gene) is imbedded in the approach.ResultsWe have developed a new program that automatically identifies biologically significant candidate genes in a bacterial genome. Twenty-six complete prokaryotic genomes were analyzed using this tool, and the accuracy of gene finding was assessed by comparison with existing annotations. This analysis revealed that, despite the enormous effort of genome program annotators, a small but not negligible number of genes annotated within the framework of sequencing projects are likely to be partially inaccurate or plainly wrong. Moreover, the analysis of several putative new genes shows that, as expected, many short genes have escaped annotation. In most cases, these new genes revealed frameshifts that could be either artifacts or genuine frameshifts. Some entirely unexpected new genes have also been identified. This allowed us to get a more complete picture of prokaryotic genomes. The results of this procedure are progressively integrated into the SWISS-PROT reference databank.ConclusionsThe results described in the present study show that our procedure is very satisfactory in terms of gene finding accuracy. Except in few cases, discrepancies between our results and annotations provided by individual authors can be accounted for by the nature of each annotation process or by specific characteristics of some genomes. This stresses that close cooperation between scientists, regular update and curation of the findings in databases are clearly required to reduce the level of errors in genome annotation (and hence in reducing the unfortunate spreading of errors through centralized data libraries).

Journal of Virology | 2013

Three Infectious Viral Species Lying in Wait in the Banana Genome

Matthieu Chabannes; Franc-Christophe Baurens; Pierre-Olivier Duroy; Stéphanie Bocs; Marie-Stéphanie Vernerey; Marguerite Rodier-Goud; Valérie Barbe; Philippe Gayral; Marie-Line Iskra-Caruana

ABSTRACT Plant pararetroviruses integrate serendipitously into their host genomes. The banana genome harbors integrated copies of banana streak virus (BSV) named endogenous BSV (eBSV) that are able to release infectious pararetrovirus. In this investigation, we characterized integrants of three BSV species—Goldfinger (eBSGFV), Imove (eBSImV), and Obino lEwai (eBSOLV)—in the seedy Musa balbisiana Pisang klutuk wulung (PKW) by studying their molecular structure, genomic organization, genomic landscape, and infectious capacity. All eBSVs exhibit extensive viral genome duplications and rearrangements. eBSV segregation analysis on an F1 population of PKW combined with fluorescent in situ hybridization analysis showed that eBSImV, eBSOLV, and eBSGFV are each present at a single locus. eBSOLV and eBSGFV contain two distinct alleles, whereas eBSImV has two structurally identical alleles. Genotyping of both eBSV and viral particles expressed in the progeny demonstrated that only one allele for each species is infectious. The infectious allele of eBSImV could not be identified since the two alleles are identical. Finally, we demonstrate that eBSGFV and eBSOLV are located on chromosome 1 and eBSImV is located on chromosome 2 of the reference Musa genome published recently. The structure and evolution of eBSVs suggest sequential integration into the plant genome, and haplotype divergence analysis confirms that the three loci display differential evolution. Based on our data, we propose a model for BSV integration and eBSV evolution in the Musa balbisiana genome. The mutual benefits of this unique host-pathogen association are also discussed.

Explore More