Florence Jaffrézic
Institut national de la recherche agronomique
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Florence Jaffrézic.
Briefings in Bioinformatics | 2013
Marie-Agnès Dillies; Andrea Rau; Julie Aubert; Christelle Hennequet-Antier; Marine Jeanmougin; Nicolas Servant; Céline Keime; Guillemette Marot; David Castel; Jordi Estellé; Gregory Guernec; Bernd Jagla; Luc Jouneau; Denis Laloë; Caroline Le Gall; Brigitte Schaëffer; Stéphane Le Crom; Mickael Guedj; Florence Jaffrézic
During the last 3 years, a number of approaches for the normalization of RNA sequencing data have emerged in the literature, differing both in the type of bias adjustment and in the statistical strategy adopted. However, as data continue to accumulate, there has been no clear consensus on the appropriate normalization method to be used or the impact of a chosen method on the downstream analysis. In this work, we focus on a comprehensive comparison of seven recently proposed normalization methods for the differential analysis of RNA-seq data, with an emphasis on the use of varied real and simulated datasets involving different species and experimental designs to represent data characteristics commonly observed in practice. Based on this comparison study, we propose practical recommendations on the appropriate normalization method to be used and its impact on the differential analysis of RNA-seq data.
PLOS ONE | 2009
Laurence Flori; Sébastien Fritz; Florence Jaffrézic; Mekki Boussaha; Ivo Gut; Simon Heath; Jean-Louis Foulley; Mathieu Gautier
Dairy cattle breeds have been subjected over the last fifty years to intense artificial selection towards improvement of milk production traits. In this study, we performed a whole genome scan for differentiation using 42,486 SNPs in the three major French dairy cattle breeds (Holstein, Normande and Montbéliarde) to identify the main physiological pathways and regions which were affected by this selection. After analyzing the population structure, we estimated FST within and across the three breeds for each SNP under a pure drift model. We further considered two different strategies to evaluate the effect of selection at the genome level. First, smoothing FST values over each chromosome with a local variable bandwidth kernel estimator allowed identifying 13 highly significant regions subjected to strong and/or recent positive selection. Some of them contained genes within which causal variants with strong effect on milk production traits (GHR) or coloration (MC1R) have already been reported. To go further in the interpretation of the observed signatures of selection we subsequently concentrated on the annotation of differentiated genes defined according to the FST value of SNPs localized close or within them. To that end we performed a comprehensive network analysis which suggested a central role of somatotropic and gonadotropic axes in the response to selection. Altogether, these observations shed light on the antagonism, at the genome level, between milk production and reproduction traits in highly producing dairy cows.
BMC Genomics | 2009
Mathieu Gautier; Laurence Flori; Andrea Riebler; Florence Jaffrézic; Denis Laloë; Ivo Gut; Katayoun Moazami-Goudarzi; Jean-Louis Foulley
BackgroundThe recent settlement of cattle in West Africa after several waves of migration from remote centres of domestication has imposed dramatic changes in their environmental conditions, in particular through exposure to new pathogens. West African cattle populations thus represent an appealing model to unravel the genome response to adaptation to tropical conditions. The purpose of this study was to identify footprints of adaptive selection at the whole genome level in a newly collected data set comprising 36,320 SNPs genotyped in 9 West African cattle populations.ResultsAfter a detailed analysis of population structure, we performed a scan for SNP differentiation via a previously proposed Bayesian procedure including extensions to improve the detection of loci under selection. Based on these results we identified 53 genomic regions and 42 strong candidate genes. Their physiological functions were mainly related to immune response (MHC region which was found under strong balancing selection, CD79A, CXCR4, DLK1, RFX3, SEMA4A, TICAM1 and TRIM21), nervous system (NEUROD6, OLFM2, MAGI1, SEMA4A and HTR4) and skin and hair properties (EDNRB, TRSP1 and KRTAP8-1).ConclusionThe main possible underlying selective pressures may be related to climatic conditions but also to the host response to pathogens such as Trypanosoma(sp). Overall, these results might open the way towards the identification of important variants involved in adaptation to tropical conditions and in particular to resistance to tropical infectious diseases.
Bioinformatics | 2013
Andrea Rau; Mélina Gallopin; Gilles Celeux; Florence Jaffrézic
Motivation: RNA sequencing is now widely performed to study differential expression among experimental conditions. As tests are performed on a large number of genes, stringent false-discovery rate control is required at the expense of detection power. Ad hoc filtering techniques are regularly used to moderate this correction by removing genes with low signal, with little attention paid to their impact on downstream analyses. Results: We propose a data-driven method based on the Jaccard similarity index to calculate a filtering threshold for replicated RNA sequencing data. In comparisons with alternative data filters regularly used in practice, we demonstrate the effectiveness of our proposed method to correctly filter lowly expressed genes, leading to increased detection power for moderately to highly expressed genes. Interestingly, this data-driven threshold varies among experiments, highlighting the interest of the method proposed here. Availability: The proposed filtering method is implemented in the R package HTSFilter available on Bioconductor. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
Genetics Selection Evolution | 2005
Tom Druet; Florence Jaffrézic; Vincent Ducrocq
Application of test-day models for the genetic evaluation of dairy populations requires the solution of large mixed model equations. The size of the (co)variance matrices required with such models can be reduced through the use of its first eigenvectors. Here, the first two eigenvectors of (co)variance matrices estimated for dairy traits in first lactation were used as covariables to jointly estimate genetic parameters of the first three lactations. These eigenvectors appear to be similar across traits and have a biological interpretation, one being related to the level of production and the other to persistency. Furthermore, they explain more than 95% of the total genetic variation. Variances and heritabilities obtained with this model were consistent with previous studies. High correlations were found among production levels in different lactations. Persistency measures were less correlated. Genetic correlations between second and third lactations were close to one, indicating that these can be considered as the same trait. Genetic correlations within lactation were high except between extreme parts of the lactation. This study shows that the use of eigenvectors can reduce the rank of (co)variance matrices for the test-day model and can provide consistent genetic parameters.
Statistical Applications in Genetics and Molecular Biology | 2010
Andrea Rau; Florence Jaffrézic; Jean Louis Foulley; R. W. Doerge
Gene regulatory networks refer to the interactions that occur among genes and other cellular products. The topology of these networks can be inferred from measurements of changes in gene expression over time. However, because the measurement device (i.e., microarrays) typically yields information on thousands of genes over few biological replicates, these systems are quite difficult to elucidate. An approach with proven effectiveness for inferring networks is the Dynamic Bayesian Network. We have developed an iterative empirical Bayesian procedure with a Kalman filter that estimates the posterior distributions of network parameters. We compare our method to similar existing methods on simulated data and real microarray time series data. We find that the proposed method performs comparably on both model-based and data-based simulations in considerably less computational time. The R and C code used to implement the proposed method are publicly available in the R package ebdbNet.
PLOS ONE | 2011
Anna Anselmo; Laurence Flori; Florence Jaffrézic; Teresa Rutigliano; María C. Cecere; Naima Cortes-Perez; François Lefèvre; Claire Rogel-Gaillard; Elisabetta Giuffra
MicroRNAs are small non-coding RNAs approximately 22 nt long that modulate gene expression in animals and plants. It has been recently demonstrated that herpesviruses encode miRNAs to control the post-transcriptional regulation of expression from their own genomes and possibly that of their host, thus adding an additional layer of complexity to the physiological cross-talk between host and pathogen. The present study focussed on the interactions between porcine dendritic cells (DCs) and the Pseudorabies virus (PRV), an alpha-herpesvirus causing Aujeszkys disease in pigs. A catalogue of porcine and viral miRNAs, expressed eight hours post-infection, was established by deep sequencing. An average of 2 million reads per sample with a size of 21–24 nucleotides was obtained from six libraries representing three biological replicates of infected and mock-infected DCs. Almost 95% of reads mapped to the draft pig genome sequence and pig miRNAs previously annotated in dedicated databases were detected by sequence alignment. In silico prediction allowed the identification of unknown porcine as well as of five miRNAs transcribed by the Large Latency Transcript (LLT) of PRV. The gene target prediction of the viral miRNAs and the Ingenuity Pathway Analysis of differentially expressed pig miRNAs were conducted to contextualize the identified small RNA molecules and functionally characterize their involvement in the post-transcriptional regulation of gene expression. The results support a role for PRV miRNAs in the maintenance of the host cell latency state through the down-regulation of immediate-early viral genes which is similar to other herpesviruses. The differentially expressed swine miRNAs identified a unique network of target genes with highly significant functions in the development and function of the nervous system and in infectious mechanisms, suggesting that the modulation of both host and viral miRNAs is necessary for the establishment of PRV latency.
PLOS ONE | 2012
Sandrine Le Guillou; Nezha Sdassi; Johann Laubier; Bruno Passet; Marthe Vilotte; Johan Castille; Denis Laloë; Jacqueline Polyte; Stephan Bouet; Florence Jaffrézic; E. P. Cribiu; Jean-Luc Vilotte; Fabienne Le Provost
Background MicroRNA (miRNA) are negative regulators of gene expression, capable of exerting pronounced influences upon the translation and stability of mRNA. They are potential regulators of normal mammary gland development and of the maintenance of mammary epithelial progenitor cells. This study was undertaken to determine the role of miR-30b on the establishment of a functional mouse mammary gland. miR-30b is a member of the miR-30 family, composed of 6 miRNA that are highly conserved in vertebrates. It has been suggested to play a role in the differentiation of several cell types. Methodology/Principal Findings The expression of miR-30b was found to be regulated during mammary gland development. Transgenic mice overexpressing miR-30b in mammary epithelial cells were used to investigate its role. During lactation, mammary histological analysis of the transgenic mice showed a reduction in the size of alveolar lumen, a defect of the lipid droplets and a growth defect of pups fed by transgenic females. Moreover some mammary epithelial differentiated structures persisted during involution, suggesting a delay in the process. The genes whose expression was affected by the overexpression of miR-30b were characterized by microarray analysis. Conclusion/Significance Our data suggests that miR-30b is important for the biology of the mammary gland and demonstrates that the deregulation of only one miRNA could affect lactation and involution.
BMC Genomics | 2008
Benoit Guyonnet; Guillemette Marot; Jean-Louis Dacheux; Marie-José Mercat; Sandrine Schwob; Florence Jaffrézic; Jean-Luc Gatti
BackgroundMammalians gamete production takes place in the testis but when they exit this organ, although spermatozoa have acquired a specialized and distinct morphology, they are immotile and infertile. It is only after their travel in the epididymis that sperm gain their motility and fertility. Epididymis is a crescent shaped organ adjacent to the testis that can be divided in three gross morphological regions, head (caput), body (corpus) and tail (cauda). It contains a long and unique convoluted tubule connected to the testis via the efferent ducts and finished by joining the vas deferens in its caudal part.ResultsIn this study, the testis, the efferent ducts (vas efferens, VE), nine distinct successive epididymal segments and the deferent duct (vas deferens, VD) of four adult boars of known fertility were isolated and their mRNA extracted. The gene expression of each of these samples was analyzed using a pig generic 9 K nylon microarray (AGENAE program; GEO accession number: GPL3729) spotted with 8931 clones derived from normalized cDNA banks from different pig tissues including testis and epididymis. Differentially expressed transcripts were obtained with moderated t-tests and F-tests and two data clustering algorithms based either on partitioning around medoid (top down PAM) or hierarchical clustering (bottom up HCL) were combined for class discovery and gene expression analysis. Tissue clustering defined seven transcriptomic units: testis, vas efferens and five epididymal transcriptomic units. Meanwhile transcripts formed only four clusters related to the tissues. We have then used a specific statistical method to sort out genes specifically over-expressed (markers) in testis, VE or in each of the five transcriptomic units of the epididymis (including VD). The specific regional expression of some of these genes was further validated by PCR and Q-PCR. We also searched for specific pathways and functions using available gene ontology information.ConclusionThis study described for the first time the complete transcriptomes of the testis, the epididymis, the vas efferens and the vas deferens on the same species. It described new genes or genes not yet reported over-expressed in these boar tissues, as well as new control mechanisms. It emphasizes and fulfilled the gap between studies done in rodents and human, and provides tools that will be useful for further studies on the biochemical processes responsible for the formation and maintain of the epididymal regionalization and the development of a fertile spermatozoa.
Genetics Research | 2007
Florence Jaffrézic; Guillemette Marot; Séverine Degrelle; Isabelle Hue; Jean-Louis Foulley
The importance of variance modelling is now widely known for the analysis of microarray data. In particular the power and accuracy of statistical tests for differential gene expressions are highly dependent on variance modelling. The aim of this paper is to use a structural model on the variances, which includes a condition effect and a random gene effect, and to propose a simple estimation procedure for these parameters by working on the empirical variances. The proposed variance model was compared with various methods on both real and simulated data. It proved to be more powerful than the gene-by-gene analysis and more robust to the number of false positives than the homogeneous variance model. It performed well compared with recently proposed approaches such as SAM and VarMixt even for a small number of replicates, and performed similarly to Limma. The main advantage of the structural model is that, thanks to the use of a linear mixed model on the logarithm of the variances, various factors of variation can easily be incorporated in the model, which is not the case for previously proposed empirical Bayes methods. It is also very fast to compute and is adapted to the comparison of more than two conditions.