Is this you? Create Your Porfile

Sarah Cohen-Boulakia

French Institute for Research in Computer Science and Automation

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Sarah Cohen-Boulakia is active.

Explore More

Publication

Featured researches published by Sarah Cohen-Boulakia.

statistical and scientific database management | 2015

OpenAlea: scientific workflows combining data analysis and simulation

Christophe Pradal; Christian Fournier; Patrick Valduriez; Sarah Cohen-Boulakia

Analyzing biological data (e.g., annotating genomes, assembling NGS data...) may involve very complex and interlinked steps where several tools are combined together. Scientific workflow systems have reached a level of maturity that makes them able to support the design and execution of such in-silico experiments, and thus making them increasingly popular in the bioinformatics community. However, in some emerging application domains such as system biology, developmental biology or ecology, the need for data analysis is combined with the need to model complex multi-scale biological systems, possibly involving multiple simulation steps. This requires the scientific workflow to deal with retro-action to understand and predict the relationships between structure and function of these complex systems. OpenAlea (openalea.gforge.inria.fr) is the only scientific workflow system able to uniformly address the problem, which made it successful in the scientific community. One of its main originality is to introduce higher-order dataflows as a means to uniformly combine classical data analysis with modeling and simulation. In this demonstration paper, we provide for the first time the description of the OpenAlea system involving an original combination of features. We illustrate the demonstration on a high-throughput workflow in phenotyping, phenomics, and environmental control designed to study the interplay between plant architecture and climatic change.

Future Generation Computer Systems | 2017

InfraPhenoGrid: A scientific workflow infrastructure for Plant Phenomics on the Grid

Christophe Pradal; Simon Artzet; Jérôme Chopard; Dimitri Dupuis; Christian Fournier; Michael Mielewczik; Vincent Negre; Pascal Neveu; Didier Parigot; Patrick Valduriez; Sarah Cohen-Boulakia

Plant phenotyping consists in the observation of physical and biochemical traits of plant genotypes in response to environmental conditions. Challenges , in particular in context of climate change and food security, are numerous. High-throughput platforms have been introduced to observe the dynamic growth of a large number of plants in different environmental conditions. Instead of considering a few genotypes at a time (as it is the case when phenomic traits are measured manually), such platforms make it possible to use completely new kinds of approaches. However, the data sets produced by such widely instrumented platforms are huge, constantly augmenting and produced by increasingly complex experiments, reaching a point where distributed computation is mandatory to extract knowledge from data. In this paper, we introduce InfraPhenoGrid, the infrastructure we designed and deploy to efficiently manage data sets produced by the PhenoArch plant phenomics platform in the context of the French Phenome Project. Our solution consists in deploying scientific workflows on a Grid using a middle-ware to pilot workflow executions. Our approach is user-friendly in the sense that despite the intrinsic complexity of the infrastructure, running scientific workflows and understanding results obtained (using provenance information) is kept as simple as possible for end-users.

very large data bases | 2015

Rank aggregation with ties: experiments and analysis

Bryan Brancotte; Bo Yang; Guillaume Blin; Sarah Cohen-Boulakia; Alain Denise; Sylvie Hamel

The problem of aggregating multiple rankings into one consensus ranking is an active research topic especially in the database community. Various studies have implemented methods for rank aggregation and may have come up with contradicting conclusions upon which algorithms work best. Comparing such results is cumbersome, as the original studies mixed different approaches and used very different evaluation datasets and metrics. Additionally, in real applications, the rankings to be aggregated may not be permutations where elements are strictly ordered, but they may have ties where some elements are placed at the same position. However, most of the studies have not considered ties. This paper introduces the first large scale study of algorithms for rank aggregation with ties. More precisely, (i) we review rank aggregation algorithms and determine whether or not they can handle ties; (ii) we propose the first implementation to compute the exact solution of the Rank Aggregation with ties problem; (iii) we evaluate algorithms for rank aggregation with ties on a very large panel of both real and carefully generated synthetic datasets; (iv) we provide guidance on the algorithms to be favored depending on dataset features.

Future Generation Computer Systems | 2017

Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities

Sarah Cohen-Boulakia; Khalid Belhajjame; Olivier Collin; Jérôme Chopard; Christine Froidevaux; Alban Gaignard; Konrad Hinsen; Pierre Larmande; Yvan Le Bras; Frédéric Lemoine; Fabien Mareuil; Hervé Ménager; Christophe Pradal; Christophe Blanchet

IAMPS 2015 (International Workshop on Image Analysis Methods for the Plant Sciences) | 2015

Phenomenal: a software framework for model-assisted analysis of high throughput plant phenotyping data

Christian Fournier; Simon Artzet; Jérôme Chopard; Michael Mielewczik; Nicolas Brichet; Llorenç Cabrera; Xavier Sirault; Sarah Cohen-Boulakia; Christophe Pradal

Ercim News | 2018

Distributed management of scientific workflows for high-throughput plant phenotyping

Christophe Pradal; Sarah Cohen-Boulakia; Gaetan Heidsieck; Esther Pacitti; Francois Tardieu; Patrick Valduriez

Archive | 2016

Traitement de données bioinformatiques massives (Big Data)

Sarah Cohen-Boulakia; Patrick Valduriez

IC2016: Ingénierie des Connaissances | 2016

Représentation systémique multi-échelle des processus biologiques de la bactérie.

Vincent Henry; Arnaud Ferré; Christine Froidevaux; Anne Goelzer; Vincent Fromion; Sarah Cohen-Boulakia; Sandra Derozier; Marc Dinh; Ghislain Fiévet; Stephan Fischer; Jean-François Gibrat; Valentin Loux; Sabine Pérès

Ref : TIP140WEB - "Bioprocédés" | 2015

Interrogation et gestion de données bio-informatiques pour la biologie moléculaire

Sarah Cohen-Boulakia; Patrick Valduriez

Archive | 2015

New Results - Generic methodological results

Christophe Pradal; Sarah Cohen-Boulakia; Christian Fournier; Didier Parigot; Patrick Valduriez; Yann Guédon; Jean Peyhardi; Christophe Godin; Romain Azaïs; Jean-Baptiste Durand; Alain Jean-Marie

Explore More