Cyril Grouin
Centre national de la recherche scientifique
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Cyril Grouin.
Journal of the American Medical Informatics Association | 2013
Cyril Grouin; Natalia Grabar; Thierry Hamon; Sophie Rosset; Xavier Tannier; Pierre Zweigenbaum
OBJECTIVEnTo identify the temporal relations between clinical events and temporal expressions in clinical reports, as defined in the i2b2/VA 2012 challenge.nnnDESIGNnTo detect clinical events, we used rules and Conditional Random Fields. We built Random Forest models to identify event modality and polarity. To identify temporal expressions we built on the HeidelTime system. To detect temporal relations, we systematically studied their breakdown into distinct situations; we designed an oracle method to determine the most prominent situations and the most suitable associated classifiers, and combined their results.nnnRESULTSnWe achieved F-measures of 0.8307 for event identification, based on rules, and 0.8385 for temporal expression identification. In the temporal relation task, we identified nine main situations in three groups, experimentally confirming shared intuitions: within-sentence relations, section-related time, and across-sentence relations. Logistic regression and Naïve Bayes performed best on the first and third groups, and decision trees on the second. We reached a 0.6231 global F-measure, improving by 7.5 points our official submission.nnnCONCLUSIONSnCarefully hand-crafted rules obtained good results for the detection of events and temporal expressions, while a combination of classifiers improved temporal link prediction. The characterization of the oracle recall of situations allowed us to point at directions where further work would be most useful for temporal relation detection: within-sentence relations and linking History of Present Illness events to the admission date. We suggest that the systematic situation breakdown proposed in this paper could also help improve other systems addressing this task.
Biomedical Informatics Insights | 2013
Pierre Zweigenbaum; Thomas Lavergne; Natalia Grabar; Thierry Hamon; Sophie Rosset; Cyril Grouin
Medical entity recognition is currently generally performed by data-driven methods based on supervised machine learning. Expert-based systems, where linguistic and domain expertise are directly provided to the system are often combined with data-driven systems. We present here a case study where an existing expert-based medical entity recognition system, Ogmios, is combined with a data-driven system, Caramba, based on a linear-chain Conditional Random Field (CRF) classifier. Our case study specifically highlights the risk of overfitting incurred by an expert-based system. We observe that it prevents the combination of the 2 systems from obtaining improvements in precision, recall, or F-measure, and analyze the underlying mechanisms through a post-hoc feature-level analysis. Wrapping the expert-based system alone as attributes input to a CRF classifier does boost its F-measure from 0.603 to 0.710, bringing it on par with the data-driven system. The generalization of this method remains to be further investigated.
Biomedical Informatics Insights | 2012
Alexander Pak; Delphine Bernhard; Patrick Paroubek; Cyril Grouin
In this paper, we present the system we have developed for participating in the second task of the i2b2/VA 2011 challenge dedicated to emotion detection in clinical records. On the official evaluation, we ranked 6th out of 26 participants. Our best configuration, based upon a combination of both a machine-learning based approach and manually-defined transducers, obtained a 0.5383 global F-measure, while the distribution of the other 26 participants’ results is characterized by mean = 0.4875, stdev = 0.0742, min = 0.2967, max = 0.6139, and median = 0.5027. Combination of machine learning and transducer is achieved by computing the union of results from both approaches, each using a hierarchy of sentiment specific classifiers.
5th Linguistics Annotation Workshop (The LAW V) | 2011
Cyril Grouin; Sophie Rosset; Pierre Zweigenbaum; Karën Fort; Olivier Galibert; Ludovic Quintard
i2b2 Medication Extraction Challenge Workshop | 2010
Cyril Grouin; Asma Ben Abacha; Delphine Bernhard; Bruno Cartoni; Louise Deléger; Brigitte Grau; Anne-Laure Ligozat; Anne-Lyse Minard; Sophie Rosset; Pierre Zweigenbaum
CLEF (Working Notes) | 2015
Aurélie Névéol; Cyril Grouin; Xavier Tannier; Thierry Hamon; Liadh Kelly; Lorraine Goeuriot; Pierre Zweigenbaum
JEP-TALN-RECITAL 2012, Workshop DEFT 2012: D'Efi Fouille de Textes (DEFT 2012 Workshop: Text Mining Challenge) | 2012
Patrick Paroubek; Pierre Zweigenbaum; Dominic Forest; Cyril Grouin
Proceedings of the American Medical Informatics Association Conference AMIA | 2011
Olivier Dameron; Pascal Van Hille; Lynda Temal; Arnaud Rosier; Louise Deléger; Cyril Grouin; Pierre Zweigenbaum; Anita Burgun
CLEF (Working Notes) | 2017
Aurélie Névéol; Aude Robert; Robert Anderson; Kevin Bretonnel Cohen; Cyril Grouin; Thomas Lavergne; Grégoire Rey; Claire Rondet; Pierre Zweigenbaum
AMIA | 2014
Louise Deléger; Cyril Grouin; Aurélie Névéol