Karën Fort
French Institute for Research in Computer Science and Automation
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Karën Fort.
meeting of the association for computational linguistics | 2007
Karën Fort; Bruno Guillaume
PrepLex is a lexicon of French prepositions which provides all the syntactic information needed for parsing. It was built by comparing and merging several authoritative lexical sources. This lexicon also includes information about the prepositions or classes of prepositions that appear in French verb subcategorization frames. This resource has been developed as a first step in making current French preposition lexicons available for effective natural language processing.
Database | 2016
Lynette Hirschman; Karën Fort; Stéphanie Boué; Nikos C. Kyrpides; Rezarta Islamaj Doğan; Kevin Bretonnel Cohen
Crowdsourcing is increasingly utilized for performing tasks in both natural language processing and biocuration. Although there have been many applications of crowdsourcing in these fields, there have been fewer high-level discussions of the methodology and its applicability to biocuration. This paper explores crowdsourcing for biocuration through several case studies that highlight different ways of leveraging ‘the crowd’; these raise issues about the kind(s) of expertise needed, the motivations of participants, and questions related to feasibility, cost and quality. The paper is an outgrowth of a panel session held at BioCreative V (Seville, September 9–11, 2015). The session consisted of four short talks, followed by a discussion. In their talks, the panelists explored the role of expertise and the potential to improve crowd performance by training; the challenge of decomposing tasks to make them amenable to crowdsourcing; and the capture of biological data and metadata through community editing. Database URL: http://www.mitre.org/publications/technical-papers/crowdsourcing-and-curation-perspectives
Handbook of Linguistic Annotation | 2017
K. Bretonnel Cohen; Karin Verspoor; Karën Fort; Christopher S. Funk; Michael Bada; Martha Palmer; Lawrence Hunter
The Colorado Richly Annotated Full Text (CRAFT) corpus consists of full-text journal articles. The primary motivation for the annotation project was the accumulating body of evidence indicating that the bodies of journal articles contain much information that is not present in the abstracts, and that the textual and structural characteristics of article bodies are different from those of abstracts. The development of CRAFT was characterized by a “multi-model” annotation task. The sample population was all journal articles that had been used by the Mouse Genome Informatics group as evidence for at least one Gene Ontology or Mouse Phenotype Ontology “annotation.” The linguistic annotation is represented in the widely known Penn Treebank format (Marcus et al., Comput. Linguist. 19(2), 313–330, 1993) [50], with the addition of a small number of tags and phrasal categories to accommodate the idiosyncrasies of the domain.
language resources and evaluation | 2014
Marie Candito; Guy Perrier; Bruno Guillaume; Corentin Ribeyre; Karën Fort; Djamé Seddah; Éric Villemonte de la Clergerie
language resources and evaluation | 2014
Alain Couillault; Karën Fort; Gilles Adda; Hugues De Mazancourt
international conference on computational linguistics | 2008
Bruno Guillaume; Joseph Le Roux; Jonathan Marchand; Guy Perrier; Karën Fort; Jennifer Planul
language resources and evaluation | 2016
Karën Fort; Alain Couillault
international conference on computational linguistics | 2016
Bruno Guillaume; Karën Fort; Nicolas Lefebvre
language resources and evaluation | 2018
Alice Millour; Karën Fort
international conference on computational linguistics | 2018
Karën Fort; Bruno Guillaume; Matthieu Constant; Nicolas Lefebvre; Yann-Alan Pilatte