Mariano Felice
University of Cambridge
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Mariano Felice.
conference on computational natural language learning | 2014
Mariano Felice; Zheng Yuan; Øistein E. Andersen; Helen Yannakoudakis; Ekaterina Kochmar
[We would like to thank] Cambridge English Language Assessment, a division of Cambridge Assessment, for supporting this research.
north american chapter of the association for computational linguistics | 2015
Mariano Felice; Ted Briscoe
We present a novel evaluation method for grammatical error correction that addresses problems with previous approaches and scores systems in terms of improvement on the original text. Our method evaluates corrections at the token level using a globally optimal alignment between the source, a system hypothesis, and a reference. Unlike the M 2 Scorer, our method provides scores for both detection and correction and is sensitive to different types of edit operations.
conference of the european chapter of the association for computational linguistics | 2014
Mariano Felice; Zheng Yuan
This paper explores the generation of artificial errors for correcting grammatical mistakes made by learners of English as a second language. Artificial errors are injected into a set of error-free sentences in a probabilistic manner using statistics from a corpus. Unlike previous approaches, we use linguistic information to derive error generation probabilities and build corpora to correct several error types, including open-class errors. In addition, we also analyse the variables involved in the selection of candidate sentences. Experiments using the NUCLE corpus from the CoNLL 2013 shared task reveal that: 1) training on artificially created errors improves precision at the expense of recall and 2) different types of linguistic information are better suited for correcting different error types.
meeting of the association for computational linguistics | 2017
Christopher Bryant; Mariano Felice; Ted Briscoe
Until now, error type performance for Grammatical Error Correction (GEC) systems could only be measured in terms of recall because system output is not annotated. To overcome this problem, we introduce ERRANT, a grammatical ERRor ANnotation Toolkit designed to automatically extract edits from parallel original and corrected sentences and classify them according to a new, dataset-agnostic, rule-based framework. This not only facilitates error type evaluation at different levels of granularity, but can also be used to reduce annotator workload and standardise existing GEC datasets. Human experts rated the automatic edits as “Good” or “Acceptable” in at least 95% of cases, so we applied ERRANT to the system output of the CoNLL-2014 shared task to carry out a detailed error type analysis for the first time.
ACM Crossroads Student Magazine | 2014
Mariano Felice; Zheng Yuan
Technology has made language learning a more interactive and enjoyable experience, but it has never been smart enough to replace human tutors. However, the latest advances in automated grammatical error correction open up new horizons. Could software ever replace our language teachers?
Machine Translation | 2013
Mariano Felice; Lucia Specia
This paper describes a study on the contribution of linguistically-informed features to the task of quality estimation for machine translation at sentence level. A standard regression algorithm is used to build models using a combination of linguistic and non-linguistic features extracted from the input text and its machine translation. Experiments with three English–Spanish translation datasets show that linguistic features on their own are not able to outperform shallower features based on statistics from the input text, its translation and additional corpora. However, further analysis suggests that linguistic information can be useful to produce better results if carefully combined with other features. An in-depth analysis of the results highlights a number of issues related to the use of linguistic features.
international conference on computational linguistics | 2016
Mariano Felice; Christopher Bryant; Edward John Briscoe
We propose a new method of automatically extracting learner errors from parallel English as a Second Language (ESL) sentences in an effort to regularise annotation formats and reduce inconsistencies. Specifically, given an original and corrected sentence, our method first uses a linguistically enhanced alignment algorithm to determine the most likely mappings between tokens, and secondly employs a rule-based function to decide which alignments should be merged. Our method beats all previous approaches on the tested datasets, achieving state-of-the-art results for automatic error extraction.
workshop on statistical machine translation | 2012
Mariano Felice; Lucia Specia
conference on computational natural language learning | 2013
Zheng Yuan; Mariano Felice
Archive | 2012
Mariano Felice