Mariano Felice | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Mariano Felice is active.

Explore More

Publication

Featured researches published by Mariano Felice.

conference on computational natural language learning | 2014

Grammatical error correction using hybrid systems and type filtering

Mariano Felice; Zheng Yuan; Øistein E. Andersen; Helen Yannakoudakis; Ekaterina Kochmar

[We would like to thank] Cambridge English Language Assessment, a division of Cambridge Assessment, for supporting this research.

north american chapter of the association for computational linguistics | 2015

Towards a standard evaluation method for grammatical error detection and correction

Mariano Felice; Ted Briscoe

We present a novel evaluation method for grammatical error correction that addresses problems with previous approaches and scores systems in terms of improvement on the original text. Our method evaluates corrections at the token level using a globally optimal alignment between the source, a system hypothesis, and a reference. Unlike the M 2 Scorer, our method provides scores for both detection and correction and is sensitive to different types of edit operations.

conference of the european chapter of the association for computational linguistics | 2014

Generating artificial errors for grammatical error correction

Mariano Felice; Zheng Yuan

This paper explores the generation of artificial errors for correcting grammatical mistakes made by learners of English as a second language. Artificial errors are injected into a set of error-free sentences in a probabilistic manner using statistics from a corpus. Unlike previous approaches, we use linguistic information to derive error generation probabilities and build corpora to correct several error types, including open-class errors. In addition, we also analyse the variables involved in the selection of candidate sentences. Experiments using the NUCLE corpus from the CoNLL 2013 shared task reveal that: 1) training on artificially created errors improves precision at the expense of recall and 2) different types of linguistic information are better suited for correcting different error types.

meeting of the association for computational linguistics | 2017

Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction

Christopher Bryant; Mariano Felice; Ted Briscoe

Until now, error type performance for Grammatical Error Correction (GEC) systems could only be measured in terms of recall because system output is not annotated. To overcome this problem, we introduce ERRANT, a grammatical ERRor ANnotation Toolkit designed to automatically extract edits from parallel original and corrected sentences and classify them according to a new, dataset-agnostic, rule-based framework. This not only facilitates error type evaluation at different levels of granularity, but can also be used to reduce annotator workload and standardise existing GEC datasets. Human experts rated the automatic edits as “Good” or “Acceptable” in at least 95% of cases, so we applied ERRANT to the system output of the CoNLL-2014 shared task to carry out a detailed error type analysis for the first time.

ACM Crossroads Student Magazine | 2014

To err is human, to correct is divine

Mariano Felice; Zheng Yuan

Technology has made language learning a more interactive and enjoyable experience, but it has never been smart enough to replace human tutors. However, the latest advances in automated grammatical error correction open up new horizons. Could software ever replace our language teachers?

Machine Translation | 2013

Investigating the contribution of linguistic information to quality estimation

Mariano Felice; Lucia Specia

This paper describes a study on the contribution of linguistically-informed features to the task of quality estimation for machine translation at sentence level. A standard regression algorithm is used to build models using a combination of linguistic and non-linguistic features extracted from the input text and its machine translation. Experiments with three English–Spanish translation datasets show that linguistic features on their own are not able to outperform shallower features based on statistics from the input text, its translation and additional corpora. However, further analysis suggests that linguistic information can be useful to produce better results if carefully combined with other features. An in-depth analysis of the results highlights a number of issues related to the use of linguistic features.

international conference on computational linguistics | 2016

Automatic Extraction of Learner Errors in ESL Sentences Using Linguistically Enhanced Alignments.

Mariano Felice; Christopher Bryant; Edward John Briscoe

We propose a new method of automatically extracting learner errors from parallel English as a Second Language (ESL) sentences in an effort to regularise annotation formats and reduce inconsistencies. Specifically, given an original and corrected sentence, our method first uses a linguistically enhanced alignment algorithm to determine the most likely mappings between tokens, and secondly employs a rule-based function to decide which alignments should be merged. Our method beats all previous approaches on the tested datasets, achieving state-of-the-art results for automatic error extraction.

workshop on statistical machine translation | 2012