Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Declan Groves is active.

Publication


Featured researches published by Declan Groves.


workshop on statistical machine translation | 2006

Contextual Bitext-Derived Paraphrases in Automatic MT Evaluation

Karolina Owczarzak; Declan Groves; Josef van Genabith; Andy Way

In this paper we present a novel method for deriving paraphrases during automatic MT evaluation using only the source and reference texts, which are necessary for the evaluation, and word and phrase alignment software. Using target language paraphrases produced through word and phrase alignment a number of alternative reference sentences are constructed automatically for each candidate translation. The method produces lexical and low-level syntactic paraphrases that are relevant to the domain in hand, does not use external knowledge resources, and can be combined with a variety of automatic MT evaluation system.


international conference on computational linguistics | 2004

Robust sub-sentential alignment of phrase-structure trees

Declan Groves; Mary Hearne; Andy Way

Data-Oriented Translation (DOT), based on Data-Oriented Parsing (DOP), is a language-independent MT engine which exploits parsed, aligned bitexts to produce very high quality translations. However, data acquisition constitutes a serious bottleneck as DOT requires parsed sentences aligned at both sentential and sub-structural levels. Manual sub-structural alignment is time-consuming, error-prone and requires considerable knowledge of both source and target languages and how they are related. Automating this process is essential in order to carry out the large-scale translation experiments necessary to assess the full potential of DOT.We present a novel algorithm which automatically induces sub-structural alignments between context-free phrase structure trees in a fast and consistent fashion requiring little or no knowledge of the language pair. We present results from a number of experiments which indicate that our method provides a serious alternative to manual alignment.


Machine Translation | 2005

Hybrid data-driven models of machine translation

Declan Groves; Andy Way

This paper presents an extended, harmonised account of our previous work on combining subsentential alignments from phrase-based statistical machine translation (SMT) and example-based MT (EBMT) systems to create novel hybrid data-driven systems capable of outperforming the baseline SMT and EBMT systems from which they were derived. In previous work, we demonstrated that while an EBMT system is capable of outperforming a phrase-based SMT (PBSMT) system constructed from freely available resources, a hybrid ‘example-based’ SMT system incorporating marker chunks and SMT subsentential alignments is capable of outperforming both baseline translation models for French–English translation. In this paper, we show that similar gains are to be had from constructing a hybrid ‘statistical’ EBMT system. Unlike the previous research, here we use the Europarl training and test sets, which are fast becoming the standard data in the field. On these data sets, while all hybrid ‘statistical’ EBMT variants still fall short of the quality achieved by the baseline PBSMT system, we show that adding the marker chunks to create a hybrid ‘example-based’ SMT system outperforms the two baseline systems from which it is derived. Furthermore, we provide further evidence in favour of hybrid systems by adding an SMT target-language model to the EBMT system, and demonstrate that this too has a positive effect on translation quality. We also show that many of the subsentential alignments derived from the Europarl corpus are created by either the PBSMT or the EBMT system, but not by both. In sum, therefore, despite the obvious convergence of the two paradigms, the crucial differences between SMT and EBMT contribute positively to the overall translation quality. The central thesis of this paper is that any researcher who continues to develop an MT system using either of these approaches will benefit further from integrating the advantages of the other model; dogged adherence to one approach will lead to inferior systems being developed.


international conference natural language processing | 2010

OpenMaTrEx: a free/open-source marker-driven example-based machine translation system

Sandipan Dandapat; Mikel L. Forcada; Declan Groves; Sergio Penkale; John Tinsley; Andy Way

We describe OpenMaTrEx, a free/open-source example-based machine translation (EBMT) system based on the marker hypothesis, comprising a marker-driven chunker, a collection of chunk aligners, and two engines: one based on a simple proof-of-concept monotone EBMT recombinator and a Moses-based statistical decoder. Open-MaTrEx is a free/open-source release of the basic components of MaTrEx, the Dublin City University machine translation system.


Machine Translation | 2013

Predicting sentence translation quality using extrinsic and language independent features

Ergun Biçici; Declan Groves; Josef van Genabith

We develop a top performing model for automatic, accurate, and language independent prediction of sentence-level statistical machine translation (SMT) quality with or without looking at the translation outputs. We derive various feature functions measuring the closeness of a given test sentence to the training data and the difficulty of translating the sentence. We describe mono feature functions that are based on statistics of only one side of the parallel training corpora and duo feature functions that incorporate statistics involving both source and target sides of the training data. Overall, we describe novel, language independent, and SMT system extrinsic features for predicting the SMT performance, which also rank high during feature ranking evaluations. We experiment with different learning settings, with or without looking at the translations, which help differentiate the contribution of different feature sets. We apply partial least squares and feature subset selection, both of which improve the results and we present ranking of the top features selected for each learning setting, providing an exhaustive analysis of the extrinsic features used. We show that by just looking at the test source sentences and not using the translation outputs at all, we can achieve better performance than a baseline system using SMT model dependent features that generated the translations. Furthermore, our prediction system is able to achieve the


cross-language evaluation forum | 2004

Dublin city university at CLEF 2004: experiments with the ImageCLEF st. andrew's collection

Gareth J. F. Jones; Declan Groves; Anna Khasin; Adenike M. Lam-Adesina; Bart Mellebeek; Andy Way


meeting of the association for computational linguistics | 2005

Hybrid Example-Based SMT: the Best of Both Worlds?

Declan Groves; Andy Way

2


conference of the association for machine translation in the americas | 2006

Example-Based Machine Translation of the Basque Language

Nicolas Stroppa; Declan Groves; Andy Way; Kepa Sarasola; Informatika Fakultatea


Archive | 2005

Final Report of the 2005 Language Engineering Workshop on Statistical Machine Translation by Parsing

Andrea Burbank; Marine Carpuat; Stephen Clark; Markus Dreyer; Pamela Fox; Declan Groves; Mary Hearne; I. Dan Melamed; Yihai Shen; Ben Wellington; Dekai Wu

2 nd best performance overall according to the official results of the quality estimation task (QET) challenge when also looking at the translation outputs. Our representation and features achieve the top performance in QET among the models using the SVR learning model.


Armstrong, Stephen and Flanagan, Marian and Graham, Yvette and Groves, Declan and Mellebeek, Bart and Morrissey, Sara and Stroppa, Nicolas and Way, Andy (2006) MaTrEx: machine translation using examples. In: TC-STAR OpenLab on Speech Translation 2006, 30 March - 1 April 2006, Trento, Italy. | 2006

MaTrEx: machine translation using examples

Stephen Armstrong; Marian Flanagan; Yvette Graham; Declan Groves; Bart Mellebeek; Sara Morrissey; Nicolas Stroppa; Andy Way

For the CLEF 2004 ImageCLEF St Andrews Collection task the Dublin City University group carried out three sets of experiments: standard cross-language information retrieval (CLIR) runs using topic translation via machine translation (MT), combination of this run with image matching results from the GIFT/Viper system, and a novel document rescoring approach based on automatic MT evaluation metrics. Our standard MT-based CLIR works well on this task. Encouragingly combination with image matching lists is also observed to produce small positive changes in the retrieval output. However, rescoring using the MT evaluation metrics in their current form significantly reduced retrieval effectiveness.

Collaboration


Dive into the Declan Groves's collaboration.

Top Co-Authors

Avatar

Andy Way

Dublin City University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Mary Hearne

Dublin City University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Stephen Doherty

University of New South Wales

View shared research outputs
Researchain Logo
Decentralizing Knowledge