Federico Sangati
University of Amsterdam
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Federico Sangati.
meeting of the association for computational linguistics | 2009
Federico Sangati; Willem H. Zuidema
We present several algorithms for assigning heads in phrase structure trees, based on different linguistic intuitions on the role of heads in natural language syntax. Starting point of our approach is the observation that a head-annotated treebank defines a unique lexicalized tree substitution grammar. This allows us to go back and forth between the two representations, and define objective functions for the unsupervised learning of head assignments in terms of features of the implicit lexicalized tree grammars. We evaluate algorithms based on the match with gold standard head-annotations, and the comparative parsing accuracy of the lexicalized grammars they give rise to. On the first task, we approach the accuracy of hand-designed heuristics for English and inter-annotation-standard agreement for German. On the second task, the implied lexicalized grammars score 4% points higher on parsing accuracy than lexicalized grammars derived by commonly used heuristics.
international workshop conference on parsing technologies | 2009
Federico Sangati; Willem H. Zuidema; Rens Bod
We propose a framework for dependency parsing based on a combination of discriminative and generative models. We use a discriminative model to obtain a k-best list of candidate parses, and subsequently rerank those candidates using a generative model. We show how this approach allows us to evaluate a variety of generative models, without needing different parser implementations. Moreover, we present empirical results that show a small improvement over state-of-the-art dependency parsing of English sentences.
north american chapter of the association for computational linguistics | 2015
Federico Sangati; Andreas van Cranenburgh
We present a novel approach for the identification of multiword expressions (MWEs). The methodology extracts a large set of recurring syntactic fragments from a given treebank using a Tree-Kernel method. Di erently from previous studies, the expressions underlying these fragments are arbitrarily long and can include intervening gaps. In the initial study we use these fragments to identify MWEs as a parsing task (in a supervised manner) as proposed by Green et al. (2011). Here we obtain a small improvement over previous results. In the second part, we compare various association measures in reranking the expressions underlying these fragments in an unsupervised fashion. We show how a newly defined measure (Log Inside Ratio) based on statistical parsing techniques is able to outperform classical association measures in the French data.
empirical methods in natural language processing | 2011
Federico Sangati; Willem H. Zuidema
language resources and evaluation | 2010
Federico Sangati; Willem H. Zuidema; Rens Bod
Proceedings of the Second Workshop on Statistical Parsing of Morphologically Rich Languages | 2011
Andreas van Cranenburgh; Remko Scha; Federico Sangati
Studies in Mycology | 2009
Federico Sangati; Chiara Mazza
Nederlands Tijdschrift Voor Tandheelkunde | 2010
Federico Sangati; Willem H. Zuidema; Rens Bod
language resources and evaluation | 2016
Gyri Smørdal Losnegaard; Federico Sangati; Carla Parra Escartín; Agata Savary; Sascha Bargmann; Johanna Monti
Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017) | 2017
Agata Savary; Carlos Ramisch; Silvio Cordeiro; Federico Sangati; Veronika Vincze; Behrang QasemiZadeh; Marie Candito; Fabienne Cap; Voula Giouli; Ivelina Stoyanova; Antoine Doucet