Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Alessandro Lenci is active.

Publication


Featured researches published by Alessandro Lenci.


Computational Linguistics | 2010

Distributional memory: A general framework for corpus-based semantics

Marco Baroni; Alessandro Lenci

Research into corpus-based semantics has focused on the development of ad hoc models that treat single tasks, or sets of closely related tasks, as unrelated challenges to be tackled by extracting different kinds of distributional information from the corpus. As an alternative to this “one task, one model” approach, the Distributional Memory framework extracts distributional information once and for all from the corpus, in the form of a set of weighted word-link-word tuples arranged into a third-order tensor. Different matrices are then generated from the tensor, and their rows and columns constitute natural spaces to deal with different semantic problems. In this way, the same distributional information can be shared across tasks such as modeling word similarity judgments, discovering synonyms, concept categorization, predicting selectional preferences of verbs, solving analogy problems, classifying relations between word pairs, harvesting qualia structures with patterns or example pairs, predicting the typical properties of concepts, and classifying verbs into alternation classes. Extensive empirical testing in all these domains shows that a Distributional Memory implementation performs competitively against task-specific algorithms recently reported in the literature for the same tasks, and against our implementations of several state-of-the-art methods. The Distributional Memory approach is thus shown to be tenable despite the constraints imposed by its multi-purpose nature.


TEXT, SPEECH AND LANGUAGE TECHNOLOGY | 2003

Building the Italian Syntactic-Semantic Treebank

Simonetta Montemagni; Francesco Barsotti; Marco Battista; Nicoletta Calzolari; Ornella Corazzari; Alessandro Lenci; Antonio Zampolli; Francesca Fanciulli; Maria Massetani; Remo Raffaelli; Roberto Basili; Maria Teresa Pazienza; Dario Saracino; Fabio Massimo Zanzotto; Nadia Mana; Fabio Pianesi; Rodolfo Delmonte

The paper reports on the design and construction of a multi-layered corpus of Italian, annotated at the syntactic and lexico-semantic levels, whose development is supported by dedicated software augmented with an intelligent interface. The issue of evaluating this type of resource is also addressed.


conference of the european chapter of the association for computational linguistics | 2014

Chasing Hypernyms in Vector Spaces with Entropy

Enrico Santus; Alessandro Lenci; Qin Lu; Sabine Schulte im Walde

In this paper, we introduce SLQS , a new entropy-based measure for the unsupervised identification of hypernymy and its directionality in Distributional Semantic Models (DSMs). SLQS is assessed through two tasks: (i.) identifying the hypernym in hyponym-hypernym pairs, and (ii.) discriminating hypernymy among various semantic relations. In both tasks, SLQS outperforms other state-of-the-art measures.


Archive | 2010

Ontology and the lexicon : a natural language processing perspective

Chu-Ren Huang; Nicoletta Calzolari; Aldo Gangemi; Alessandro Lenci; Alessandro Oltramari; Laurent Prévot

Part I. Fundamental Aspects: 1. Ontology and the lexicon: a multi-disciplinary perspective Laurent Prevot, Chu-Ren Huang, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lenci and Alessandro Oltramari 2. Formal ontology as interlingua: the SUMO and WordNet linking project and GlobalWordNet Adam Pease and Christiane Fellbaum 3. Interfacing WordNet with DOLCE: towards OntoWordNet Aldo Gangemi, Nicola Guarino, Claudio Masolo and Alessandro Oltramari 4. Reasoning over natural language text by means of FrameNet and ontologies Jan Scheffczyk, Collin F. Baker and Srini Narayanan 5. Synergizing ontologies and the lexicon: a roadmap Alessandro Oltramari, Aldo Gangemi, Chu-Ren Huang, Nicoletta Calzolari, Alessandro Lenci and Laurent Prevot Part II. Discovery and Representation of Conceptual Systems: 6. Experiments of ontology construction with formal concept analysis SuJian Li, Qin Lu and Wenjie Li 7. Ontology, lexicon, and fact repository as leveraged to interpret events of change Marjorie McShane, Sergei Nirenburg and Stephen Beale 8. Hantology: conceptual system discovery based on orthographic convention Ya-Min Chou and Chu-Ren Huang 9. Whats in a schema? A formal metamodel for ECG and FrameNet Aldo Gangemi Part III. Interfacing Ontologies and Lexical Resources: 10. Interfacing ontologies and lexical resources Laurent Prevot, Stefano Borgo and Alessandro Oltramari 11. Sinica BOW (Bilingual Ontological WordNet): integration of BilingualWord-Net and SUMO Chu-Ren Huang, Ru-Yng Chang and Hsiang-bin Lee 12. Ontology-based semantic lexicons: mapping between terms and object descriptions Paul Buitelaar 13. Merging global and specialized linguistic ontologies Manuela Speranza and Bernardo Magnini Part IV. Learning and Using Ontological Knowledge: 14. The life cycle of knowledge Alessandro Lenci 15. The omega ontology Andrew Philpot, Eduard Hovy and Patrick Pantel 16. Automatic acquisition of lexico-semantic knowledge for question answering Lonneke van der Plas, Gosse Bouma and Jori Mur 17. Agricultural ontology construction and maintenance in Thai Asanee Kawtrakul and Aurawan Imsombut.The relation between ontologies and language is at the forefront of both natural language processing (NLP) and knowledge engineering. Ontologies, as widely used models in semantic technologies, have much in common with the lexicon. A lexicon organizes words as a conventional inventory of concepts, while an ontology formalizes concepts and their logical relations. A shared lexicon is the prerequisite for knowledge-sharing through language, and a shared ontology is the prerequisite for knowledge-sharing through information technology. In building models of language, computational linguists must be able to map accurately the relations between words and the concepts that they can be linked to. This book focuses on the integration of lexical resources and semantic technologies. It will be of interest to researchers and graduate students in NLP, computational linguistics and knowledge engineering, as well as in semantics, psycholinguistics, lexicology and morphology/syntax.


Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications | 2015

EVALution 1.0: an Evolving Semantic Dataset for Training and Evaluation of Distributional Semantic Models

Enrico Santus; Frances Yung; Alessandro Lenci; Chu-Ren Huang

In this paper, we introduce EVALution 1.0, a dataset designed for the training and the evaluation of Distributional Semantic Models (DSMs). This version consists of almost 7.5K tuples, instantiating several semantic relations between word pairs (including hypernymy, synonymy, antonymy, meronymy). The dataset is enriched with a large amount of additional information (i.e. relation domain, word frequency, word POS, word semantic field, etc.) that can be used for either filtering the pairs or performing an in-depth analysis of the results. The tuples were extracted from a combination of ConceptNet 5.0 and WordNet 4.0, and subsequently filtered through automatic methods and crowdsourcing in order to ensure their quality. The dataset is freely downloadable1. An extension in RDF format, including also scripts for data processing, is under development.


Proceedings of the Workshop on Geometrical Models of Natural Language Semantics | 2009

One Distributional Memory, Many Semantic Spaces

Marco Baroni; Alessandro Lenci

We propose an approach to corpus-based semantics, inspired by cognitive science, in which different semantic tasks are tackled using the same underlying repository of distributional information, collected once and for all from the source corpus. Task-specific semantic spaces are then built on demand from the repository. A straightforward implementation of our proposal achieves state-of-the-art performance on a number of unrelated tasks.


Lecture Notes in Computer Science | 2004

Automatic Classification and Analysis of Provisions in Italian Legal Texts: A Case Study

Roberto Bartolini; Alessandro Lenci; Simonetta Montemagni; Vito Pirrelli; Claudia Soria

In this paper we address the problem of automatically enriching legal texts with semantic annotation, an essential pre–requisite to effective indexing and retrieval of legal documents. This is done through illustration of SALEM (Semantic Annotation for LEgal Management), a computational system developed for automated semantic annotation of (Italian) law texts. SALEM is an incremental system using Natural Language Processing techniques to perform two tasks: i) classify law paragraphs according to their regulatory content, and ii) extract relevant text fragments corresponding to specific semantic roles that are relevant for the different types of regulatory content. The paper sketches the overall architecture of SALEM and reports results of a preliminary case study on a sample of Italian law texts.


Proceedings of the 9th Web as Corpus Workshop (WaC-9) | 2014

The PAISÀ Corpus of Italian Web Texts

Verena Lyding; Egon W. Stemle; Claudia Borghetti; Marco Brunello; Sara Castagnoli; Felice Dell'Orletta; Henrik Dittmann; Alessandro Lenci; Vito Pirrelli

PAISA is a Creative Commons licensed, large web corpus of contemporary Italian. We describe the design, harvesting, and processing steps involved in its creation.


meeting of the association for computational linguistics | 2007

ISA meets Lara: An incremental word space model for cognitively plausible simulations of semantic learning

Marco Baroni; Alessandro Lenci; Luca Onnis

We introduce Incremental Semantic Analysis, a fully incremental word space model, and we test it on longitudinal child-directed speech data. On this task, ISA outperforms the related Random Indexing algorithm, as well as a SVD-based technique. In addition, the model has interesting properties that might also be characteristic of the semantic space of children.


meeting of the association for computational linguistics | 2003

RDF Instantiation of ISLE/MILE Lexical Entries

Nancy Ide; Alessandro Lenci; Nicoletta Calzolari

In this paper we describe the overall model for MILE lexical entries and provide an instantiation of the model in RDF/OWL. This work has been done with an eye toward the goal of creating a web-based registry of lexical data categories and enabling the description of lexical information by establishing relations among them, and/or using predefined objects that may reside at various locations on the web. It is also assumed that using OWL specifications to enhance specifications of the ontology of lexical objects will eventually enable the exploitation of inferencing engines to retrieve and possibly create lexical information on the fly, as suited to particular contexts. As such, the model and RDF instantiation provided here are in line with the goals of ISO TC37 SC4, and should be fully mappable to the proposed pivot.

Collaboration


Dive into the Alessandro Lenci's collaboration.

Top Co-Authors

Avatar

Chu-Ren Huang

Hong Kong Polytechnic University

View shared research outputs
Top Co-Authors

Avatar

Vito Pirrelli

National Research Council

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Enrico Santus

Japan Advanced Institute of Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge