Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sara Silveira is active.

Publication


Featured researches published by Sara Silveira.


international conference on tools with artificial intelligence | 2013

TM-Gen: A Topic Map Generator from Text Documents

Angel Luis Garrido; María Granados Buey; Sandra Escudero; Sergio Ilarri; Eduardo Mena; Sara Silveira

The vast amount of text documents stored in digital format is growing at a frantic rhythm each day. Therefore, tools able to find accurate information searching in natural language information repositories are gaining great interest in recent years. In this context, there are especially interesting tools capable of dealing with large amounts of text information and deriving human-readable summaries. However, one step further is to be able not only to summarize, but to extract the knowledge stored in those texts, and even represent it graphically. In this paper we present an architecture to generate automatically a conceptual representation of knowledge stored in a set of text-based documents. For this purpose we have used the topic maps standard and we have developed a method that combines text mining, statistics, linguistic tools, and semantics to obtain a graphical representation of the information contained therein, which can be coded using a knowledge representation language such as RDF or OWL. The procedure is language-independent, fully automatic, self-adjusting, and it does not need manual configuration by the user. Although the validation of a graphic knowledge representation system is very subjective, we have been able to take advantage of an intermediate product of the process to make a experimental validation of our proposals.


applications of natural language to data bases | 2012

Extracting multi-document summaries with a double clustering approach

Sara Silveira; António Branco

This paper presents a method for extractive multi-document summarization that explores a two-phase clustering approach. First, sentences are clustered by similarity, and one sentence per cluster is selected, to reduce redundancy. Then, in order to group them according to topics, those sentences are clustered considering the collection of keywords. Additionally, the summarization process further includes a sentence simplification step, which aims not only to create simpler and more incisive sentences, but also to make room for the inclusion of relevant content in the summary as much as possible.


information reuse and integration | 2012

Combining a double clustering approach with sentence simplification to produce highly informative multi-document summaries

Sara Silveira; António Branco

This paper presents a method for extractive multi-document summarization that explores a two-phase clustering approach that, combined with a sentence simplification procedure, aims to generate more useful summaries. First, sentences are clustered by similarity, and one sentence per cluster is selected, to reduce redundancy. Then, in order to group them according to topics, those sentences are clustered considering the collection of keywords. Finally, the summarization process includes a sentence simplification step, which aims not only to create simpler and more incisive sentences, but also to make room for the inclusion of further relevant content in the summary. Evaluation reveals that the approach pursued produces highly informative summaries, containing relevant data and no repeated information.


text speech and dialogue | 2012

Using a Double Clustering Approach to Build Extractive Multi-document Summaries

Sara Silveira; António Branco

This paper presents a method for extractive multi-document summarization that explores a two-phase clustering approach. First, sentences are clustered by similarity, and one sentence per cluster is selected, to reduce redundancy. Then, in order to group them according to topics, those sentences are clustered considering the collection of keywords that represent the topics in the set of texts. Evaluation reveals that the approach pursued produces highly informative summaries, containing many relevant data and no repeated information.


meeting of the association for computational linguistics | 2009

LX-Center: a center of online linguistic services

António Branco; Francisco Costa; Eduardo Ferreira; Pedro Martins; Filipe Nunes; João Ricardo Silva; Sara Silveira

This is a paper supporting the demonstration of the LX-Center at ACL-IJCNLP-09. LX-Center is a web center of online linguistic services aimed at both demonstrating a range of language technology tools and at fostering the education, research and development in natural language science and technology.


processing of the portuguese language | 2008

XisQuê: An Online QA Service for Portuguese

António Branco; Lino Rodrigues; João Ricardo Silva; Sara Silveira

This paper describes XisQue ( http://xisque.di.fc.ul.pt ) an online service for real-time, open-domain question answering (QA) on the Portuguese Web.


ibero american conference on ai | 2008

Real-Time Open-Domain QA on the Portuguese Web

António Branco; Lino Rodrigues; João Ricardo Silva; Sara Silveira

This paper presents a system for real-time, open-domain question answering on the Web of documents written in Portuguese, prepared to handle factual questions and available as a freely accessible online service. In order to deliver candidate answers to input questions phrased in Portuguese, this system resorts to a number of shallow processing tools and question answering techniques that are specifically geared to cope with the Portuguese language.


international conference natural language processing | 2014

Uncovering Discourse Relations to Insert Connectives between the Sentences of an Automatic Summary

Sara Silveira; António Branco

This paper presents a machine learning approach to find and classify discourse relations between two unseen sentences. It describes the process of training a classifier that aims to determine (i) if there is any discourse relation among two sentences, and, if a relation is found, (ii) which is that relation. The final goal of this task is to insert discourse connectives between sentences seeking to enhance text cohesion of a summary produced by an extractive summarization system for the Portuguese language.


international conference on agents and artificial intelligence | 2013

Sentence Reduction Algorithms to Improve Multi-document Summarization

Sara Silveira; António Branco

Multi-document summarization aims to create a single summary based on the information conveyed by a collection of texts. After the candidate sentences have been identified and ordered, it is time to select which will be included in the summary. In this paper, we describe an approach that uses sentence reduction, both lexical and syntactic, to help improve the compression step in the summarization process. Three different algorithms are proposed and discussed. Sentence reduction is performed by removing specific sentential constructions conveying information that can be considered to be less relevant to the general message of the summary. Thus, the rationale is that sentence reduction not only removes expendable information, but also makes room for further relevant data in a summary.


language resources and evaluation | 2010

Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank.

António Branco; Francisco Costa; João Ricardo Silva; Sara Silveira; Sérgio Castro; Mariana Avelãs; Clara Pinto; João Graça

Collaboration


Dive into the Sara Silveira's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge