Donatella Firmani
Roma Tre University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Donatella Firmani.
knowledge discovery and data mining | 2017
Alessio Conte; Donatella Firmani; Caterina Mordente; Maurizio Patrignani; Riccardo Torlone
K-plexes are a formal yet flexible way of defining communities in networks. They generalize the notion of cliques and are more appropriate in most real cases: while a node of a clique C is connected to all other nodes of C, a node of a k-plex may miss up to k connections. Unfortunately, computing all maximal k-plexes is a gruesome task and state-of-the-art algorithms can only process small-size networks. In this paper we propose a new approach for enumerating large k-plexes in networks that speeds up the search by several orders of magnitude, leveraging on (i) methods for strongly reducing the search space and (ii) efficient techniques for the computation of maximal cliques. Several experiments show that our strategy is effective and is able to increase the size of the networks for which the computation of large k-plexes is feasible from a few hundred to several hundred thousand nodes.
knowledge discovery and data mining | 2018
Donatella Firmani; Marco Maiorino; Paolo Merialdo; Elena Nieddu
In Codice Ratio is a research project to study tools and techniques for analyzing the contents of historical documents conserved in the Vatican Secret Archives (VSA). In this paper, we present our efforts to develop a system to support the transcription of medieval manuscripts. The goal is to provide paleographers with a tool to reduce their efforts in transcribing large volumes, as those stored in the VSA, producing good transcriptions for significant portions of the manuscripts. We propose an original approach based on character segmentation. Our solution is able to deal with the dirty segmentation that inevitably occurs in handwritten documents. We use a convolutional neural network to recognize characters, and statistical language models to compose word transcriptions. Our approach requires minimal training effort, making the transcription process more scalable, as the production of training sets requires a few pages and can be easily crowdsourced. We have conducted experiments on manuscripts from the Vatican Registers, an unreleased corpus containing the correspondence of the popes. With training data produced by 120 high school students, our system has been able to produce good transcriptions that can be used by paleographers as a solid basis to speedup the transcription process at a large scale.
international conference on management of data | 2018
Sainyam Galhotra; Donatella Firmani; Barna Saha; Divesh Srivastava
Entity resolution (ER) seeks to identify which records in a data set refer to the same real-world entity. Given the diversity of ways in which entities can be represented, matched and distinguished, ER is known to be a challenging task for automated strategies, but relatively easier for expert humans. In our work, we abstract the knowledge of experts with the notion of a binary oracle. Our oracle can answer questions of the form do records u and v refer to the same entity? under a flexible error model, allowing for some questions to be more difficult to answer correctly than others. Our contribution is a general error correction tool that can be leveraged by a variety of hybrid-human machine ER algorithms, based on a formal way for selecting indirect control queries. In our experiments we demonstrate that correction-less ER algorithms equipped with our tool can perform even better than recent ER algorithms specifically designed for correcting errors. Our control queries are selected among those that provide strongest connectivity between records of each cluster, based on the concept ofgraph expanders (which are sparse graphs with formal connectivity properties). We give formal performance guarantees for our toolkit and provide experiments on real and synthetic data.
AI*CH@AI*IA | 2017
Donatella Firmani; Paolo Merialdo; Elena Nieddu; Simone Scardapane
SEBD | 2017
Serena Ammirati; Donatella Firmani; Marco Maiorino; Paolo Merialdo; Elena Nieddu; Andrea Rossi
knowledge discovery and data mining | 2018
Donatella Firmani; Marco Maiorino; Paolo Merialdo; Elena Nieddu
SEBD | 2018
Alessio Conte; Donatella Firmani; Caterina Mordente; Maurizio Patrignani; Riccardo Torlone
SEBD | 2018
Letizia Tanca; Paolo Atzeni; Davide Azzalini; Ilaria Bartolini; Luca Cabibbo; Luca Calderoni; Paolo Ciaccia; Valter Crescenzi; Juan Carlos De Martin; Selina Fenoglietto; Donatella Firmani; Sergio Greco; Francesco Isgrò; Dario Maio; Davide Martinenghi; Maristella Matera; Paolo Merialdo; Cristian Molinaro; Marco Patella; Roberto Prevete; Elisa Quintarelli; Antonio Santangelo; Andrea Tagarelli; Guglielmo Tamburrini; Riccardo Torlone
IEEE Data(base) Engineering Bulletin | 2018
Donatella Firmani; Sainyam Galhotra; Barna Saha; Divesh Srivastava
26th Italian Symposium on Advanced Database Systems, SEBD 2018 | 2018
Letizia Tanca; Paolo Atzeni; Azzalini Davide; Bartolini Ilaria; Luca Cabibbo; Calderoni Luca; Ciaccia Paolo; Valter Crescenzi; Juan Carlos De Martin; Fenoglietto Selina; Donatella Firmani; Sergio Greco; Isgrò Francesco; Maio Dario; Davide Martinenghi; Matera Maristella; Paolo Merialdo; Molinaro Cristian; Patella Marco; Prevete Roberto; Quintarelli Elisa; Santangelo Antonio; Tagarelli Andrea; Tamburrini Guglielmo; Riccardo Torlone