Alessandro Calefati
University of Insubria
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Alessandro Calefati.
international conference on web information systems and technologies | 2016
Lucia Noce; Ignazio Gallo; Alessandro Zamberletti; Alessandro Calefati
In this paper we propose a query suggestion method for price comparison search engines. Query suggestion techniques are used for generating alternative queries to facilitate web users in information seeking; in this specific domain, suggestions provided to web users need to be properly generated taking into account that the suggested products must be still available for sale. We propose a novel approach based on a slightly variant of classical query-URL graphs: the query-product click-through bipartite graph. Information extracted both from search engine logs and specific domain features are exploited to build the graph, and one of the advantages of this model is that such a graph can be used to suggest not only related queries but also related products. Concepts used in the proposed method are not restricted to our context but are used in many other major e-commerce and search engine websites, we tested the model on several challenging datasets, and also compared with a recent query suggestion approach specifically designed for price comparison engines. Our solution outperforms the competing approach, achieving higher results in terms of relevance of the provided suggestions and coverage rates on top-8 suggestions.
document engineering | 2016
Alessandro Calefati; Ignazio Gallo; Alessandro Zamberletti; Lucia Noce
The rise of online shopping has hurt physical retailers, which struggle to persuade customers to buy products in physical stores rather than online. Marketing flyers are a great mean to increase the visibility of physical retailers, but the unstructured offers appearing in those documents cannot be easily compared with similar online deals, making it hard for a customer to understand whether it is more convenient to order a product online or to buy it from the physical shop. In this work we tackle this problem, introducing a content extraction algorithm that automatically extracts structured data from flyers. Unlike competing approaches that mainly focus on textual content or simply analyze font type, color and text positioning, we propose a new approach that uses Convolutional Neural Networks to classify words extracted from flyers typically used in marketing materials to attract the attention of readers towards specific deals. We obtained good results and a high language and genre independence.
document engineering | 2016
Lucia Noce; Ignazio Gallo; Alessandro Zamberletti; Alessandro Calefati
In this paper we introduce a novel document image classification method based on combined visual and textual information. The proposed algorithms pipeline is inspired to the ones of other recent state-of-the-art methods which perform document image classification using Convolutional Neural Networks. The main addition of our work is the introduction of a preprocessing step embedding additional textual information into the processed document images. To do so we combine Optical Character Recognition and Natural Language Processing algorithms to extract and manipulate relevant text concepts from document images. Such textual information is then visually embedded within each document image to improve the classification results of a Convolutional Neural Network. Our experiments prove that the overall document classification accuracy of a Convolutional Neural Network trained using these text-augmented document images is considerably higher than the one achieved by a similar model trained solely on classic document images, especially when different classes of documents share similar visual characteristics.
digital image computing techniques and applications | 2016
Ignazio Gallo; Lucia Noce; Alessandro Zamberletti; Alessandro Calefati
In this manuscript we propose a novel method for jointly page stream segmentation and multi-page document classification.The end goal is to classify a stream of pages as belonging to different classes of documents. We take advantage of the recent state-of-the-art results achieved using deep architectures in related fields such as document image classification, and we adopt similar models to obtain satisfying classification accuracies and a low computational complexity. Our contribution is twofold: first, the extraction of visual features from the processed documents is automatically performed by the chosen Convolutional Neural Network; second, the predictions of the same network are further refined using an additional deep model which processes them in a classic sliding-window manner to help finding and solving classification errors committed by the first network. The proposed pipeline has been evaluated on a publicly available dataset composed of more than half a million multi-page documents collected by an on-line loan comparison company, showing excellent results and high efficiency.
international conference on document analysis and recognition | 2017
Ignazio Gallo; Shah Nawaz; Alessandro Calefati
document analysis systems | 2018
Shah Nawaz; Alessandro Calefati; Nisar Ahmed; Ignazio Gallo
british machine vision conference | 2018
Alessandro Calefati; Muhammad Kamran Janjua; Shah Nawaz; Ignazio Gallo
arXiv: Computer Vision and Pattern Recognition | 2018
Shah Nawaz; Alessandro Calefati; Muhammad Kamran Janjua; Ignazio Gallo
arXiv: Computer Vision and Pattern Recognition | 2018
Ignazio Gallo; Alessandro Calefati; Shah Nawaz; Muhammad Kamran Janjua
arXiv: Computer Vision and Pattern Recognition | 2018
Muhammad Kamran Janjua; Shah Nawaz; Alessandro Calefati; Ignazio Gallo