Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Giuseppe Serra is active.

Publication


Featured researches published by Giuseppe Serra.


IEEE Transactions on Information Forensics and Security | 2011

A SIFT-Based Forensic Method for Copy–Move Attack Detection and Transformation Recovery

Irene Amerini; Lamberto Ballan; Roberto Caldelli; A. Del Bimbo; Giuseppe Serra

One of the principal problems in image forensics is determining if a particular image is authentic or not. This can be a crucial task when images are used as basic evidence to influence judgment like, for example, in a court of law. To carry out such forensic analysis, various technological instruments have been developed in the literature. In this paper, the problem of detecting if an image has been forged is investigated; in particular, attention has been paid to the case in which an area of an image is copied and then pasted onto another zone to create a duplication or to cancel something that was awkward. Generally, to adapt the image patch to the new context a geometric transformation is needed. To detect such modifications, a novel methodology based on scale invariant features transform (SIFT) is proposed. Such a method allows us to both understand if a copy-move attack has occurred and, furthermore, to recover the geometric transformation used to perform cloning. Extensive experimental results are presented to confirm that the technique is able to precisely individuate the altered area and, in addition, to estimate the geometric transformation parameters with high reliability. The method also deals with multiple cloning.


Multimedia Tools and Applications | 2011

Event detection and recognition for semantic annotation of video

Lamberto Ballan; Marco Bertini; Alberto Del Bimbo; Lorenzo Seidenari; Giuseppe Serra

Research on methods for detection and recognition of events and actions in videos is receiving an increasing attention from the scientific community, because of its relevance for many applications, from semantic video indexing to intelligent video surveillance systems and advanced human-computer interaction interfaces. Event detection and recognition requires to consider the temporal aspect of video, either at the low-level with appropriate features, or at a higher-level with models and classifiers than can represent time. In this paper we survey the field of event recognition, from interest point detectors and descriptors, to event modelling techniques and knowledge management technologies. We provide an overview of the methods, categorising them according to video production methods and video domains, and according to types of events and actions that are typical of these domains.


Signal Processing-image Communication | 2013

Copy-move forgery detection and localization by means of robust clustering with J-Linkage

Irene Amerini; Lamberto Ballan; Roberto Caldelli; Alberto Del Bimbo; Luca Del Tongo; Giuseppe Serra

Understanding if a digital image is authentic or not, is a key purpose of image forensics. There are several different tampering attacks but, surely, one of the most common and immediate one is copy-move. A recent and effective approach for detecting copy-move forgeries is to use local visual features such as SIFT. In this kind of methods, SIFT matching is often followed by a clustering procedure to group keypoints that are spatially close. Often, this procedure could be unsatisfactory, in particular in those cases in which the copied patch contains pixels that are spatially very distant among them, and when the pasted area is near to the original source. In such cases, a better estimation of the cloned area is necessary in order to obtain an accurate forgery localization. In this paper a novel approach is presented for copy-move forgery detection and localization based on the JLinkage algorithm, which performs a robust clustering in the space of the geometric transformation. Experimental results, carried out on different datasets, show that the proposed method outperforms other similar state-of-the-art techniques both in terms of copy-move forgery detection reliability and of precision in the manipulated patch localization.


IEEE MultiMedia | 2010

Video Annotation and Retrieval Using Ontologies and Rule Learning

Lamberto Ballan; Marco Bertini; Alberto Del Bimbo; Giuseppe Serra

An approach for automatic annotation and retrieval of video content uses semantic concept classifiers and ontologies to permit expanded queries to synonyms and concept specializations.


computer vision and pattern recognition | 2014

Gesture Recognition in Ego-centric Videos Using Dense Trajectories and Hand Segmentation

Lorenzo Baraldi; Francesco Paci; Giuseppe Serra; Luca Benini; Rita Cucchiara

We present a novel method for monocular hand gesture recognition in ego-vision scenarios that deals with static and dynamic gestures and can achieve high accuracy results using a few positive samples. Specifically, we use and extend the dense trajectories approach that has been successfully introduced for action recognition. Dense features are extracted around regions selected by a new hand segmentation technique that integrates superpixel classification, temporal and spatial coherence. We extensively testour gesture recognition and segmentation algorithms on public datasets and propose a new dataset shot with a wearable camera. In addition, we demonstrate that our solution can work in near real-time on a wearable device.


IEEE Transactions on Image Processing | 2013

Context-Dependent Logo Matching and Recognition

Hichem Sahbi; Lamberto Ballan; Giuseppe Serra; A. Del Bimbo

We contribute, through this paper, to the design of a novel variational framework able to match and recognize multiple instances of multiple reference logos in image archives. Reference logos and test images are seen as constellations of local features (interest points, regions, etc.) and matched by minimizing an energy function mixing: 1) a fidelity term that measures the quality of feature matching, 2) a neighborhood criterion that captures feature co-occurrence/geometry, and 3) a regularization term that controls the smoothness of the matching solution. We also introduce a detection/recognition procedure and study its theoretical consistency. Finally, we show the validity of our method through extensive experiments on the challenging MICC-Logos dataset. Our method overtakes, by 20%, baseline as well as state-of-the-art matching/recognition procedures.


international conference on acoustics, speech, and signal processing | 2010

Geometric tampering estimation by means of a SIFT-based forensic analysis

Irene Amerini; Lamberto Ballan; Roberto Caldelli; Alberto Del Bimbo; Giuseppe Serra

In many application scenarios digital images play a basic role and often it is important to assess if their content is realistic or has been manipulated to mislead watchers opinion. Image forensics tools provide answers to similar questions. This paper, in particular, focuses on the problem of detecting if a feigned image has been created by cloning an area of the image onto another zone to make a duplication or to cancel something awkward. The proposed method is based on SIFT features and allows both to understand which are the image points involved in the counterfeit attack and, furthermore, to recover the parameters of the geometric transformation. Experimental results are provided to witness the powerfulness of the proposed technique.


international conference on pattern recognition | 2016

A deep multi-level network for saliency prediction

Marcella Cornia; Lorenzo Baraldi; Giuseppe Serra; Rita Cucchiara

This paper presents a novel deep architecture for saliency prediction. Current state of the art models for saliency prediction employ Fully Convolutional networks that perform a non-linear combination of features extracted from the last convolutional layer to predict saliency maps. We propose an architecture which, instead, combines features extracted at different levels of a Convolutional Neural Network (CNN). Our model is composed of three main blocks: a feature extraction CNN, a feature encoding network, that weights low and high level feature maps, and a prior learning network. We compare our solution with state of the art saliency models on two public benchmarks datasets. Results show that our model outperforms under all evaluation metrics on the SALICON dataset, which is currently the largest public dataset for saliency prediction, and achieves competitive results on the MIT300 benchmark. Code is available at https://github.com/marcellacornia/mlnet.


acm multimedia | 2013

Hand segmentation for gesture recognition in EGO-vision

Giuseppe Serra; Marco Camurri; Lorenzo Baraldi; Michela Benedetti; Rita Cucchiara

Portable devices for first-person camera views will play a central role in future interactive systems. One necessary step for feasible human-computer guided activities is gesture recognition, preceded by a reliable hand segmentation from egocentric vision. In this work we provide a novel hand segmentation algorithm based on Random Forest superpixel classification that integrates light, time and space consistency. We also propose a gesture recognition method based Exemplar SVMs since it requires a only small set of positive sampels, hence it is well suitable for the egocentric video applications. Furthermore, this method is enhanced by using segmented images instead of full frames during test phase. Experimental results show that our hand segmentation algorithm outperforms the state-of-the-art approaches and improves the gesture recognition accuracy on both the publicly available EDSH dataset and our dataset designed for cultural heritage applications.


Multimedia Tools and Applications | 2010

Video event classification using string kernels

Lamberto Ballan; Marco Bertini; Alberto Del Bimbo; Giuseppe Serra

Event recognition is a crucial task to provide high-level semantic description of the video content. The bag-of-words (BoW) approach has proven to be successful for the categorization of objects and scenes in images, but it is unable to model temporal information between consecutive frames. In this paper we present a method to introduce temporal information for video event recognition within the BoW approach. Events are modeled as a sequence composed of histograms of visual features, computed from each frame using the traditional BoW. The sequences are treated as strings (phrases) where each histogram is considered as a character. Event classification of these sequences of variable length, depending on the duration of the video clips, are performed using SVM classifiers with a string kernel that uses the Needlemann-Wunsch edit distance. Experimental results, performed on two domains, soccer videos and a subset of TRECVID 2005 news videos, demonstrate the validity of the proposed approach.

Collaboration


Dive into the Giuseppe Serra's collaboration.

Top Co-Authors

Avatar

Rita Cucchiara

University of Modena and Reggio Emilia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Stefano Alletto

University of Modena and Reggio Emilia

View shared research outputs
Top Co-Authors

Avatar

Costantino Grana

University of Modena and Reggio Emilia

View shared research outputs
Top Co-Authors

Avatar

Lorenzo Baraldi

University of Modena and Reggio Emilia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Marcella Cornia

University of Modena and Reggio Emilia

View shared research outputs
Top Co-Authors

Avatar

Marco Manfredi

University of Modena and Reggio Emilia

View shared research outputs
Researchain Logo
Decentralizing Knowledge