Panagiotis Sidiropoulos
Aristotle University of Thessaloniki
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Panagiotis Sidiropoulos.
IEEE Transactions on Circuits and Systems for Video Technology | 2011
Panagiotis Sidiropoulos; Vasileios Mezaris; Ioannis Kompatsiaris; Hugo Meinedo; Miguel Bugalho; Isabel Trancoso
In this paper, a novel approach to video temporal decomposition into semantic units, termed scenes, is presented. In contrast to previous temporal segmentation approaches that employ mostly low-level visual or audiovisual features, we introduce a technique that jointly exploits low-level and high-level features automatically extracted from the visual and the auditory channel. This technique is built upon the well-known method of the scene transition graph (STG), first by introducing a new STG approximation that features reduced computational cost, and then by extending the unimodal STG-based temporal segmentation technique to a method for multimodal scene segmentation. The latter exploits, among others, the results of a large number of TRECVID-type trained visual concept detectors and audio event detectors, and is based on a probabilistic merging process that combines multiple individual STGs while at the same time diminishing the need for selecting and fine-tuning several STG construction parameters. The proposed approach is evaluated on three test datasets, comprising TRECVID documentary films, movies, and news-related videos, respectively. The experimental results demonstrate the improved performance of the proposed approach in comparison to other unimodal and multimodal techniques of the relevant literature and highlight the contribution of high-level audiovisual features toward improved video segmentation to scenes.
Pattern Recognition | 2011
Panagiotis Sidiropoulos; Stefanos Vrochidis; Ioannis Kompatsiaris
This paper proposes a method for binary image retrieval, where the black-and-white image is represented by a novel feature named the adaptive hierarchical density histogram, which exploits the distribution of the image points on a two-dimensional area. This adaptive hierarchical decomposition technique employs the estimation of point density histograms of image regions, which are determined by a pyramidal grid that is recursively updated through the calculation of image geometric centroids. The extracted descriptor combines global and local properties and can be used in variant types of binary image databases. The validity of the introduced method, which demonstrates high accuracy, low computational cost and scalability, is both theoretically and experimentally shown, while comparison with several other prevailing approaches demonstrates its performance.
workshop on image analysis for multimedia interactive services | 2010
Panagiotis Sidiropoulos; Vasileios Mezaris; Ioannis Kompatsiaris; Hugo Meinedo; Miguel Bugalho; Isabel Trancoso
This work deals with the problem of automatic temporal segmentation of a video into elementary semantic units known as scenes. Its novelty lies in the use of high-level audio information in the form of audio events for the improvement of scene segmentation performance. More specifically, the proposed technique is built upon a recently proposed audio-visual scene segmentation approach that involves the construction of multiple scene transition graphs (STGs) that separately exploit information coming from different modalities. In the extension of the latter approach presented in this work, audio event detection results are introduced to the definition of an audio-based scene transition graph, while a visual-based scene transition graph is also defined independently. The results of these two types of STGs are subsequently combined. The application of the proposed technique to broadcast videos demonstrates the usefulness of audio events for scene segmentation.
acm multimedia | 2009
Panagiotis Sidiropoulos; Vasileios Mezaris; Ioannis Kompatsiaris; Hugo Meinedo; Isabel Trancoso
In this work the problem of automatic decomposition of video into elementary semantic units, known in the literature as scenes, is addressed. Two multi-modal automatic scene segmentation techniques are proposed, both building upon the Scene Transition Graph (STG). In the first of the proposed approaches, speaker diarization results are used for introducing a post-processing step to the STG construction algorithm, with the objective of discarding scene boundaries erroneously identified according to visual-only dissimilarity. In the second approach, speaker diarization and additional audio analysis results are employed and a separate audio-based STG is constructed, in parallel to the original STG based on visual information. The two STGs are subsequently combined. Preliminary results from the application of the proposed techniques to broadcast videos reveal their improved performance over previous approaches.
international conference on image processing | 2013
Panagiotis Sidiropoulos; Vasileios Mezaris; Ioannis Kompatsiaris
In this work we deal with the problem of video concept detection, for the purpose of using the detection results towards more effective concept-based video retrieval. In order to handle this task, we propose using spatio-temporal video slices, called video tomographs, in the same way that visual keyframes are typically used in traditional keyframe-based video concept detection schemes. Video tomographs capture in a compact way motion patterns that are present in the video, and are used in this work for training a number of base detectors. The latter augment the set of keyframe-based base detectors that can be trained on different image representations. Combining the keyframe-based and tomograph-based detectors, improved concept detection accuracy can be achieved. The proposed approach is evaluated on a dataset that is extensive both in terms of video duration and concept variation. The experimental results manifest the merit of the proposed approach.
international conference on multimedia and expo | 2006
Spyros Nikolopoulos; Stefanos Zafeiriou; Panagiotis Sidiropoulos; Nikos Nikolaidis; Ioannis Pitas
In this paper a novel system for image replica detection is presented. The system uses color-based descriptors in order to extract robust features for image representation. These features are used for indexing the images in a database using an R-tree. When a query about whether a test image is a replica of an image in the database is submitted, the R-tree is traversed and a set of candidate images is retrieved. Then, in order to obtain a single result and at the same time reduce the number of decision errors the system is enhanced with linear discriminant analysis (LDA). The conducted experiments show that the proposed approach is very promising
ieee international conference semantic computing | 2011
Vasileios Mezaris; Panagiotis Sidiropoulos; Ioannis Kompatsiaris
In this work the contribution of automatically-extracted (thus, imperfect) video structural semantics towards improving interactive video retrieval is examined. First, the automatic extraction of video structural semantics, i.e. the decomposition of the video into scenes that correspond to the different sub-stories or high-level events, is performed. Then, these are introduced to the interactive video retrieval paradigm. Finally, their potential contribution is experimentally evaluated. To this end, different members of a family of scene segmentation algorithms are applied to an extensive professional video collection coming from the TRECVID benchmarking activity, subsequently, a large number of user interactions with a retrieval system that exploits these structural semantics is simulated. The experimental results document the contribution of state-of-the-art automatically-extracted video structural semantics to the efficient and effective interactive video retrieval.
content based multimedia indexing | 2010
Panagiotis Sidiropoulos; Stefanos Vrochidis; Ioannis Kompatsiaris
This paper proposes a novel binary image descriptor, namely the Adaptive Hierarchical Density Histogram, that can be utilized for complex binary image retrieval. This novel descriptor exploits the distribution of the image points on a two-dimensional area. To reflect effectively this distribution, we propose an adaptive pyramidal decomposition of the image into non-overlapping rectangular regions and the extraction of the density histogram of each region. This hierarchical decomposition algorithm is based on the recursive calculation of geometric centroids. The presented technique is experimentally shown to combine efficient performance, low computational cost and scalability. Comparison with other prevailing approaches demonstrates its high potential.
international workshop on machine learning for signal processing | 2012
Panagiotis Sidiropoulos; Vasileios Mezaris; Ioannis Kompatsiaris
In this work the problem of how to evaluate video scene segmentation results is examined. The evaluation, which is typically conducted by comparison of the experimental output of scene segmentation algorithms with a ground-truth temporal decomposition, often suffers from ambiguity in the definition of the ground truth. To alleviate this drawback the use of a string comparison measure, called differential edit distance (DED), is proposed. After defining video scene segmentation evaluation as a string comparison problem, the proposed measure is applied to limit the effect of scene segmentation ambiguity in the performance estimation uncertainty. The experimental results, which include comparisons with state of the art evaluation measures, demonstrate the ambiguity extent and verify the validity of the conducted analysis.
World Patent Information | 2010
Stefanos Vrochidis; Symeon Papadopoulos; Anastasia Moumtzidou; Panagiotis Sidiropoulos; Emanuelle Pianta; Ioannis Kompatsiaris