Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Basilis Gatos is active.

Publication


Featured researches published by Basilis Gatos.


international conference on document analysis and recognition | 2009

ICDAR 2009 Handwriting Segmentation Contest

Nikolaos Stamatopoulos; Basilis Gatos; Georgios Louloudis; Umapada Pal; Alireza Alaei

This paper presents the results of the Handwriting Segmentation Contest that was organized in the context of the ICDAR2013. The general objective of the contest was to use well established evaluation practices and procedures to record recent advances in off-line handwriting segmentation. Two benchmarking datasets, one for text line and one for word segmentation, were created in order to test and compare all submitted algorithms as well as some state-of-the-art methods for handwritten document image segmentation in realistic circumstances. Handwritten document images were produced by many writers in two Latin based languages (English and Greek) and in one Indian language (Bangla, the second most popular language in India). These images were manually annotated in order to produce the ground truth which corresponds to the correct text line and word segmentation results. The datasets of previously organized contests (ICDAR2007, ICDAR2009 and ICFHR2010 Handwriting Segmentation Contests) along with a dataset of Bangla document images were used as training dataset. Eleven methods are submitted in this competition. A brief description of the submitted algorithms, the evaluation criteria and the segmentation results obtained from the submitted methods are also provided in this manuscript.


Pattern Recognition Letters | 2014

A combined approach for the binarization of handwritten document images

Konstantinos Ntirogiannis; Basilis Gatos; Ioannis Pratikakis

There are many challenges addressed in handwritten document image binarization, such as faint characters, bleed-through and large background ink stains. Usually, binarization methods cannot deal with all the degradation types effectively. Motivated by the low detection rate of faint characters in binarization of handwritten document images, a combination of a global and a local adaptive binarization method at connected component level is proposed that aims in an improved overall performance. Initially, background estimation is applied along with image normalization based on background compensation. Afterwards, global binarization is performed on the normalized image. In the binarized image very small components are discarded and representative characteristics of a document image such as the stroke width and the contrast are computed. Furthermore, local adaptive binarization is performed on the normalized image taking into account the aforementioned characteristics. Finally, the two binarization outputs are combined at connected component level. Our method achieves top performance after extensive testing on the DIBCO (Document Image Binarization Contest) series datasets which include a variety of degraded handwritten document images.


Pattern Recognition | 2014

Distinction between handwritten and machine-printed text based on the bag of visual words model

Konstantinos Zagoris; Ioannis Pratikakis; Apostolos Antonacopoulos; Basilis Gatos; Nikos Papamarkos

In a variety of documents, ranging from forms to archive documents and books with annotations, machine printed and handwritten text may coexist in the same document image, raising significant issues within the recognition pipeline. It is, therefore, necessary to separate the two types of text so that it becomes feasible to apply different recognition methodologies to each modality. In this paper, a new approach is proposed which strives towards identifying and separating handwritten from machine printed text using the Bag of Visual Words model (BoVW). Initially, blocks of interest are detected in the document image. For each block, a descriptor is calculated based on the BoVW. The final characterization of the blocks as Handwritten, Machine Printed or Noise is made by a decision scheme which relies upon the combination of binary SVM classifiers. The promising performance of the proposed approach is shown by using a consistent evaluation methodology which couples meaningful measures along with new datasets dedicated to the problem upon consideration. HighlightsSeparating handwritten from machine printed text using the BoVW model.Exploring various local features and weighting types.A decision scheme which relies upon the combination of binary SVM classifiers.Three distinct databases are provided for future evaluations.Each database contains different machine/handwritten separation scenarios.


international conference on frontiers in handwriting recognition | 2014

ICFHR 2014 Competition on Handwritten Keyword Spotting (H-KWS 2014)

Ioannis Pratikakis; Konstantinos Zagoris; Basilis Gatos; Georgios Louloudis; Nikolaos Stamatopoulos

H-KWS 2014 is the Handwritten Keyword Spotting Competition organized in conjunction with ICFHR 2014 conference. The main objective of the competition is to record current advances in keyword spotting algorithms using established performance evaluation measures frequently encountered in the information retrieval literature. The competition comprises two distinct tracks, namely, a segmentation-based and a segmentation-free track. Five (5) distinct research groups have participated in the competition with three (3) methods for the segmentation-based track and four (4) methods for the segmentation-free track. The benchmarking datasets that were used in the contest contain both historical and modern documents from multiple writers. In this paper, the contest details are reported including the evaluation measures and the performance of the submitted methods along with a short description of each method.


Pattern Recognition | 2017

A survey of document image word spotting techniques

Angelos P. Giotis; Giorgos Sfikas; Basilis Gatos; Christophoros Nikou

This work reviews the word spotting methods for document indexing.The nature of texts addressed by word spotting techniques is analyzed.The core steps that compose a word spotting system are thoroughly explored.Several boosting mechanisms which enhance the retrieved results are examined.Results achieved by the state of the art imply that there are still goals to be reached. Vast collections of documents available in image format need to be indexed for information retrieval purposes. In this framework, word spotting is an alternative solution to optical character recognition (OCR), which is rather inefficient for recognizing text of degraded quality and unknown fonts usually appearing in printed text, or writing style variations in handwritten documents. Over the past decade there has been a growing interest in addressing document indexing using word spotting which is reflected by the continuously increasing number of approaches. However, there exist very few comprehensive studies which analyze the various aspects of a word spotting system. This work aims to review the recent approaches as well as fill the gaps in several topics with respect to the related works. The nature of texts and inherent challenges addressed by word spotting methods are thoroughly examined. After presenting the core steps which compose a word spotting system, we investigate the use of retrieval enhancement techniques based on relevance feedback which improve the retrieved results. Finally, we present the datasets which are widely used for word spotting, we describe the evaluation standards and measures applied for performance assessment and discuss the results achieved by the state of the art.


international conference on document analysis and recognition | 2009

A Novel Feature Extraction and Classification Methodology for the Recognition of Historical Documents

Georgios Vamvakas; Basilis Gatos; Stavros J. Perantoni

In this paper, we present a methodology for off-line character recognition that mainly focuses on handling the difficult cases of historical fonts and styles. The proposed methodology relies on a new feature extraction technique based on recursive subdivisions of the image as well as on calculation of the centre of masses of each sub-image with sub-pixel accuracy. Feature extraction is followed by a hierarchical classification scheme based on the level of granularity of the feature extraction method. Pairs of classes with high values in the confusion matrix are merged at a certain level and higher level granularity features are employed for distinguishing them. Several historical documents were used in order to demonstrate the efficiency of the proposed technique.


international conference on frontiers in handwriting recognition | 2016

ICFHR2016 Handwritten Document Image Binarization Contest (H-DIBCO 2016)

Ioannis Pratikakis; Konstantinos Zagoris; George Barlas; Basilis Gatos

H-DIBCO 2016 is the international Handwritten Document Image Binarization Contest organized in the context of ICFHR 2016 conference. The general objective of the contest is to identify current advances in document image binarization of handwritten document images using performance evaluation measures that are motivated by document image analysis and recognition requirements. This paper describes the contest details including the evaluation measures used as well as the performance of the 12 submitted methods along with a brief description of each method.


international conference on document analysis and recognition | 2015

GRPOLY-DB: An old Greek polytonic document image database

Basilis Gatos; Nikolaos Stamatopoulos; Georgios Louloudis; Giorgos Sfikas; George Retsinas; Vassilis Papavassiliou; Fotini Sunistira; Vassilios Katsouros

Recognition of old Greek document images containing polytonic (multi accent) characters is a challenging task due to the large number of existing character classes (more than 270) which cannot be handled sufficiently by current OCR technologies. Taking into account that the Greek polytonic system was used from the late antiquity until recently, a large amount of scanned Greek documents still remains without full test search capabilities. In order to assist the progress of relevant research, this paper introduces the first publicly available old Greek polytonic database GRPOLY-DB for the evaluation of several document image processing tasks. It contains both machine-printed and handwritten documents as well as annotation with ground-truth information that can be used for training and evaluation of the most commou document image processing tasks, i.e.. text line and word segmentation, test recognition, isolated character recognition and word spotting. Results using several representative baseline technologies are also presented in order to help researchers evaluate their methods and advance the frontiers of old Greek document image recognition and word spotting.


international conference on frontiers in handwriting recognition | 2014

Segmentation-Based Historical Handwritten Word Spotting Using Document-Specific Local Features

Konstantinos Zagoris; Ioannis Pratikakis; Basilis Gatos

Many word spotting strategies for the modern documents are not directly applicable to historical handwritten documents due to writing styles variety and intense degradation. In this paper, a new method that permits effective word spotting in handwritten documents is presented that relies upon document-specific local features which take into account texture information around representative key points. Experimental work on two historical handwritten datasets using standard evaluation measures shows the improved performance achieved by the proposed methodology.


international conference on frontiers in handwriting recognition | 2014

Segmentation of Historical Handwritten Documents into Text Zones and Text Lines

Basilis Gatos; Georgios Louloudis; Nikolaos Stamatopoulos

In order to achieve accurate text recognition performance for historical handwritten document images, robust and efficient page segmentation is necessary. In this paper, we propose a text zone detection followed by a text line segmentation method suitable for historical handwritten documents. Our aim is to handle several challenging cases such as horizontal and vertical rule lines overlapping with the text, two column documents and characters of different text lines touching vertically. For text zone detection, we analyze vertical rule lines, connected components as well as vertical white runs while for text line segmentation, we enhance an existing approach based on Hough transform in order to better treat cases of vertical connected characters. Both methods have been proved very promising after an evaluation using a set of historical handwritten documents.

Collaboration


Dive into the Basilis Gatos's collaboration.

Top Co-Authors

Avatar

Georgios Louloudis

National and Kapodistrian University of Athens

View shared research outputs
Top Co-Authors

Avatar

Nikolaos Stamatopoulos

National and Kapodistrian University of Athens

View shared research outputs
Top Co-Authors

Avatar

Ioannis Pratikakis

Democritus University of Thrace

View shared research outputs
Top Co-Authors

Avatar

Giorgos Sfikas

University of Strasbourg

View shared research outputs
Top Co-Authors

Avatar

George Retsinas

National Technical University of Athens

View shared research outputs
Top Co-Authors

Avatar

Konstantinos Zagoris

Democritus University of Thrace

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Vassilis Papavassiliou

National Technical University of Athens

View shared research outputs
Researchain Logo
Decentralizing Knowledge