Mohammed Javed | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Mohammed Javed is active.

Explore More

Publication

Featured researches published by Mohammed Javed.

asian conference on pattern recognition | 2013

Extraction of Projection Profile, Run-Histogram and Entropy Features Straight from Run-Length Compressed Text-Documents

Mohammed Javed; P. Nagabhushan; B. B. Chaudhuri

Document Image Analysis, like any Digital Image Analysis requires identification and extraction of proper features, which are generally extracted from uncompressed images, though in reality images are made available in compressed form for the reasons such as transmission and storage efficiency. However, this implies that the compressed image should be decompressed, which indents additional computing resources. This limitation induces the motivation to research in extracting features directly from the compressed image. In this research, we propose to extract essential features such as projection profile, run-histogram and entropy for text document analysis directly from run-length compressed text-documents. The experimentation illustrates that features are extracted directly from the compressed image without going through the stage of decompression, because of which the computing time is reduced. The feature values so extracted are exactly identical to those extracted from uncompressed images.

computer vision and pattern recognition | 2013

Extraction of line-word-character segments directly from run-length compressed printed text-documents

Mohammed Javed; P. Nagabhushan; B. B. Chaudhuri

Segmentation of a text-document into lines, words and characters, which is considered to be the crucial preprocessing stage in Optical Character Recognition (OCR) is traditionally carried out on uncompressed documents, although most of the documents in real life are available in compressed form, for the reasons such as transmission and storage efficiency. However, this implies that the compressed image should be decompressed, which indents additional computing resources. This limitation has motivated us to take up research in document image analysis using compressed documents. In this paper, we think in a new way to carry out segmentation at line, word and character level in run-length compressed printed-text-documents. We extract the horizontal projection profile curve from the compressed file and using the local minima points perform line segmentation. However, tracing vertical information which leads to tracking words-characters in a run-length compressed file is not very straight forward. Therefore, we propose a novel technique for carrying out simultaneous word and character segmentation by popping out column runs from each row in an intelligent sequence. The proposed algorithms have been validated with 1101 text-lines, 1409 words and 7582 characters from a data-set of 35 noise and skew free compressed documents of Bengali, Kannada and English Scripts.

International Journal of Multimedia Information Retrieval | 2016

An efficient method for video shot boundary detection and keyframe extraction using SIFT-point distribution histogram

Rachida Hannane; Abdessamad Elboushaki; Karim Afdel; P. Naghabhushan; Mohammed Javed

In today’s digital era, there are large volumes of long-duration videos resulting from movies, documentaries, sports and surveillance cameras floating over internet and video databases (YouTube). Since manual processing of these videos are difficult, time-consuming and expensive, an automatic technique of abstracting these long-duration videos are very much desirable. In this backdrop, this paper presents a novel and efficient approach of video shot boundary detection and keyframe extraction, which subsequently leads to a summarized and compact video. The proposed method detects video shot boundaries by extracting the SIFT-point distribution histogram (SIFT-PDH) from the frames as a combination of local and global features. In the subsequent step, using the distance of SIFT-PDH of consecutive frames and an adaptive threshold video shot boundaries are detected. Further, the keyframes representing the salient content of each segmented shot are extracted using entropy-based singular values measure. Thus, the summarized video is then generated by combining the extracted keyframes. The experimental results show that our method can efficiently detect shot boundaries under both abrupt and gradual transitions, and even under different levels of illumination, motion effects and camera operations (zoom in, zoom out and camera rotation). With the proposed method, the computational complexity is comparatively less and video summarization is very compact.

ieee international conference on signal and image processing | 2014

Entropy Computations of Document Images in Run-Length Compressed Domain

P. Nagabhushan; Mohammed Javed; B. B. Chaudhuri

Compression of documents, images, audios and videos have been traditionally practiced to increase the efficiency of data storage and transfer. However, in order to process or carry out any analytical computations, decompression has become an unavoidable pre-requisite. In this research work, we have attempted to compute the entropy, which is an important document analytic directly from the compressed documents. We use Conventional Entropy Quantifier (CEQ) and Spatial Entropy Quantifiers (SEQ) for entropy computations [1]. The entropies obtained are useful in applications like establishing equivalence, word spotting and document retrieval. Experiments have been performed with all the data sets of [1], at character, word and line levels taking compressed documents in run-length compressed domain. The algorithms developed are computational and space efficient, and results obtained match 100% with the results reported in [1].

international conference on document analysis and recognition | 2015

Automatic extraction of correlation-entropy features for text document analysis directly in run-length compressed domain

Mohammed Javed; P. Nagabhushan; B. B. Chaudhuri

Automatic feature extraction plays a pivotal role in defining the overall performance of any Document Image Analysis system, which conventionally operates directly over uncompressed images, although most of the real time systems such as fax machines, digital libraries and e-governance applications accrue and archive the documents in the compressed form for the sake of storage and transfer efficiencies. However, this infers that the compressed documents need to be decompressed before carrying out any operation or analysis which warrants additional computing resources. This limitation in existing systems instigates motivation to explore for feature extraction techniques directly from the compressed documents and eventually design a document analysis system that works directly in compressed domain. Therefore, this research work proposes to extract novel correlation-entropy features directly from run-length compressed TIFF documents. Further, the research work also investigates different methods to demonstrate some of the straight forward application of the proposed features in carrying out compressed document image analysis such as text and non-text component detection, and subsequently performing compressed text line segmentation and characterization, all carried out in the compressed version of the printed text document without going through the stage of decompression. Finally, the experimental results reported validate the developed algorithms and also illustrate that the proposed features are quite powerful in distinguishing compressed text and non-text components.

CVIP (1) | 2017

Spotting of keyword directly in run-length compressed documents

Mohammed Javed; P. Nagabhushan; B. B. Chaudhuri

With the rapid growth of digital libraries, e-governance and Internet applications, huge volume of documents are being generated, communicated and archived in the compressed form to provide better storage and transfer efficiencies. In such a large repository of compressed documents, the frequently used operations like keyword searching and document retrieval have to be carried out after decompression and subsequently with the help of an OCR. Therefore developing keyword spotting technique directly in compressed documents is a potential and challenging research issue. In this backdrop, the paper presents a novel approach for searching keywords directly in run-length compressed documents without going through the stages of decompression and OCRing. The proposed method extracts simple and straightforward font size invariant features like number of run transitions and correlation of runs over the selected regions of test words, and matches with that of the user queried word. In the subsequent step, based on the matching score, the keywords are spotted in the compressed document. The idea of decompression-less and OCR-less word spotting directly in compressed documents is the major contribution of this paper. The method is experimented on a data set of compressed documents and the preliminary results obtained validate the proposed idea.

international conference on document analysis and recognition | 2015

A direct approach for word and character segmentation in run-length compressed documents with an application to word spotting

Mohammed Javed; P. Nagabhushan; B. B. Chaudhuri

Segmentation of a text document into lines, words and characters is an important objective in application like OCR and related analytics. However in todays scenario, the documents are compressed for archival and transmission efficiency. Text segmentation in compressed documents warrants decompression, and needs additional computing resources. In this backdrop, the paper proposes a method for text segmentation directly in run-length compressed, printed English text documents. Line segmentation is done using the projection profile technique. Further segmentation into words and characters is accomplished by tracing the white runs along the base region of the text line. During the process, a run based region growing technique is applied in the spatial neighborhood of the white runs to trace the vertical space between the characters. After detecting the character spaces in the entire text line, the decision of word space and character space is made by computing the average character space. Subsequently based on the spatial position of the detected words and characters, their respective compressed segments are extracted. The proposed algorithm is tested with 1083 compressed text lines, and F-measure of 97.93% and 92.86% respectively for word and character segmentation are obtained. Finally an application of word spotting is also presented.

arXiv: Computer Vision and Pattern Recognition | 2014

Direct Processing of Document Images in Compressed Domain

Mohammed Javed; P. Nagabhushan; B. B. Chaudhuri

Abstract: With the rapid increase in the volume of Big data of this digital era, fax documents, invoices, receipts, etc are traditionally subjected to compression for the efficiency of data storage and transfer. However, in order to process these documents, they need to undergo the stage of decompression which indents additional computing resources. This limitation induces the motivation to research on the possibility of directly processing of compressed images. In this research paper, we summarize the research work carried out to perform different operations straight from run-length compressed documents without going through the stage of decompression. The different operations demonstrated are feature extraction; text-line, word and character segmentation; document block segmentation; and font size detection, all carried out in the compressed version of the document. Feature extraction methods demonstrate how to extract the conventionally defined features such as projection profile, run-histogram and entropy, directly from the compressed document data. Document segmentation involves the extraction of compressed segments of text-lines, words and characters using the vertical and horizontal projection profile features. Further an attempt is made to segment randomly a block of interest from the compressed document and subsequently facilitate absolute and relative characterization of the segmented block which finds real time applications in automatic processing of Bank Cheques, Challans, etc, in compressed domain. Finally an application to detect font size at text line level is also investigated. All the proposed algorithms are validated experimentally with sufficient data set of compressed documents.

arXiv: Computer Vision and Pattern Recognition | 2014