Debasis Chakraborty
Murshidabad College of Engineering & Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Debasis Chakraborty.
Pattern Recognition | 2011
Ujjwal Maulik; Debasis Chakraborty
In this article, we present a semisupervised support vector machine that uses self-training approach. We then construct an ensemble of semisupervised SVM classifiers to address the problem of pixel classification of remote sensing images. Semisupervised support vector machines (S^3VMs) are based on applying the margin maximization principle to both labeled and unlabeled samples. The ensemble of SVM classifiers recognizes the conceptual similarity between component classifiers from the same data source. The effectiveness of the proposed technique is first demonstrated for two numeric remote sensing data described in terms of feature vectors and then identifying different land cover regions in remote sensing imagery. Experimental results on these datasets show that employing this learning scheme can increase the accuracy level. The performance of the ensemble is compared with one of its component classifier and conventional SVM in terms of accuracy and quantitative cluster validity indices.
IEEE Transactions on Biomedical Engineering | 2013
Ujjwal Maulik; Anirban Mukhopadhyay; Debasis Chakraborty
With the advancement of microarray technology, gene expression profiling has shown great potential in outcome prediction for different types of cancers. Microarray cancer data, organized as samples versus genes fashion, are being exploited for the classification of tissue samples into benign and malignant or their subtypes. They are also useful for identifying potential gene markers for each cancer subtype, which helps in successful diagnosis of particular cancer type. Nevertheless, small sample size remains a bottleneck to design suitable classifiers. Traditional supervised classifiers can only work with labeled data. On the other hand, a large number of microarray data that do not have adequate follow-up information are disregarded. A novel approach to combine feature (gene) selection and transductive support vector machine (TSVM) is proposed. We demonstrated that 1) potential gene markers could be identified and 2) TSVMs improved prediction accuracy as compared to the standard inductive SVMs (ISVMs). A forward greedy search algorithm based on consistency and a statistic called signal-to-noise ratio were employed to obtain the potential gene markers. The selected genes of the microarray data were then exploited to design the TSVM. Experimental results confirm the effectiveness of the proposed technique compared to the ISVM and low-density separation method in the area of semisupervised cancer classification as well as gene-marker identification.
IEEE Transactions on Nanobioscience | 2014
Ujjwal Maulik; Debasis Chakraborty
DNA microarray data now permit scientists to screen thousand of genes simultaneously and determine whether those genes are active or silent in normal and cancerous tissues. With the advancement of microarray technology, new analytical methods must be developed to find out whether microarray data have discriminative signatures of gene expression over normal or cancerous tissues. In this paper, we attempt a prediction scheme that combines fuzzy preference based rough set (FPRS) method for feature (gene) selection with semisupervised SVMs. To show the effectiveness of the proposed approach, we compare the performance of this technique with the signal-to-noise ratio (SNR) and consistency based feature selection (CBFS) methods. Using six benchmark gene microarray datasets (including both binary and multi-class classification problems), we demonstrate experimentally that our proposed scheme can achieve significant empirical success and is biologically relevant for cancer diagnosis and drug discovery.
IEEE Geoscience and Remote Sensing Magazine | 2017
Ujjwal Maulik; Debasis Chakraborty
Land-cover mapping in remote sensing (RS) applications renders rich information for decision support and environmental monitoring systems. The derivation of such information increasingly relies on robust classification methods for identifying the complex land-cover area of different categories. Numerous classification techniques have been designed for the analysis of RS imagery. In this context, support vector machines (SVMs) have recently received increasing interest. However, the need for a small-size training set remains a bottleneck to design efficient supervised classifiers, while an adequate number of unlabeled data is readily available in RS images and can be exploited as a supplementary source of information. To fully leverage these precious unlabeled data, a number of promising advanced SVM-based methods, such as active SVMs, semisupervised SVMs (S3VMs), and SVMs combined with other algorithms, have been developed to analyze satellite imagery. In this literature review, we have surveyed these learning techniques to explore RS images. Moreover, we have provided the empirical evidences of SVMs and three representative techniques. It is our hope that this review will provide guidelines to future researchers to enhance further algorithmic developments in RS applications.
IEEE Journal of Translational Engineering in Health and Medicine | 2014
Debasis Chakraborty; Ujjwal Maulik
Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA (miRNA) or gene-expression profiling experiment, the expression levels of thousands of genes/miRNAs are simultaneously monitored to study the effects of certain treatments, diseases, and developmental stages on their expressions. Microarray-based gene expression profiling can be used to identify genes, whose expressions are changed in response to pathogens or other organisms by comparing gene expression in infected to that in uninfected cells or tissues. Recent studies have revealed that patterns of altered microarray expression profiles in cancer can serve as molecular biomarkers for tumor diagnosis, prognosis of disease-specific outcomes, and prediction of therapeutic responses. Microarray data sets containing expression profiles of a number of miRNAs or genes are used to identify biomarkers, which have dysregulation in normal and malignant tissues. However, small sample size remains a bottleneck to design successful classification methods. On the other hand, adequate number of microarray data that do not have clinical knowledge can be employed as additional source of information. In this paper, a combination of kernelized fuzzy rough set (KFRS) and semisupervised support vector machine (S3VM) is proposed for predicting cancer biomarkers from one miRNA and three gene expression data sets. Biomarkers are discovered employing three feature selection methods, including KFRS. The effectiveness of the proposed KFRS and S3VM combination on the microarray data sets is demonstrated, and the cancer biomarkers identified from miRNA data are reported. Furthermore, biological significance tests are conducted for miRNA cancer biomarkers.Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA (miRNA) or gene-expression profiling experiment, the expression levels of thousands of genes/miRNAs are simultaneously monitored to study the effects of certain treatments, diseases, and developmental stages on their expressions. Microarray-based gene expression profiling can be used to identify genes, whose expressions are changed in response to pathogens or other organisms by comparing gene expression in infected to that in uninfected cells or tissues. Recent studies have revealed that patterns of altered microarray expression profiles in cancer can serve as molecular biomarkers for tumor diagnosis, prognosis of disease-specific outcomes, and prediction of therapeutic responses. Microarray data sets containing expression profiles of a number of miRNAs or genes are used to identify biomarkers, which have dysregulation in normal and malignant tissues. However, small sample size remains a bottleneck to design successful classification methods. On the other hand, adequate number of microarray data that do not have clinical knowledge can be employed as additional source of information. In this paper, a combination of kernelized fuzzy rough set (KFRS) and semisupervised support vector machine (S3VM) is proposed for predicting cancer biomarkers from one miRNA and three gene expression data sets. Biomarkers are discovered employing three feature selection methods, including KFRS. The effectiveness of the proposed KFRS and S3VM combination on the microarray data sets is demonstrated, and the cancer biomarkers identified from miRNA data are reported. Furthermore, biological significance tests are conducted for miRNA cancer biomarkers.
Fundamenta Informaticae | 2010
Ujjwal Maulik; Debasis Chakraborty
Satellite image classification is a complex process that may be affected by many factors. This article addresses the problem of pixel classification of satellite images by a robust multiple classifier system that combines k-NN, support vector machine (SVM) and incremental learning algorithm (IL). The effectiveness of this combination is investigated for satellite imagery which usually have overlapping class boundaries. These classifiers are initially designed using a small set of labeled points. Combination of these algorithms has been done based on majority voting rule. The effectiveness of the proposed technique is first demonstrated for a numeric remote sensing data described in terms of feature vectors and then identifying different land cover regions in remote sensing imagery. Experimental results on numeric data as well as two remote sensing data show that employing combination of classifiers can effectively increase the accuracy label. Comparison is made with each of these single classifiers in terms of kappa value, accuracy, cluster quality indices and visual quality of the classified images.
2015 International Conference on Condition Assessment Techniques in Electrical Systems (CATCON) | 2015
Shemim Begum; Debasis Chakraborty; Ram Sarkar
Ensemble classification, which is the combination of result of a set of base learner has achieved much priority in machine learning theory. It has explored enough prospective in improving the empirical performance. There are very little bit research in Support Vector Machines (SVMs) ensemble in contrast to Neural Network or Decision Tree ensemble. To bridge this gap we analyse and compare SVM ensemble (ADASVM) with K-Nearest Neighbour (KNN) and SVM classifiers. Leukemia dataset is used as benchmark to evaluate and compare the performances of ADASVM with KNN and SVM classifiers.
international conference on computational intelligence and communication networks | 2015
Shemim Begum; Debasis Chakraborty; Ram Sarkar
The k Nearest Neighbour (kNN) method is one of the most popular algorithm in clustering and data classification. The kNN algorithm founds to be performed very efficient in the experiments on different dataset. In this paper, we focus on the classification problem. The algorithm is experienced over Leukemia dataset. Initially three feature selection algorithm Consistency Based Feature Selection (CBFS), Fuzzy Preference Based Rough Set (FPRS) and Kernelized Fuzzy Rough Set (KFRS) is applied on the dataset and then kNN is applied as a classifier onto the dataset. The results of our experiment demonstrates that CBFS algorithm generally perform better than other two KFRS and FPRS algorithm respectively.
international conference on emerging applications of information technology | 2012
Debasis Chakraborty; Shibu Das
With the advancement of microarray technology, gene expression profiling has shown great potential in outcome prediction for different types of cancers. They are also useful for identifying potential gene markers for each cancer subtype, which helps in successful diagnosis of particular cancer types. Traditional supervised classifiers can only work with labeled data. Consequently, a large number of microarray data that do not have adequate follow-up information are disregarded. A Novel approach to combine feature (gene) selection and transductive SVM (TSVM) has been proposed. The selected genes of the microarray data are then exploited to design the transductive SVM. Experimental results confirm the effectiveness of the proposed method in the area of semisupervised cancer classification as well as gene marker identification.
ieee international conference on control measurement and instrumentation | 2016
Shemim Begum; Debasis Chakraborty; Ram Sarkar
The disease Leukemia are continuously increasing among the people. The cause of leukemia is unknown but several factors, however are associated with the development of leukemia that are exposure to ionising radiation, exposure to benzene in rubber industry workers, cytotoxic drug particularly alkylating agent exposure, genetic disorder like down syndrome and immunological deficiency states. There are certain medical diagnosis that can reduce the burden on doctors. This paper explores some algorithms for the classification of Leukemia patients dataset. The algorithms considered here are Naive Bayes (NB), K Nearest Neighbour (KNN) and Support Vector Machines (SVMs).