Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Subhadeep Mukhopadhyay is active.

Publication


Featured researches published by Subhadeep Mukhopadhyay.


Electronic Journal of Statistics | 2017

Large-scale mode identification and data-driven sciences

Subhadeep Mukhopadhyay

Bump-hunting or mode identification is a fundamental problem that arises in almost every scientific field of data-driven discovery. Surprisingly, very few data modeling tools are available for automatic (not requiring manual case-by-base investigation), objective (not subjective), and nonparametric (not based on restrictive parametric model assumptions) mode discovery, which can scale to large data sets. This article introduces LPMode--an algorithm based on a new theory for detecting multimodality of a probability density. We apply LPMode to answer important research questions arising in various fields from environmental science, ecology, econometrics, analytical chemistry to astronomy and cancer genomics.


Scientific Reports | 2018

Generalized Empirical Bayes Modeling via Frequentist Goodness of Fit

Subhadeep Mukhopadhyay; Douglas Fletcher

The two key issues of modern Bayesian statistics are: (i) establishing principled approach for distilling statistical prior that is consistent with the given data from an initial believable scientific prior; and (ii) development of a consolidated Bayes-frequentist data analysis workflow that is more effective than either of the two separately. In this paper, we propose the idea of “Bayes via goodness-of-fit” as a framework for exploring these fundamental questions, in a way that is general enough to embrace almost all of the familiar probability models. Several examples, spanning application areas such as clinical trials, metrology, insurance, medicine, and ecology show the unique benefit of this new point of view as a practical data science tool.


Machine Learning | 2018

LPiTrack: Eye movement pattern recognition algorithm and application to biometric identification

Subhadeep Mukhopadhyay; Shinjini Nandi

A comprehensive nonparametric statistical learning framework, called LPiTrack, is introduced for large-scale eye-movement pattern discovery. The foundation of our data-compression scheme is based on a new Karhunen–Loéve-type representation of the stochastic process in Hilbert space by specially designed orthonormal polynomial expansions. We apply this novel nonlinear transformation-based statistical data-processing algorithm to extract temporal-spatial-static characteristics from eye-movement trajectory data in an automated, robust way for biometric authentication. This is a significant step towards designing a next-generation gaze-based biometric identification system. We elucidate the essential components of our algorithm through data from the second Eye Movements Verification and Identification Competition, organized as a part of the 2014 International Joint Conference on Biometrics.


Journal of Nonparametric Statistics | 2018

Decentralized nonparametric multiple testing

Subhadeep Mukhopadhyay

ABSTRACT Consider a big data multiple testing task, where, due to storage and computational bottlenecks, one is given a very large collection of p-values by splitting into manageable chunks and distributing over thousands of computer nodes. This paper is concerned with the following question: How can we find the full data multiple testing solution by operating completely independently on individual machines in parallel, without any data exchange between nodes? This version of the problem tends naturally to arise in a wide range of data-intensive science and industry applications whose methodological solution has not appeared in the literature to date; therefore, we feel it is necessary to undertake such analysis. Based on the nonparametric functional statistical viewpoint of large-scale inference, started in Mukhopadhyay, S. [(2016), ‘Large Scale Signal Detection: A Unifying View’, Biometrics, 72, 325–334], this paper furnishes a new computing model that brings unexpected simplicity to the design of the algorithm which might otherwise seem daunting using classical approach and notations.


BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING: The 29th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering | 2009

Bayesian Analysis of High Dimensional Classification

Subhadeep Mukhopadhyay; Faming Liang

Modern data mining and bioinformatics have presented an important playground for statistical learning techniques, where the number of input variables is possibly much larger than the sample size of the training data. In supervised learning, logistic regression or probit regression can be used to model a binary output and form perceptron classification rules based on Bayesian inference. In these cases , there is a lot of interest in searching for sparse model in High Dimensional regression(/classification) setup. we first discuss two common challenges for analyzing high dimensional data. The first one is the curse of dimensionality. The complexity of many existing algorithms scale exponentially with the dimensionality of the space and by virtue of that algorithms soon become computationally intractable and therefore inapplicable in many real applications. secondly, multicollinearities among the predictors which severely slowdown the algorithm. In order to make Bayesian analysis operational in high dimensio...


arXiv: Statistics Theory | 2012

MODELING, DEPENDENCE, CLASSIFICATION, UNITED STATISTICAL SCIENCE, MANY CULTURES

Emanuel Parzen; Subhadeep Mukhopadhyay


arXiv: Statistics Theory | 2013

United Statistical Algorithm, Small and Big Data: Future OF Statistician

Emanuel Parzen; Subhadeep Mukhopadhyay


Archive | 2013

Nonparametric inference for high dimensional data

Subhadeep Mukhopadhyay


Biometrics | 2016

Large-scale signal detection: A unified perspective.

Subhadeep Mukhopadhyay


arXiv: Statistics Theory | 2013

LP Mixed Data Science : Outline of Theory

Emanuel Parzen; Subhadeep Mukhopadhyay

Collaboration


Dive into the Subhadeep Mukhopadhyay's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Anil K. Ghosh

Indian Statistical Institute

View shared research outputs
Researchain Logo
Decentralizing Knowledge