Ilia Nouretdinov
Royal Holloway, University of London
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Ilia Nouretdinov.
NeuroImage | 2011
Ilia Nouretdinov; Sergi G. Costafreda; Alexander Gammerman; Alexey Ya. Chervonenkis; Vladimir Vovk; Vladimir Vapnik; Cynthia H.Y. Fu
There is rapidly accumulating evidence that the application of machine learning classification to neuroimaging measurements may be valuable for the development of diagnostic and prognostic prediction tools in psychiatry. However, current methods do not produce a measure of the reliability of the predictions. Knowing the risk of the error associated with a given prediction is essential for the development of neuroimaging-based clinical tools. We propose a general probabilistic classification method to produce measures of confidence for magnetic resonance imaging (MRI) data. We describe the application of transductive conformal predictor (TCP) to MRI images. TCP generates the most likely prediction and a valid measure of confidence, as well as the set of all possible predictions for a given confidence level. We present the theoretical motivation for TCP, and we have applied TCP to structural and functional MRI data in patients and healthy controls to investigate diagnostic and prognostic prediction in depression. We verify that TCP predictions are as accurate as those obtained with more standard machine learning methods, such as support vector machine, while providing the additional benefit of a valid measure of confidence for each prediction.
Clinical Chemistry | 2010
John F. Timms; Rainer Cramer; Stephane Camuzeaux; Ali Tiss; Celia Smith; Brian Burford; Ilia Nouretdinov; Dmitry Devetyarov; Aleksandra Gentry-Maharaj; Jeremy Ford; Zhiyuan Luo; Alexander Gammerman; Usha Menon; Ian Jacobs
BACKGROUND The serum peptidome may be a valuable source of diagnostic cancer biomarkers. Previous mass spectrometry (MS) studies have suggested that groups of related peptides discriminatory for different cancer types are generated ex vivo from abundant serum proteins by tumor-specific exopeptidases. We tested 2 complementary serum profiling strategies to see if similar peptides could be found that discriminate ovarian cancer from benign cases and healthy controls. METHODS We subjected identically collected and processed serum samples from healthy volunteers and patients to automated polypeptide extraction on octadecylsilane-coated magnetic beads and separately on ZipTips before MALDI-TOF MS profiling at 2 centers. The 2 platforms were compared and case control profiling data analyzed to find altered MS peak intensities. We tested models built from training datasets for both methods for their ability to classify a blinded test set. RESULTS Both profiling platforms had CVs of approximately 15% and could be applied for high-throughput analysis of clinical samples. The 2 methods generated overlapping peptide profiles, with some differences in peak intensity in different mass regions. In cross-validation, models from training data gave diagnostic accuracies up to 87% for discriminating malignant ovarian cancer from healthy controls and up to 81% for discriminating malignant from benign samples. Diagnostic accuracies up to 71% (malignant vs healthy) and up to 65% (malignant vs benign) were obtained when the models were validated on the blinded test set. CONCLUSIONS For ovarian cancer, altered MALDI-TOF MS peptide profiles alone cannot be used for accurate diagnoses.
artificial intelligence applications and innovations | 2012
Antonis Lambrou; Harris Papadopoulos; Ilia Nouretdinov; Alexander Gammerman
Venn Predictors (VPs) are machine learning algorithms that can provide well calibrated multiprobability outputs for their predictions. The only drawback of Venn Predictors is their computational inefficiency, especially in the case of large datasets. In this work, we propose an Inductive Venn Predictor (IVP) which overcomes the computational inefficiency problem of the original Venn Prediction framework. Each VP is defined by a taxonomy which separates the data into categories. We develop an IVP with a taxonomy derived from a multiclass Support Vector Machine (SVM), and we compare our method with other probabilistic methods for SVMs, namely Platt’s method, SVM Binning, and SVM with Isotonic Regression. We show that these methods do not always provide well calibrated outputs, while our IVP will always guarantee this property under the i.i.d. assumption.
Annals of Statistics | 2009
Vladimir Vovk; Ilia Nouretdinov; Alexander Gammerman
Gauss linear model; independent identically distributed observations; multivariate analysis; on-line protocol; prequential statistics; regression We consider the on-line predictive version of the standard problem of linear regression; the goal is to predict each consecutive response given the corresponding explanatory variables and all the previous observations. The standard treatment of prediction in linear regression analysis has two drawbacks: (1) the usual prediction intervals guarantee that the probability of error is equal to the nominal significance level ǫ, but this property per se does not imply that the long-run frequency of error is close to ǫ; (2) it is not suitable for prediction of complex systems as it assumes that the number of observations exceeds the number of parameters. We state a general result showing that in the on-line protocol the frequency of error does equal the nominal significance level, up to statistical fluctuations, and we describe alternative regression models in which informative prediction intervals can be found before the number of observations exceeds the number of parameters. One of these models, which only assumes that the observations are independent and identically distributed, is popular in machine learning but greatly underused in the statistical theory of regression.
artificial intelligence applications and innovations | 2010
Dmitry Devetyarov; Ilia Nouretdinov
Conformal predictors represent a new flexible framework that outputs region predictions with a guaranteed error rate. Efficiency of such predictions depends on the nonconformity measure that underlies the predictor. In this work we designed new nonconformity measures based on a random forest classifier. Experiments demonstrate that proposed conformal predictors are more efficient than current benchmarks on noisy mass spectrometry data (and at least as efficient on other type of data) while maintaining the property of validity: they output fewer multiple predictions, and the ratio of mistakes does not exceed the preset level. When forced to produce singleton predictions, the designed conformal predictors are at least as accurate as the benchmarks and sometimes significantly outperform them.
arXiv: Learning | 2016
Vladimir Vovk; Valentina Fedorova; Ilia Nouretdinov; Alexander Gammerman
We study optimal conformity measures for various criteria of efficiency in an idealised setting. This leads to an important class of criteria of efficiency that we call probabilistic; it turns out that the most standard criteria of efficiency used in literature on conformal prediction are not probabilistic.
artificial intelligence applications and innovations | 2011
Chenzhe Zhou; Ilia Nouretdinov; Zhiyuan Luo; Dmitry Adamskiy; Luke Randell; Nicholas Coldham; Alexander Gammerman
The main aim of this paper is to compare the results of several methods of prediction with confidence. In particular we compare the results of Venn Machine with Platt’s Method of estimating confidence. The results are presented and discussed.
artificial intelligence applications and innovations | 2012
Ilia Nouretdinov; Dmitry Devetyarov; Brian Burford; Stephane Camuzeaux; Aleksandra Gentry-Maharaj; Ali Tiss; Celia Smith; Zhiyuan Luo; Alexey Ya. Chervonenkis; Rachel Hallett; Volodya Vovk; M D Waterfield; Rainer Cramer; John F. Timms; Ian Jacobs; Usha Menon; Alexander Gammerman
This paper describes the methodology of providing multiprobability predictions for proteomic mass spectrometry data. The methodology is based on a newly developed machine learning framework called Venn machines. They allow us to output a valid probability interval. We apply this methodology to mass spectrometry data sets in order to predict the diagnosis of heart disease and early diagnoses of ovarian cancer. The experiments show that probability intervals are valid and narrow. In addition, probability intervals were compared with the output of a corresponding probability predictor.
arXiv: Learning | 2016
Paolo Toccaceli; Ilia Nouretdinov; Alexander Gammerman
The paper presents an application of Conformal Predictors to a chemoinformatics problem of identifying activities of chemical compounds. The paper addresses some specific challenges of this domain: a large number of compounds training examples, high-dimensionality of feature space, sparseness and a strong class imbalance. A variant of conformal predictors called Inductive Mondrian Conformal Predictor is applied to deal with these challenges. Results are presented for several non-conformity measures NCM extracted from underlying algorithms and different kernels. A number of performance measures are used in order to demonstrate the flexibility of Inductive Mondrian Conformal Predictors in dealing with such a complex set of data.
international conference on machine learning and applications | 2009
Ilia Nouretdinov; Brian Burford; Alexander Gammerman
In this work we apply a new technique called conformal prediction to the Functional Clustering of Gene Expression Profiles in Human Cancers Challenge. The method not only allows us to make predictions but also include measures of accuracy and reliability of the prediction. These measures are provably valid under i. i. d. assumption. Using this approach it becomes possible to control the number of errors by selecting a suitable confidence level. This paper describes the application of the method to gene expression for various types of cancer.