Is this you? Create Your Porfile

Ilia Nouretdinov

Royal Holloway, University of London

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Ilia Nouretdinov is active.

Explore More

Publication

Featured researches published by Ilia Nouretdinov.

NeuroImage | 2011

Machine learning classification with confidence: Application of transductive conformal predictors to MRI-based diagnostic and prognostic markers in depression

Ilia Nouretdinov; Sergi G. Costafreda; Alexander Gammerman; Alexey Ya. Chervonenkis; Vladimir Vovk; Vladimir Vapnik; Cynthia H.Y. Fu

There is rapidly accumulating evidence that the application of machine learning classification to neuroimaging measurements may be valuable for the development of diagnostic and prognostic prediction tools in psychiatry. However, current methods do not produce a measure of the reliability of the predictions. Knowing the risk of the error associated with a given prediction is essential for the development of neuroimaging-based clinical tools. We propose a general probabilistic classification method to produce measures of confidence for magnetic resonance imaging (MRI) data. We describe the application of transductive conformal predictor (TCP) to MRI images. TCP generates the most likely prediction and a valid measure of confidence, as well as the set of all possible predictions for a given confidence level. We present the theoretical motivation for TCP, and we have applied TCP to structural and functional MRI data in patients and healthy controls to investigate diagnostic and prognostic prediction in depression. We verify that TCP predictions are as accurate as those obtained with more standard machine learning methods, such as support vector machine, while providing the additional benefit of a valid measure of confidence for each prediction.

Clinical Chemistry | 2010

Peptides Generated Ex Vivo from Serum Proteins by Tumor-Specific Exopeptidases Are Not Useful Biomarkers in Ovarian Cancer

John F. Timms; Rainer Cramer; Stephane Camuzeaux; Ali Tiss; Celia Smith; Brian Burford; Ilia Nouretdinov; Dmitry Devetyarov; Aleksandra Gentry-Maharaj; Jeremy Ford; Zhiyuan Luo; Alexander Gammerman; Usha Menon; Ian Jacobs

BACKGROUND The serum peptidome may be a valuable source of diagnostic cancer biomarkers. Previous mass spectrometry (MS) studies have suggested that groups of related peptides discriminatory for different cancer types are generated ex vivo from abundant serum proteins by tumor-specific exopeptidases. We tested 2 complementary serum profiling strategies to see if similar peptides could be found that discriminate ovarian cancer from benign cases and healthy controls. METHODS We subjected identically collected and processed serum samples from healthy volunteers and patients to automated polypeptide extraction on octadecylsilane-coated magnetic beads and separately on ZipTips before MALDI-TOF MS profiling at 2 centers. The 2 platforms were compared and case control profiling data analyzed to find altered MS peak intensities. We tested models built from training datasets for both methods for their ability to classify a blinded test set. RESULTS Both profiling platforms had CVs of approximately 15% and could be applied for high-throughput analysis of clinical samples. The 2 methods generated overlapping peptide profiles, with some differences in peak intensity in different mass regions. In cross-validation, models from training data gave diagnostic accuracies up to 87% for discriminating malignant ovarian cancer from healthy controls and up to 81% for discriminating malignant from benign samples. Diagnostic accuracies up to 71% (malignant vs healthy) and up to 65% (malignant vs benign) were obtained when the models were validated on the blinded test set. CONCLUSIONS For ovarian cancer, altered MALDI-TOF MS peptide profiles alone cannot be used for accurate diagnoses.

artificial intelligence applications and innovations | 2012

Reliable Probability Estimates Based on Support Vector Machines for Large Multiclass Datasets

Antonis Lambrou; Harris Papadopoulos; Ilia Nouretdinov; Alexander Gammerman

Venn Predictors (VPs) are machine learning algorithms that can provide well calibrated multiprobability outputs for their predictions. The only drawback of Venn Predictors is their computational inefficiency, especially in the case of large datasets. In this work, we propose an Inductive Venn Predictor (IVP) which overcomes the computational inefficiency problem of the original Venn Prediction framework. Each VP is defined by a taxonomy which separates the data into categories. We develop an IVP with a taxonomy derived from a multiclass Support Vector Machine (SVM), and we compare our method with other probabilistic methods for SVMs, namely Platt’s method, SVM Binning, and SVM with Isotonic Regression. We show that these methods do not always provide well calibrated outputs, while our IVP will always guarantee this property under the i.i.d. assumption.

Annals of Statistics | 2009

On-line predictive linear regression

Vladimir Vovk; Ilia Nouretdinov; Alexander Gammerman

Gauss linear model; independent identically distributed observations; multivariate analysis; on-line protocol; prequential statistics; regression We consider the on-line predictive version of the standard problem of linear regression; the goal is to predict each consecutive response given the corresponding explanatory variables and all the previous observations. The standard treatment of prediction in linear regression analysis has two drawbacks: (1) the usual prediction intervals guarantee that the probability of error is equal to the nominal significance level ǫ, but this property per se does not imply that the long-run frequency of error is close to ǫ; (2) it is not suitable for prediction of complex systems as it assumes that the number of observations exceeds the number of parameters. We state a general result showing that in the on-line protocol the frequency of error does equal the nominal significance level, up to statistical fluctuations, and we describe alternative regression models in which informative prediction intervals can be found before the number of observations exceeds the number of parameters. One of these models, which only assumes that the observations are independent and identically distributed, is popular in machine learning but greatly underused in the statistical theory of regression.

artificial intelligence applications and innovations | 2010

Prediction with Confidence Based on a Random Forest Classifier

Dmitry Devetyarov; Ilia Nouretdinov

Conformal predictors represent a new flexible framework that outputs region predictions with a guaranteed error rate. Efficiency of such predictions depends on the nonconformity measure that underlies the predictor. In this work we designed new nonconformity measures based on a random forest classifier. Experiments demonstrate that proposed conformal predictors are more efficient than current benchmarks on noisy mass spectrometry data (and at least as efficient on other type of data) while maintaining the property of validity: they output fewer multiple predictions, and the ratio of mistakes does not exceed the preset level. When forced to produce singleton predictions, the designed conformal predictors are at least as accurate as the benchmarks and sometimes significantly outperform them.

arXiv: Learning | 2016

Criteria of Efficiency for Conformal Prediction

Vladimir Vovk; Valentina Fedorova; Ilia Nouretdinov; Alexander Gammerman

We study optimal conformity measures for various criteria of efficiency in an idealised setting. This leads to an important class of criteria of efficiency that we call probabilistic; it turns out that the most standard criteria of efficiency used in literature on conformal prediction are not probabilistic.

artificial intelligence applications and innovations | 2011

A Comparison of Venn Machine with Platt’s Method in Probabilistic Outputs

Chenzhe Zhou; Ilia Nouretdinov; Zhiyuan Luo; Dmitry Adamskiy; Luke Randell; Nicholas Coldham; Alexander Gammerman

The main aim of this paper is to compare the results of several methods of prediction with confidence. In particular we compare the results of Venn Machine with Platt’s Method of estimating confidence. The results are presented and discussed.

artificial intelligence applications and innovations | 2012

Multiprobabilistic Venn Predictors with Logistic Regression

Ilia Nouretdinov; Dmitry Devetyarov; Brian Burford; Stephane Camuzeaux; Aleksandra Gentry-Maharaj; Ali Tiss; Celia Smith; Zhiyuan Luo; Alexey Ya. Chervonenkis; Rachel Hallett; Volodya Vovk; M D Waterfield; Rainer Cramer; John F. Timms; Ian Jacobs; Usha Menon; Alexander Gammerman

This paper describes the methodology of providing multiprobability predictions for proteomic mass spectrometry data. The methodology is based on a newly developed machine learning framework called Venn machines. They allow us to output a valid probability interval. We apply this methodology to mass spectrometry data sets in order to predict the diagnosis of heart disease and early diagnoses of ovarian cancer. The experiments show that probability intervals are valid and narrow. In addition, probability intervals were compared with the output of a corresponding probability predictor.

arXiv: Learning | 2016

Conformal Predictors for Compound Activity Prediction

Paolo Toccaceli; Ilia Nouretdinov; Alexander Gammerman

The paper presents an application of Conformal Predictors to a chemoinformatics problem of identifying activities of chemical compounds. The paper addresses some specific challenges of this domain: a large number of compounds training examples, high-dimensionality of feature space, sparseness and a strong class imbalance. A variant of conformal predictors called Inductive Mondrian Conformal Predictor is applied to deal with these challenges. Results are presented for several non-conformity measures NCM extracted from underlying algorithms and different kernels. A number of performance measures are used in order to demonstrate the flexibility of Inductive Mondrian Conformal Predictors in dealing with such a complex set of data.

international conference on machine learning and applications | 2009

Application of Inductive Confidence Machine to ICMLA Competition Data

Ilia Nouretdinov; Brian Burford; Alexander Gammerman

In this work we apply a new technique called conformal prediction to the Functional Clustering of Gene Expression Profiles in Human Cancers Challenge. The method not only allows us to make predictions but also include measures of accuracy and reliability of the prediction. These measures are provably valid under i. i. d. assumption. Using this approach it becomes possible to control the number of errors by selecting a suitable confidence level. This paper describes the application of the method to gene expression for various types of cancer.

Explore More