Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zisis Kozlakidis is active.

Publication


Featured researches published by Zisis Kozlakidis.


Eurosurveillance | 2017

Emergence of a novel subclade of influenza A(H3N2) virus in London, December 2016 to January 2017

Heli Harvala; Dan Frampton; Paul Grant; Jade Raffle; Ruth Bridget Ferns; Zisis Kozlakidis; Paul Kellam; Deenan Pillay; Andrew Hayward; Eleni Nastouli

We report the molecular investigations of a large influenza A(H3N2) outbreak, in a season characterised by sharp increase in influenza admissions since December 2016. Analysis of haemagglutinin (HA) sequences demonstrated co-circulation of multiple clades (3C.3a, 3C.2a and 3C.2a1). Most variants fell into a novel subclade (proposed as 3C.2a2); they possessed four unique amino acid substitutions in the HA protein and loss of a potential glycosylation site. These changes potentially modify the H3N2 strain antigenicity.


Scientific Reports | 2016

Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic

Gonzalo Yebra; Emma B. Hodcroft; Manon Ragonnet-Cronin; Pillay D; Andrew J. Brown; Christophe Fraser; Paul Kellam; Tulio de Oliveira; Ann M. Dennis; Anne Hoppe; Cissy Kityo; Dan Frampton; Deogratius Ssemwanga; Frank Tanser; Jagoda Keshani; Jairam R. Lingappa; Joshua T. Herbeck; Maria J. Wawer; Max Essex; Myron S. Cohen; Nicholas I. Paton; Oliver Ratmann; Pontiano Kaleebu; Richard Hayes; Sarah Fidler; Thomas Quinn; Vladimir Novitsky; Iconic; Andrew Haywards; Eleni Nastouli

HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTRu2009+u2009Γ), and compared their topologies to the corresponding true tree’s using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.


Artificial Intelligence in Medicine | 2016

Evolving classification of intensive care patients from event data

Olga Tosas; Tiziano Gallo Cassarino; Zisis Kozlakidis; Jonathan D. Edgeworth

OBJECTIVEnThis work aims at predicting the patient discharge outcome on each hospitalization day by introducing a new paradigm-evolving classification of event data streams. Most classification algorithms implicitly assume the values of all predictive features to be available at the time of making the prediction. This assumption does not necessarily hold in the evolving classification setting (such as intensive care patient monitoring), where we may be interested in classifying the monitored entities as early as possible, based on the attributes initially available to the classifier, and then keep refining our classification model at each time step (e.g., on daily basis) with the arrival of additional attributes.nnnMATERIALS AND METHODSnAn oblivious read-once decision-tree algorithm, called information network (IN), is extended to deal with evolving classification. The new algorithm, named incremental information network (IIN), restricts the order of selected features by the temporal order of feature arrival. The IIN algorithm is compared to six other evolving classification approaches on an 8-year dataset of adult patients admitted to two Intensive Care Units (ICUs) in the United Kingdom.nnnRESULTSnRetrospective study of 3452 episodes of adult patients (≥16years of age) admitted to the ICUs of Guys and St. Thomas hospitals in London between 2002 and 2009. Random partition (66:34) into a development (training) set n=2287 and validation set n=1165. Episode-related time steps: Day 0-time of ICU admission, Day x-end of the x-th day at ICU. The most accurate decision-tree models, based on the area under curve (AUC): Day 0: IN (AUC=0.652), Day 1: IIN (AUC=0.660), Day 2: J48 decision-tree algorithm (AUC=0.678), Days 3-7: regenerative IN (AUC=0.717-0.772). Logistic regression AUC: 0.582 (Day 0)-0.827 (Day 7).nnnCONCLUSIONSnOur experimental results have not identified a single optimal approach for evolving classification of ICU episodes. On Days 0 and 1, the IIN algorithm has produced the simplest and the most accurate models, which incorporate the temporal order of feature arrival. However, starting with Day 2, regenerative approaches have reached better performance in terms of predictive accuracy.


Philosophy & Technology | 2017

Big Data Analytics, Infectious Diseases and Associated Ethical Impacts

Chiara Garattini; Jade Raffle; Dewi Nur Aisyah; Felicity Sartain; Zisis Kozlakidis

The exponential accumulation, processing and accrual of big data in healthcare are only possible through an equally rapidly evolving field of big data analytics. The latter offers the capacity to rationalize, understand and use big data to serve many different purposes, from improved services modelling to prediction of treatment outcomes, to greater patient and disease stratification. In the area of infectious diseases, the application of big data analytics has introduced a number of changes in the information accumulation models. These are discussed by comparing the traditional and new models of data accumulation. Big data analytics is fast becoming a crucial component for the modelling of transmission—aiding infection control measures and policies—emergency response analyses required during local or international outbreaks. However, the application of big data analytics in infectious diseases is coupled with a number of ethical impacts. Four key areas are discussed in this paper: (i) automation and algorithmic reliance impacting freedom of choice, (ii) big data analytics complexity impacting informed consent, (iii) reliance on profiling impacting individual and group identities and justice/fair access and (iv) increased surveillance and population intervention capabilities impacting behavioural norms and practices. Furthermore, the extension of big data analytics to include information derived from personal devices, such as mobile phones and wearables as part of infectious disease frameworks in the near future and their potential ethical impacts are discussed. Considered together, the need for a constructive and transparent inclusion of ethical questioning in this rapidly evolving field becomes an increasing necessity in order to provide a moral foundation for the societal acceptance and responsible development of the technological advancement.


bioRxiv | 2016

High-throughput pipeline for de-novo assembly and drug resistance mutations identification from Next-Generation Sequencing viral data of residual diagnostic samples

Tiziano Gallo Cassarino; Daniel Frampton; Robert Sugar; Elijah Charles; Zisis Kozlakidis; Paul Kellam

Motivation The underlying genomic variation of a large number of pathogenic viruses can give rise to drug resistant mutations resulting in treatment failure. Next generation sequencing (NGS) enables the identification of viral quasi-species and the quantification of minority variants in clinical samples; therefore, it can be of direct benefit by detecting drug resistant mutations and devising optimal treatment strategies for individual patients. Results The ICONIC (InfeCtion respONse through vIrus genomiCs) project has developed an automated, portable and customisable high-throughput computational pipeline to assemble de novo whole viral genomes, either segmented or non-segmented, and quantify minority variants using residual diagnostic samples. The pipeline has been benchmarked on a dedicated High-Performance Computing cluster using paired-end reads from RSV and Influenza clinical samples. The median length of generated genomes was 96% for the RSV dataset and 100% for each Influenza segment. The analysis of each set lasted less than 12 hours; each sample took around 3 hours and required a maximum memory of 10 GB. The pipeline can be easily ported to a dedicated server or cluster through either an installation script or a docker image. As it enables the subtyping of viral samples and the detection of relevant drug resistance mutations within three days of sample collection, our pipeline could operate within existing clinical reporting time frames and potentially be used as a decision support tool towards more effective personalised patient treatments. Availability The software and its documentation are available from https://github.com/ICONIC-UCL/pipeline Contact [email protected], [email protected] Supplementary information Supplementary data are available at Briefings in Bioinformatics online.Motivation: The underlying genomic variation of a large number of pathogenic viruses can give rise to drug resistant mutations resulting in treatment failure. Next generation sequencing (NGS) enables the identification of viral quasi-species and the quantification of minority variants in clinical samples; therefore, it can be of direct benefit by detecting drug resistant mutations and devising optimal treatment strategies for individual patients. Results: The ICONIC (InfeCtion respONse through vIrus genomiCs) project has developed an automated, portable and customisable high-throughput computational pipeline to assemble de novo whole viral genomes, either segmented or non-segmented, and quantify minority variants using residual diagnostic samples. The pipeline has been benchmarked on a dedicated High-Performance Computing cluster using paired-end reads from RSV, HIV and Influenza clinical samples. The median length of generated genomes was 82% for the HIV dataset and 100% for each Influenza segment. The analysis of each set lasted less than 12 hours; each sample took around 3 hours and required a maximum memory of 10 GB. The pipeline can be easily ported to a dedicated server or cluster through either an installation script or a docker image. As it enables the subtyping of viral samples and the detection of relevant drug resistance mutations within three days of sample collection, our pipeline could operate within existing clinical reporting time frames and potentially be used as a decision support tool towards more effective personalised patient treatments. Availability: The software and its documentation are available from https://github.com/ICONIC-UCL/pipeline


The Lancet HIV | 2018

Ethical considerations in global HIV phylogenetic research.

Cordelia E. M. Coltart; Anne Hoppe; Michael Parker; Liza Dawson; Joseph J. Amon; Musonda Simwinga; Gail Geller; Gail E. Henderson; Oliver Laeyendecker; Joseph D. Tucker; Patrick Eba; Vladimir Novitsky; Anne-Mieke Vandamme; Janet Seeley; Gina Dallabetta; Guy Harling; M. Kate Grabowski; Peter Godfrey-Faussett; Christophe Fraser; Myron S. Cohen; Deenan Pillay; Rachel Baggaley; Edwin J. Bernard; David N. Burns; Cordelia C. Coltart; Nikos Dedes; Valerie Delpech; Patrick M. Eba; Danielle German; M. Kate Grabowksi

Phylogenetic analysis of pathogens is an increasingly powerful way to reduce the spread of epidemics, including HIV. As a result, phylogenetic approaches are becoming embedded in public health and research programmes, as well as outbreak responses, presenting unique ethical, legal, and social issues that are not adequately addressed by existing bioethics literature. We formed a multidisciplinary working group to explore the ethical issues arising from the design of, conduct in, and use of results from HIV phylogenetic studies, and to propose recommendations to minimise the associated risks to both individuals and groups. We identified eight key ethical domains, within which we highlighted factors that make HIV phylogenetic research unique. In this Review, we endeavoured to provide a framework to assist researchers, public health practitioners, and funding institutions to ensure that HIV phylogenetic studies are designed, done, and disseminated in an ethical manner. Our conclusions also have broader relevance for pathogen phylogenetics.


Clinical Infectious Diseases | 2018

Estimating the Hospital Burden of Norovirus-Associated Gastroenteritis in England and Its Opportunity Costs for Nonadmitted Patients

F.G. Sandmann; Laura Shallcross; Natalie Adams; David Allen; Pietro G. Coen; Annette Jeanes; Zisis Kozlakidis; Lesley Larkin; Fatima B Wurie; Julie V. Robotham; Mark Jit; Sarah R Deeny

Since the introduction of rotavirus vaccination in England in July 2013, norovirus has become the second-largest contributor of inpatient gastroenteritis, preventing 57800 patients from being admitted annually. Economic costs amount to £297.7 million, which translates into 6300 quality-adjusted life years.


The Lancet | 2016

Cost analysis of standard Sanger sequencing versus next generation sequencing in the ICONIC study

Nishma Patel; Bridget R Ferns; Eleni Nastouli; Zisis Kozlakidis; Paul Kellam; Stephen Morris

Abstract Background HIV and hepatitis C virus (HCV) are a major cause of morbidity and mortality, and both viruses contain high genomic variation. To date, viral gene sequencing has been handled by standard Sanger sequencing (SSS) for the detection of specific drug-resistance determinants for HIV and HCV. However, SSS-derived information is very limited. By contrast, full-length viral gene sequences when linked to clinical data might influence the monitoring of drug resistance to optimally guide treatment, identify sources of viral transmissions within health-care settings, and track emerging epidemics. The ICONIC (Infection Response through Virus Genomics) study aims to introduce a novel method called next generation sequencing (NGS) within UK health-care settings, testing potential implementation in routine practice. With use of samples within established diagnostic laboratory workflows, NGS has the potential to produce higher informational content than SSS and report in a timely fashion. However, economic evidence for this emerging method is scarce. We aimed to use the examples of HIV and HCV to compare the cost of NGS versus SSS. Methods We performed a bottom-up cost analysis using published, genomic-testing, costing templates to estimate the mean cost per sample for SSS and NGS methods over a 1 year period at a major London hospital. Data on resource use associated with genomic testing were based on estimates from individual sample data, routinely collected from a UK population. Findings With SSS, mean cost per sample, including operating costs, was £178 for HCV (2080 samples) and £79 for HIV (520). Mean cost per sample with NGS was £119 (2207 samples), including operating costs, generating a cost saving of £59 for HCV and a surplus of £40 for HIV. Although this method is still research based and prices vary widely, our results demonstrated a broad NGS and SSS cost equivalence. Interpretation NGS is data rich and could be integrated in emerging stratified patient treatments. The mean cost per sample for the two methods should be similar and provide added-value information. A number of costing toolkits should be designed to address the appropriate pricing, including health-care consultation and operating costs when considering NGS efficiency and cost-effectiveness. The mean cost per sample for NGS as part of routine health care requires further exploration. Funding This project is funded by the Health Innovation Challenge Fund, a parallel partnership between the Wellcome Trust and the Department of Health (ref HICF-T5-344). FRB received funding for this study from the National Institute for Health Research (NIHR) Biomedical Research Centre, and the UCLH/UCL Biomedical Research Centre funded this NIHR Health Informatics Collaborative study.


The Journal of Infectious Diseases | 2018

Use of Whole-Genome Sequencing in the Investigation of a Nosocomial Influenza Virus Outbreak

Catherine Houlihan; Dan Frampton; R. Bridget Ferns; Jade Raffle; Paul Grant; Myriam Reidy; Leila Hail; Kirsty Thomson; Frank Mattes; Zisis Kozlakidis; Deenan Pillay; Andrew Hayward; Eleni Nastouli

Abstract Traditional epidemiological investigation of nosocomial transmission of influenza involves the identification of patients who have the same influenza virus type and who have overlapped in time and place. This method may misidentify transmission where it has not occurred or miss transmission when it has. We used influenza virus whole-genome sequencing (WGS) to investigate an outbreak of influenza A virus infection in a hematology/oncology ward and identified 2 separate introductions, one of which resulted in 5 additional infections and 79 bed-days lost. Results from WGS are becoming rapidly available and may supplement traditional infection control procedures in the investigation and management of nosocomial outbreaks.


PLOS ONE | 2018

A high HIV-1 strain variability in London, UK, revealed by full-genome analysis: Results from the ICONIC project

Gonzalo Yebra; Dan Frampton; Tiziano Gallo Cassarino; Jade Raffle; Jonathan Hubb; R. Bridget Ferns; Laura Waters; C. Y. William Tong; Zisis Kozlakidis; Andrew Hayward; Paul Kellam; Deenan Pillay; Duncan A. Clark; Eleni Nastouli; Andrew J. Brown

Background & methods The ICONIC project has developed an automated high-throughput pipeline to generate HIV nearly full-length genomes (NFLG, i.e. from gag to nef) from next-generation sequencing (NGS) data. The pipeline was applied to 420 HIV samples collected at University College London Hospitals NHS Trust and Barts Health NHS Trust (London) and sequenced using an Illumina MiSeq at the Wellcome Trust Sanger Institute (Cambridge). Consensus genomes were generated and subtyped using COMET, and unique recombinants were studied with jpHMM and SimPlot. Maximum-likelihood phylogenetic trees were constructed using RAxML to identify transmission networks using the Cluster Picker. Results The pipeline generated sequences of at least 1Kb of length (median = 7.46Kb, IQR = 4.01Kb) for 375 out of the 420 samples (89%), with 174 (46.4%) being NFLG. A total of 365 sequences (169 of them NFLG) corresponded to unique subjects and were included in the down-stream analyses. The most frequent HIV subtypes were B (n = 149, 40.8%) and C (n = 77, 21.1%) and the circulating recombinant form CRF02_AG (n = 32, 8.8%). We found 14 different CRFs (n = 66, 18.1%) and multiple URFs (n = 32, 8.8%) that involved recombination between 12 different subtypes/CRFs. The most frequent URFs were B/CRF01_AE (4 cases) and A1/D, B/C, and B/CRF02_AG (3 cases each). Most URFs (19/26, 73%) lacked breakpoints in the PR+RT pol region, rendering them undetectable if only that was sequenced. Twelve (37.5%) of the URFs could have emerged within the UK, whereas the rest were probably imported from sub-Saharan Africa, South East Asia and South America. For 2 URFs we found highly similar pol sequences circulating in the UK. We detected 31 phylogenetic clusters using the full dataset: 25 pairs (mostly subtypes B and C), 4 triplets and 2 quadruplets. Some of these were not consistent across different genes due to inter- and intra-subtype recombination. Clusters involved 70 sequences, 19.2% of the dataset. Conclusions The initial analysis of genome sequences detected substantial hidden variability in the London HIV epidemic. Analysing full genome sequences, as opposed to only PR+RT, identified previously undetected recombinants. It provided a more reliable description of CRFs (that would be otherwise misclassified) and transmission clusters.

Collaboration


Dive into the Zisis Kozlakidis's collaboration.

Top Co-Authors

Avatar

Dan Frampton

University College London

View shared research outputs
Top Co-Authors

Avatar

Paul Kellam

Imperial College London

View shared research outputs
Top Co-Authors

Avatar

Eleni Nastouli

University College London

View shared research outputs
Top Co-Authors

Avatar

Deenan Pillay

University College London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Andrew Hayward

University College London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jade Raffle

University College London

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Anne Hoppe

University College London

View shared research outputs
Researchain Logo
Decentralizing Knowledge