Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Arianna Dagliati is active.

Publication


Featured researches published by Arianna Dagliati.


BMC Bioinformatics | 2012

An ICT infrastructure to integrate clinical and molecular data in oncology research

Daniele Segagni; Valentina Tibollo; Arianna Dagliati; Alberto Zambelli; Silvia G. Priori; Riccardo Bellazzi

BackgroundThe ONCO-i2b2 platform is a bioinformatics tool designed to integrate clinical and research data and support translational research in oncology. It is implemented by the University of Pavia and the IRCCS Fondazione Maugeri hospital (FSM), and grounded on the software developed by the Informatics for Integrating Biology and the Bedside (i2b2) research center. I2b2 has delivered an open source suite based on a data warehouse, which is efficiently interrogated to find sets of interesting patients through a query tool interface.MethodsOnco-i2b2 integrates data coming from multiple sources and allows the users to jointly query them. I2b2 data are then stored in a data warehouse, where facts are hierarchically structured as ontologies. Onco-i2b2 gathers data from the FSM pathology unit (PU) database and from the hospital biobank and merges them with the clinical information from the hospital information system.Our main effort was to provide a robust integrated research environment, giving a particular emphasis to the integration process and facing different challenges, consecutively listed: biospecimen samples privacy and anonymization; synchronization of the biobank database with the i2b2 data warehouse through a series of Extract, Transform, Load (ETL) operations; development and integration of a Natural Language Processing (NLP) module, to retrieve coded information, such as SNOMED terms and malignant tumors (TNM) classifications, and clinical tests results from unstructured medical records. Furthermore, we have developed an internal SNOMED ontology rested on the NCBO BioPortal web services.ResultsOnco-i2b2 manages data of more than 6,500 patients with breast cancer diagnosis collected between 2001 and 2011 (over 390 of them have at least one biological sample in the cancer biobank), more than 47,000 visits and 96,000 observations over 960 medical concepts.ConclusionsOnco-i2b2 is a concrete example of how integrated Information and Communication Technology architecture can be implemented to support translational research. The next steps of our project will involve the extension of its capabilities by implementing new plug-in devoted to bioinformatics data analysis as well as a temporal query module.


medical informatics europe | 2011

The ONCO-I2b2 project: integrating biobank information and clinical data to support translational research in oncology.

Daniele Segagni; Valentina Tibollo; Arianna Dagliati; Leonardo Perinati; Alberto Zambelli; Silvia G. Priori; Riccardo Bellazzi

The University of Pavia and the IRCCS Fondazione Salvatore Maugeri of Pavia (FSM), has recently started an IT initiative to support clinical research in oncology, called ONCO-i2b2. ONCO-i2b2, funded by the Lombardia region, grounds on the software developed by the Informatics for Integrating Biology and the Bedside (i2b2) NIH project. Using i2b2 and new software modules purposely designed, data coming from multiple sources are integrated and jointly queried. The core of the integration process stands in retrieving and merging data from the biobank management software and from the FSM hospital information system. The integration process is based on a ontology of the problem domain and on open-source software integration modules. A Natural Language Processing module has been implemented, too. This module automatically extracts clinical information of oncology patients from unstructured medical records. The system currently manages more than two thousands patients and will be further implemented and improved in the next two years.


Methods of Molecular Biology | 2015

Analyzing complex patients' temporal histories: New frontiers in temporal data mining

Lucia Sacchi; Arianna Dagliati; Riccardo Bellazzi

In recent years, data coming from hospital information systems (HIS) and local healthcare organizations have started to be intensively used for research purposes. This rising amount of available data allows reconstructing the compete histories of the patients, which have a strong temporal component. This chapter introduces the major challenges faced by temporal data mining researchers in an era when huge quantities of complex clinical temporal data are becoming available. The analysis is focused on the peculiar features of this kind of data and describes the methodological and technological aspects that allow managing such complex framework. The chapter shows how heterogeneous data can be processed to derive a homogeneous representation. Starting from this representation, it illustrates different techniques for jointly analyze such kind of data. Finally, the technological strategies that allow creating a common data warehouse to gather data coming from different sources and with different formats are presented.


Journal of diabetes science and technology | 2015

Big Data Technologies: New Opportunities for Diabetes Management

Riccardo Bellazzi; Arianna Dagliati; Lucia Sacchi; Daniele Segagni

The so-called big data revolution provides substantial opportunities to diabetes management. At least 3 important directions are currently of great interest. First, the integration of different sources of information, from primary and secondary care to administrative information, may allow depicting a novel view of patient’s care processes and of single patient’s behaviors, taking into account the multifaceted nature of chronic care. Second, the availability of novel diabetes technologies, able to gather large amounts of real-time data, requires the implementation of distributed platforms for data analysis and decision support. Finally, the inclusion of geographical and environmental information into such complex IT systems may further increase the capability of interpreting the data gathered and extract new knowledge from them. This article reviews the main concepts and definitions related to big data, it presents some efforts in health care, and discusses the potential role of big data in diabetes care. Finally, as an example, it describes the research efforts carried on in the MOSAIC project, funded by the European Commission.


Journal of Biomedical Informatics | 2017

Temporal electronic phenotyping by mining careflows of breast cancer patients

Arianna Dagliati; Lucia Sacchi; Alberto Zambelli; Valentina Tibollo; L. Pavesi; John H. Holmes; Riccardo Bellazzi

In this work we present a careflow mining approach designed to analyze heterogeneous longitudinal data and to identify phenotypes in a patient cohort. The main idea underlying our approach is to combine methods derived from sequential pattern mining and temporal data mining to derive frequent healthcare histories (careflows) in a population of patients. This approach was applied to an integrated data repository containing clinical and administrative data of more than 4000 breast cancer patients. We used the mined histories to identify sub-cohorts of patients grouped according to healthcare activities pathways, then we characterized these sub-cohorts with clinical data. In this way, we were able to perform temporal electronic phenotyping of electronic health records (EHR) data.


medical informatics europe | 2012

CARDIO-i2b2: integrating arrhythmogenic disease data in i2b2.

Daniele Segagni; Tibollo; Arianna Dagliati; Carlo Napolitano; Riccardo Bellazzi

The CARDIO-i2b2 project is an initiative to customize the i2b2 bioinformatics tool with the aim to integrate clinical and research data in order to support translational research in cardiology. In this work we describe the implementation and the customization of i2b2 to manage the data of arrhytmogenic disease patients collected at the Fondazione Salvatore Maugeri of Pavia in a joint project with the NYU Langone Medical Center (New York, USA). The i2b2 clinical research chart data warehouse is populated with the data obtained by the research database called TRIAD. The research infrastructure is extended by the development of new plug-ins for the i2b2 web client application able to properly select and export phenotypic data and to perform data analysis.


Journal of the American Medical Informatics Association | 2018

A dashboard-based system for supporting diabetes care

Arianna Dagliati; Lucia Sacchi; Valentina Tibollo; Giulia Cogni; Marsida Teliti; Antonio Martinez-Millana; Vicente Traver; Daniele Segagni; Jorge Posada; Manuel Ottaviano; Giuseppe Fico; María Teresa Arredondo; Pasquale De Cata; Luca Chiovato; Riccardo Bellazzi

Objective To describe the development, as part of the European Union MOSAIC (Models and Simulation Techniques for Discovering Diabetes Influence Factors) project, of a dashboard-based system for the management of type 2 diabetes and assess its impact on clinical practice. Methods The MOSAIC dashboard system is based on predictive modeling, longitudinal data analytics, and the reuse and integration of data from hospitals and public health repositories. Data are merged into an i2b2 data warehouse, which feeds a set of advanced temporal analytic models, including temporal abstractions, care-flow mining, drug exposure pattern detection, and risk-prediction models for type 2 diabetes complications. The dashboard has 2 components, designed for (1) clinical decision support during follow-up consultations and (2) outcome assessment on populations of interest. To assess the impact of the clinical decision support component, a pre-post study was conducted considering visit duration, number of screening examinations, and lifestyle interventions. A pilot sample of 700 Italian patients was investigated. Judgments on the outcome assessment component were obtained via focus groups with clinicians and health care managers. Results The use of the decision support component in clinical activities produced a reduction in visit duration (P ≪ .01) and an increase in the number of screening exams for complications (P < .01). We also observed a relevant, although nonstatistically significant, increase in the proportion of patients receiving lifestyle interventions (from 69% to 77%). Regarding the outcome assessment component, focus groups highlighted the systems capability of identifying and understanding the characteristics of patient subgroups treated at the center. Conclusion Our study demonstrates that decision support tools based on the integration of multiple-source data and visual and predictive analytics do improve the management of a chronic disease such as type 2 diabetes by enacting a successful implementation of the learning health care system cycle.


Journal of diabetes science and technology | 2016

Integration of Administrative, Clinical, and Environmental Data to Support the Management of Type 2 Diabetes Mellitus: From Satellites to Clinical Care.

Arianna Dagliati; Andrea Marinoni; Carlo Cerra; Pasquale Decata; Luca Chiovato; Paolo Gamba; Riccardo Bellazzi

A very interesting perspective of “big data” in diabetes management stands in the integration of environmental information with data gathered for clinical and administrative purposes, to increase the capability of understanding spatial and temporal patterns of diseases. Within the MOSAIC project, funded by the European Union with the goal to design new diabetes analytics, we have jointly analyzed a clinical-administrative dataset of nearly 1.000 type 2 diabetes patients with environmental information derived from air quality maps acquired from remote sensing (satellite) data. Within this context we have adopted a general analysis framework able to deal with a large variety of temporal, geo-localized data. Thanks to the exploitation of time series analysis and satellite images processing, we studied whether glycemic control showed seasonal variations and if they have a spatiotemporal correlation with air pollution maps. We observed a link between the seasonal trends of glycated hemoglobin and air pollution in some of the considered geographic areas. Such findings will need future investigations for further confirmation. This work shows that it is possible to successfully deal with big data by implementing new analytics and how their exploration may provide new scenarios to better understand clinical phenomena.


international geoscience and remote sensing symposium | 2015

Inferring air quality maps from remotely sensed data to exploit georeferenced clinical onsets: The Pavia 2013 case

Andrea Marinoni; Arianna Dagliati; Riccardo Bellazzi; Paolo Gamba

Recent developments in data acquisition, storage, mining and maintenance have allowed the flourishing of several multi-disciplinary research fields, which can be stated, defined and carried out according to the so-called Big Data paradigm. In this environment, the investigation and analysis of interactions between human phenomena and natural events play a key-role, as they can be fundamental for several applications, from sustainable development to community policy design and short-, medium- and long-range resource allocation planning. In this paper, we provide a study of the interplay between air pollution (as estimated by remotely sensed data processing) and clinical records, so that inferences and correlations among black particulate concentration, micro- and macro-vascular disease onsets and hospitalization tracks can be efficiently drawn. We focused on the second order administrative area of the city of Pavia, Italy, on 2013. Experimental results show how effective connections between the estimated air quality and the hospitalizations behavior can be accurately drawn and derived.


Journal of diabetes science and technology | 2018

Machine Learning Methods to Predict Diabetes Complications

Arianna Dagliati; Simone Marini; Lucia Sacchi; Giulia Cogni; Marsida Teliti; Valentina Tibollo; Pasquale De Cata; Luca Chiovato; Riccardo Bellazzi

One of the areas where Artificial Intelligence is having more impact is machine learning, which develops algorithms able to learn patterns and decision rules from data. Machine learning algorithms have been embedded into data mining pipelines, which can combine them with classical statistical strategies, to extract knowledge from data. Within the EU-funded MOSAIC project, a data mining pipeline has been used to derive a set of predictive models of type 2 diabetes mellitus (T2DM) complications based on electronic health record data of nearly one thousand patients. Such pipeline comprises clinical center profiling, predictive model targeting, predictive model construction and model validation. After having dealt with missing data by means of random forest (RF) and having applied suitable strategies to handle class imbalance, we have used Logistic Regression with stepwise feature selection to predict the onset of retinopathy, neuropathy, or nephropathy, at different time scenarios, at 3, 5, and 7 years from the first visit at the Hospital Center for Diabetes (not from the diagnosis). Considered variables are gender, age, time from diagnosis, body mass index (BMI), glycated hemoglobin (HbA1c), hypertension, and smoking habit. Final models, tailored in accordance with the complications, provided an accuracy up to 0.838. Different variables were selected for each complication and time scenario, leading to specialized models easy to translate to the clinical practice.

Collaboration


Dive into the Arianna Dagliati's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge