Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Valentina Tibollo is active.

Publication


Featured researches published by Valentina Tibollo.


BMC Bioinformatics | 2012

An ICT infrastructure to integrate clinical and molecular data in oncology research

Daniele Segagni; Valentina Tibollo; Arianna Dagliati; Alberto Zambelli; Silvia G. Priori; Riccardo Bellazzi

BackgroundThe ONCO-i2b2 platform is a bioinformatics tool designed to integrate clinical and research data and support translational research in oncology. It is implemented by the University of Pavia and the IRCCS Fondazione Maugeri hospital (FSM), and grounded on the software developed by the Informatics for Integrating Biology and the Bedside (i2b2) research center. I2b2 has delivered an open source suite based on a data warehouse, which is efficiently interrogated to find sets of interesting patients through a query tool interface.MethodsOnco-i2b2 integrates data coming from multiple sources and allows the users to jointly query them. I2b2 data are then stored in a data warehouse, where facts are hierarchically structured as ontologies. Onco-i2b2 gathers data from the FSM pathology unit (PU) database and from the hospital biobank and merges them with the clinical information from the hospital information system.Our main effort was to provide a robust integrated research environment, giving a particular emphasis to the integration process and facing different challenges, consecutively listed: biospecimen samples privacy and anonymization; synchronization of the biobank database with the i2b2 data warehouse through a series of Extract, Transform, Load (ETL) operations; development and integration of a Natural Language Processing (NLP) module, to retrieve coded information, such as SNOMED terms and malignant tumors (TNM) classifications, and clinical tests results from unstructured medical records. Furthermore, we have developed an internal SNOMED ontology rested on the NCBO BioPortal web services.ResultsOnco-i2b2 manages data of more than 6,500 patients with breast cancer diagnosis collected between 2001 and 2011 (over 390 of them have at least one biological sample in the cancer biobank), more than 47,000 visits and 96,000 observations over 960 medical concepts.ConclusionsOnco-i2b2 is a concrete example of how integrated Information and Communication Technology architecture can be implemented to support translational research. The next steps of our project will involve the extension of its capabilities by implementing new plug-in devoted to bioinformatics data analysis as well as a temporal query module.


Journal of the American Medical Informatics Association | 2011

R engine cell: integrating R into the i2b2 software infrastructure.

Daniele Segagni; Fulvia Ferrazzi; Cristiana Larizza; Valentina Tibollo; Carlo Napolitano; Silvia G. Priori; Riccardo Bellazzi

Informatics for Integrating Biology and the Bedside (i2b2) is an initiative funded by the NIH that aims at building an informatics infrastructure to support biomedical research. The University of Pavia has recently integrated i2b2 infrastructure with a registry of inherited arrhythmogenic diseases. Within this project, the authors created a novel i2b2 cell, named R Engine Cell, which allows the communication between i2b2 and the R statistical software. As survival analyses are routinely performed by cardiology researchers, the authors have first concentrated on making Kaplan-Meier analyses available within the i2b2 web interface. To this aim, the authors developed a web-client plug-in to select the patient set on which to perform the analysis and to display the results in a graphical, intuitive way. R Engine Cell has been designed to easily support the integration of other R-based statistical analyses into i2b2.


medical informatics europe | 2011

The ONCO-I2b2 project: integrating biobank information and clinical data to support translational research in oncology.

Daniele Segagni; Valentina Tibollo; Arianna Dagliati; Leonardo Perinati; Alberto Zambelli; Silvia G. Priori; Riccardo Bellazzi

The University of Pavia and the IRCCS Fondazione Salvatore Maugeri of Pavia (FSM), has recently started an IT initiative to support clinical research in oncology, called ONCO-i2b2. ONCO-i2b2, funded by the Lombardia region, grounds on the software developed by the Informatics for Integrating Biology and the Bedside (i2b2) NIH project. Using i2b2 and new software modules purposely designed, data coming from multiple sources are integrated and jointly queried. The core of the integration process stands in retrieving and merging data from the biobank management software and from the FSM hospital information system. The integration process is based on a ontology of the problem domain and on open-source software integration modules. A Natural Language Processing module has been implemented, too. This module automatically extracts clinical information of oncology patients from unstructured medical records. The system currently manages more than two thousands patients and will be further implemented and improved in the next two years.


Journal of Biomedical Informatics | 2017

Temporal electronic phenotyping by mining careflows of breast cancer patients

Arianna Dagliati; Lucia Sacchi; Alberto Zambelli; Valentina Tibollo; L. Pavesi; John H. Holmes; Riccardo Bellazzi

In this work we present a careflow mining approach designed to analyze heterogeneous longitudinal data and to identify phenotypes in a patient cohort. The main idea underlying our approach is to combine methods derived from sequential pattern mining and temporal data mining to derive frequent healthcare histories (careflows) in a population of patients. This approach was applied to an integrated data repository containing clinical and administrative data of more than 4000 breast cancer patients. We used the mined histories to identify sub-cohorts of patients grouped according to healthcare activities pathways, then we characterized these sub-cohorts with clinical data. In this way, we were able to perform temporal electronic phenotyping of electronic health records (EHR) data.


Journal of the American Medical Informatics Association | 2018

A dashboard-based system for supporting diabetes care

Arianna Dagliati; Lucia Sacchi; Valentina Tibollo; Giulia Cogni; Marsida Teliti; Antonio Martinez-Millana; Vicente Traver; Daniele Segagni; Jorge Posada; Manuel Ottaviano; Giuseppe Fico; María Teresa Arredondo; Pasquale De Cata; Luca Chiovato; Riccardo Bellazzi

Objective To describe the development, as part of the European Union MOSAIC (Models and Simulation Techniques for Discovering Diabetes Influence Factors) project, of a dashboard-based system for the management of type 2 diabetes and assess its impact on clinical practice. Methods The MOSAIC dashboard system is based on predictive modeling, longitudinal data analytics, and the reuse and integration of data from hospitals and public health repositories. Data are merged into an i2b2 data warehouse, which feeds a set of advanced temporal analytic models, including temporal abstractions, care-flow mining, drug exposure pattern detection, and risk-prediction models for type 2 diabetes complications. The dashboard has 2 components, designed for (1) clinical decision support during follow-up consultations and (2) outcome assessment on populations of interest. To assess the impact of the clinical decision support component, a pre-post study was conducted considering visit duration, number of screening examinations, and lifestyle interventions. A pilot sample of 700 Italian patients was investigated. Judgments on the outcome assessment component were obtained via focus groups with clinicians and health care managers. Results The use of the decision support component in clinical activities produced a reduction in visit duration (P ≪ .01) and an increase in the number of screening exams for complications (P < .01). We also observed a relevant, although nonstatistically significant, increase in the proportion of patients receiving lifestyle interventions (from 69% to 77%). Regarding the outcome assessment component, focus groups highlighted the systems capability of identifying and understanding the characteristics of patient subgroups treated at the center. Conclusion Our study demonstrates that decision support tools based on the integration of multiple-source data and visual and predictive analytics do improve the management of a chronic disease such as type 2 diabetes by enacting a successful implementation of the learning health care system cycle.


Journal of diabetes science and technology | 2018

Machine Learning Methods to Predict Diabetes Complications

Arianna Dagliati; Simone Marini; Lucia Sacchi; Giulia Cogni; Marsida Teliti; Valentina Tibollo; Pasquale De Cata; Luca Chiovato; Riccardo Bellazzi

One of the areas where Artificial Intelligence is having more impact is machine learning, which develops algorithms able to learn patterns and decision rules from data. Machine learning algorithms have been embedded into data mining pipelines, which can combine them with classical statistical strategies, to extract knowledge from data. Within the EU-funded MOSAIC project, a data mining pipeline has been used to derive a set of predictive models of type 2 diabetes mellitus (T2DM) complications based on electronic health record data of nearly one thousand patients. Such pipeline comprises clinical center profiling, predictive model targeting, predictive model construction and model validation. After having dealt with missing data by means of random forest (RF) and having applied suitable strategies to handle class imbalance, we have used Logistic Regression with stepwise feature selection to predict the onset of retinopathy, neuropathy, or nephropathy, at different time scenarios, at 3, 5, and 7 years from the first visit at the Hospital Center for Diabetes (not from the diagnosis). Considered variables are gender, age, time from diagnosis, body mass index (BMI), glycated hemoglobin (HbA1c), hypertension, and smoking habit. Final models, tailored in accordance with the complications, provided an accuracy up to 0.838. Different variables were selected for each complication and time scenario, leading to specialized models easy to translate to the clinical practice.


data integration in the life sciences | 2012

ONCO-i2b2: improve patients selection through case-based information retrieval techniques

Daniele Segagni; Matteo Gabetta; Valentina Tibollo; Alberto Zambelli; Silvia G. Priori; Riccardo Bellazzi

The University of Pavia (Italy) and the IRCCS Fondazione Salvatore Maugeri hospital in Pavia have recently started an information technology initiative to support clinical research in oncology called ONCO-i2b2. This project aims at supporting translational research in oncology and exploits the software solutions implemented by the Informatics for Integrating Biology and the Bed-side (i2b2) research center. The ONCO-i2b2 software is designed to integrate the i2b2 infrastructure with the hospital information system, with the pathology unit and with a cancer biobank that manages both plasma and cancer tissue samples. Exploiting the medical concepts related to each patient, we have developed a novel data mining procedure that allows researchers to easily identify patients similar to those found with the i2b2 query tool, so as to increase the number of patients, compared to the patient set directly retrieved by the query. This allows physicians to obtain additional information that can support new insights in the study of tumors.


Journal of diabetes science and technology | 2018

Careflow Mining Techniques to Explore Type 2 Diabetes Evolution

Arianna Dagliati; Valentina Tibollo; Giulia Cogni; Luca Chiovato; Riccardo Bellazzi; Lucia Sacchi

In this work we describe the application of a careflow mining algorithm to detect the most frequent patterns of care in a type 2 diabetes patients cohort. The applied method enriches the detected patterns with clinical data to define temporal phenotypes across the studied population. Novel phenotypes are discovered from heterogeneous data of 424 Italian patients, and compared in terms of metabolic control and complications. Results show that careflow mining can help to summarize the complex evolution of the disease into meaningful patterns, which are also significant from a clinical point of view.


International Journal of Medical Informatics | 2018

Information extraction from Italian medical reports: An ontology-driven approach

Natalia Viani; Cristiana Larizza; Valentina Tibollo; Carlo Napolitano; Silvia G. Priori; Riccardo Bellazzi; Lucia Sacchi

OBJECTIVE In this work, we propose an ontology-driven approach to identify events and their attributes from episodes of care included in medical reports written in Italian. For this language, shared resources for clinical information extraction are not easily accessible. MATERIALS AND METHODS The corpus considered in this work includes 5432 non-annotated medical reports belonging to patients with rare arrhythmias. To guide the information extraction process, we built a domain-specific ontology that includes the events and the attributes to be extracted, with related regular expressions. The ontology and the annotation system were constructed on a development set, while the performance was evaluated on an independent test set. As a gold standard, we considered a manually curated hospital database named TRIAD, which stores most of the information written in reports. RESULTS The proposed approach performs well on the considered Italian medical corpus, with a percentage of correct annotations above 90% for most considered clinical events. We also assessed the possibility to adapt the system to the analysis of another language (i.e., English), with promising results. DISCUSSION AND CONCLUSION Our annotation system relies on a domain ontology to extract and link information in clinical text. We developed an ontology that can be easily enriched and translated, and the system performs well on the considered task. In the future, it could be successfully used to automatically populate the TRIAD database.


Diabetes and Vascular Disease Research | 2018

Risk factors for the development of micro-vascular complications of type 2 diabetes in a single-centre cohort of patients:

Marsida Teliti; Giulia Cogni; Lucia Sacchi; Arianna Dagliati; Simone Marini; Valentina Tibollo; Pasquale De Cata; Riccardo Bellazzi; Luca Chiovato

Aims: In type 2 diabetes, we aimed at clarifying the role of glycated haemoglobin variability and other risk factors in the development of the main micro-vascular complications: peripheral neuropathy, nephropathy and retinopathy. Methods: In a single-centre cohort of 900 patients, glycated haemoglobin variability was evaluated as intra-individual standard deviation, adjusted standard deviation and coefficient of variation of serially measured glycated haemoglobin in the 2-year period before a randomly selected index visit. We devised four models considering different aspects of glycated haemoglobin evolution. Multivariate stepwise logistic regression analysis was performed including the following covariates at the index visit: age, disease duration, body mass index, total cholesterol, high-density lipoprotein cholesterol, triglycerides, sex, smoking habit, hypertension, dyslipidemia, treatment with anti-diabetic drugs, occurrence of macro-vascular events and the presence of another micro-vascular complication. Results: Males with high mean glycated haemoglobin, long duration of diabetes, presence of macro-vascular events and retinopathy emerged at higher risk for peripheral neuropathy. Development of nephropathy was independently associated with higher glycated haemoglobin variability, older age, male sex, current smoking status, presence of retinopathy, of peripheral neuropathy and of hypertension. Higher mean glycated haemoglobin, younger age, longer duration of diabetes, reduced estimated glomerular filtration rate and the presence of peripheral neuropathy were significantly associated with increased incidence of retinopathy. Conclusion: Glycated haemoglobin variability was associated with increased incidence of nephropathy, while mean glycated haemoglobin emerged as independent risk factor for the development of retinopathy and peripheral neuropathy. The presence of macro-vascular events was positively correlated with peripheral neuropathy. Finally, the occurrence of another micro-vascular complication was found to be a stronger risk factor for developing another micro-vascular complication than the mean or variability of glycated haemoglobin.

Collaboration


Dive into the Valentina Tibollo's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge