Paola Sebastiani | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Paola Sebastiani is active.

Explore More

Publication

Featured researches published by Paola Sebastiani.

Nature Medicine | 2007

Airway epithelial gene expression in the diagnostic evaluation of smokers with suspect lung cancer

Avrum Spira; Jennifer Beane; Vishal Shah; Katrina Steiling; Gang Liu; Frank Schembri; Sean Gilman; Yves-Martine Dumas; Paul Calner; Paola Sebastiani; Sriram Sridhar; John F. Beamis; Carla Lamb; Timothy Anderson; Norman P. Gerry; Joseph Keane; Marc E. Lenburg; Jerome S. Brody

Lung cancer is the leading cause of death from cancer in the US and the world. The high mortality rate (80–85% within 5 years) results, in part, from a lack of effective tools to diagnose the disease at an early stage. Given that cigarette smoke creates a field of injury throughout the airway, we sought to determine if gene expression in histologically normal large-airway epithelial cells obtained at bronchoscopy from smokers with suspicion of lung cancer could be used as a lung cancer biomarker. Using a training set (n = 77) and gene-expression profiles from Affymetrix HG-U133A microarrays, we identified an 80-gene biomarker that distinguishes smokers with and without lung cancer. We tested the biomarker on an independent test set (n = 52), with an accuracy of 83% (80% sensitive, 84% specific), and on an additional validation set independently obtained from five medical centers (n = 35). Our biomarker had ∼90% sensitivity for stage 1 cancer across all subjects. Combining cytopathology of lower airway cells obtained at bronchoscopy with the biomarker yielded 95% sensitivity and a 95% negative predictive value. These findings indicate that gene expression in cytologically normal large-airway epithelial cells can serve as a lung cancer biomarker, potentially owing to a cancer-specific airway-wide response to cigarette smoke.

Proceedings of the National Academy of Sciences of the United States of America | 2002

Cluster analysis of gene expression dynamics

Marco F. Ramoni; Paola Sebastiani; Isaac S. Kohane

This article presents a Bayesian method for model-based clustering of gene expression dynamics. The method represents gene-expression dynamics as autoregressive equations and uses an agglomerative procedure to search for the most probable set of clusters given the available data. The main contributions of this approach are the ability to take into account the dynamic nature of gene expression time series during clustering and a principled way to identify the number of distinct clusters. As the number of possible clustering models grows exponentially with the number of observed time series, we have devised a distance-based heuristic search procedure able to render the search process feasible. In this way, the method retains the important visualization capability of traditional distance-based clustering and acquires an independent, principled measure to decide when two series are different enough to belong to different clusters. The reliance of this method on an explicit statistical representation of gene expression dynamics makes it possible to use standard statistical techniques to assess the goodness of fit of the resulting model and validate the underlying assumptions. A set of gene-expression time series, collected to study the response of human fibroblasts to serum, is used to identify the properties of the method.

Nature Genetics | 2005

Genetic dissection and prognostic modeling of overt stroke in sickle cell anemia

Paola Sebastiani; Marco F. Ramoni; Vikki G. Nolan; Clinton T. Baldwin; Martin H. Steinberg

Sickle cell anemia (SCA) is a paradigmatic single gene disorder caused by homozygosity with respect to a unique mutation at the β-globin locus. SCA is phenotypically complex, with different clinical courses ranging from early childhood mortality to a virtually unrecognized condition. Overt stroke is a severe complication affecting 6–8% of individuals with SCA. Modifier genes might interact to determine the susceptibility to stroke, but such genes have not yet been identified. Using Bayesian networks, we analyzed 108 SNPs in 39 candidate genes in 1,398 individuals with SCA. We found that 31 SNPs in 12 genes interact with fetal hemoglobin to modulate the risk of stroke. This network of interactions includes three genes in the TGF-β pathway and SELP, which is associated with stroke in the general population. We validated this model in a different population by predicting the occurrence of stroke in 114 individuals with 98.2% accuracy.

Journal of the American Medical Informatics Association | 2003

Implementing Syndromic Surveillance: A Practical Guide Informed by the Early Experience

Kenneth D. Mandl; J. Marc Overhage; Michael M. Wagner; William B. Lober; Paola Sebastiani; Farzad Mostashari; Julie A. Pavlin; Per H. Gesteland; Tracee A. Treadwell; Eileen Koski; Lori Hutwagner; David L. Buckeridge; Raymond D. Aller; Shaun J. Grannis

Syndromic surveillance refers to methods relying on detection of individual and population health indicators that are discernible before confirmed diagnoses are made. In particular, prior to the laboratory confirmation of an infectious disease, ill persons may exhibit behavioral patterns, symptoms, signs, or laboratory findings that can be tracked through a variety of data sources. Syndromic surveillance systems are being developed locally, regionally, and nationally. The efforts have been largely directed at facilitating the early detection of a covert bioterrorist attack, but the technology may also be useful for general public health, clinical medicine, quality improvement, patient safety, and research. This report, authored by developers and methodologists involved in the design and deployment of the first wave of syndromic surveillance systems, is intended to serve as a guide for informaticians, public health managers, and practitioners who are currently planning deployment of such systems in their regions.

PLOS ONE | 2012

Genetic Signatures of Exceptional Longevity in Humans

Paola Sebastiani; Nadia Solovieff; Andrew T. DeWan; Kyle M. Walsh; Annibale Alessandro Puca; Stephen W. Hartley; Efthymia Melista; Stacy L. Andersen; Daniel A. Dworkis; Jemma B. Wilk; Richard H. Myers; Martin H. Steinberg; Monty Montano; Clinton T. Baldwin; Josephine Hoh; Thomas T. Perls

Like most complex phenotypes, exceptional longevity is thought to reflect a combined influence of environmental (e.g., lifestyle choices, where we live) and genetic factors. To explore the genetic contribution, we undertook a genome-wide association study of exceptional longevity in 801 centenarians (median age at death 104 years) and 914 genetically matched healthy controls. Using these data, we built a genetic model that includes 281 single nucleotide polymorphisms (SNPs) and discriminated between cases and controls of the discovery set with 89% sensitivity and specificity, and with 58% specificity and 60% sensitivity in an independent cohort of 341 controls and 253 genetically matched nonagenarians and centenarians (median age 100 years). Consistent with the hypothesis that the genetic contribution is largest with the oldest ages, the sensitivity of the model increased in the independent cohort with older and older ages (71% to classify subjects with an age at death>102 and 85% to classify subjects with an age at death>105). For further validation, we applied the model to an additional, unmatched 60 centenarians (median age 107 years) resulting in 78% sensitivity, and 2863 unmatched controls with 61% specificity. The 281 SNPs include the SNP rs2075650 in TOMM40/APOE that reached irrefutable genome wide significance (posterior probability of association = 1) and replicated in the independent cohort. Removal of this SNP from the model reduced the accuracy by only 1%. Further in-silico analysis suggests that 90% of centenarians can be grouped into clusters characterized by different “genetic signatures” of varying predictive values for exceptional longevity. The correlation between 3 signatures and 3 different life spans was replicated in the combined replication sets. The different signatures may help dissect this complex phenotype into sub-phenotypes of exceptional longevity.

Blood | 2011

Fetal hemoglobin in sickle cell anemia

Idowu Akinsheye; Abdulrahman Alsultan; Nadia Solovieff; Duyen Ngo; Clinton T. Baldwin; Paola Sebastiani; David H.K. Chui; Martin H. Steinberg

Fetal hemoglobin (HbF) is the major genetic modulator of the hematologic and clinical features of sickle cell disease, an effect mediated by its exclusion from the sickle hemoglobin polymer. Fetal hemoglobin genes are genetically regulated, and the level of HbF and its distribution among sickle erythrocytes is highly variable. Some patients with sickle cell disease have exceptionally high levels of HbF that are associated with the Senegal and Saudi-Indian haplotype of the HBB-like gene cluster; some patients with different haplotypes can have similarly high HbF. In these patients, high HbF is associated with generally milder but not asymptomatic disease. Studying these persons might provide additional insights into HbF gene regulation. HbF appears to benefit some complications of disease more than others. This might be related to the premature destruction of erythrocytes that do not contain HbF, even though the total HbF concentration is high. Recent insights into HbF regulation have spurred new efforts to induce high HbF levels in sickle cell disease beyond those achievable with the current limited repertory of HbF inducers.

Proceedings of the National Academy of Sciences of the United States of America | 2003

Minimal haplotype tagging

Paola Sebastiani; Ross Lazarus; Scott T. Weiss; Louis M. Kunkel; Isaac S. Kohane; Marco F. Ramoni

The high frequency of single-nucleotide polymorphisms (SNPs) in the human genome presents an unparalleled opportunity to track down the genetic basis of common diseases. At the same time, the sheer number of SNPs also makes unfeasible genomewide disease association studies. The haplotypic nature of the human genome, however, lends itself to the selection of a parsimonious set of SNPs, called haplotype tagging SNPs (htSNPs), able to distinguish the haplotypic variations in a population. Current approaches rely on statistical analysis of transmission rates to identify htSNPs. In contrast to these approximate methods, this contribution describes an exact, analytical, and lossless method, called BEST (Best Enumeration of SNP Tags), able to identify the minimum set of SNPs tagging an arbitrary set of haplotypes from either pedigree or independent samples. Our results confirm that a small proportion of SNPs is sufficient to capture the haplotypic variations in a population and that this proportion decreases exponentially as the haplotype length increases. We used BEST to tag the haplotypes of 105 genes in an African-American and a European-American sample. An interesting finding of this analysis is that the vast majority (95%) of the htSNPs in the European-American sample is a subset of the htSNPs of the African-American sample. This result seems to provide further evidence that a severe bottleneck occurred during the founding of Europe and the conjectured “Out of Africa” event.

Genome Biology | 2007

Reversible and permanent effects of tobacco smoke exposure on airway epithelial gene expression

Jennifer Beane; Paola Sebastiani; Gang Liu; Jerome S. Brody; Marc E. Lenburg; Avrum Spira

BackgroundTobacco use remains the leading preventable cause of death in the US. The risk of dying from smoking-related diseases remains elevated for former smokers years after quitting. The identification of irreversible effects of tobacco smoke on airway gene expression may provide insights into the causes of this elevated risk.ResultsUsing oligonucleotide microarrays, we measured gene expression in large airway epithelial cells obtained via bronchoscopy from never, current, and former smokers (n = 104). Linear models identified 175 genes differentially expressed between current and never smokers, and classified these as irreversible (n = 28), slowly reversible (n = 6), or rapidly reversible (n = 139) based on their expression in former smokers. A greater percentage of irreversible and slowly reversible genes were down-regulated by smoking, suggesting possible mechanisms for persistent changes, such as allelic loss at 16q13. Similarities with airway epithelium gene expression changes caused by other environmental exposures suggest that common mechanisms are involved in the response to tobacco smoke. Finally, using irreversible genes, we built a biomarker of ever exposure to tobacco smoke capable of classifying an independent set of former and current smokers with 81% and 100% accuracy, respectively.ConclusionWe have categorized smoking-related changes in airway gene expression by their degree of reversibility upon smoking cessation. Our findings provide insights into the mechanisms leading to reversible and persistent effects of tobacco smoke that may explain former smokers increased risk for developing tobacco-induced lung disease and provide novel targets for chemoprophylaxis. Airway gene expression may also serve as a sensitive biomarker to identify individuals with past exposure to tobacco smoke.

Journal of Leukocyte Biology | 2012

BET bromodomain inhibition as a novel strategy for reactivation of HIV-1

Camellia Banerjee; Nancie M. Archin; Daniel Michaels; Anna C. Belkina; Gerald V. Denis; James E. Bradner; Paola Sebastiani; David M. Margolis; Monty Montano

The persistence of latent HIV‐1 remains a major challenge in therapeutic efforts to eradicate infection. We report the capacity for HIV reactivation by a selective small molecule inhibitor of BET family bromodomains, JQ1, a promising therapeutic agent with antioncogenic properties. JQ1 reactivated HIV transcription in models of latent T cell infection and latent monocyte infection. We also tested the effect of exposure to JQ1 to allow recovery of replication‐competent HIV from pools of resting CD4+ T cells isolated from HIV‐infected, ART‐treated patients. In one of three patients, JQ1 allowed recovery of virus at a frequency above unstimulated conditions. JQ1 potently suppressed T cell proliferation with minimal cytotoxic effect. Transcriptional profiling of T cells with JQ1 showed potent down‐regulation of T cell activation genes, including CD3, CD28, and CXCR4, similar to HDAC inhibitors, but JQ1 also showed potent up‐regulation of chromatin modification genes, including SIRT1, HDAC6, and multiple lysine demethylases (KDMs). Thus, JQ1 reactivates HIV‐1 while suppressing T cell activation genes and up‐regulating histone modification genes predicted to favor increased Tat activity. Thus, JQ1 may be useful in studies of potentially novel mechanisms for transcriptional control as well as in translational efforts to identify therapeutic molecules to achieve viral eradication.

Machine Learning | 2002

Bayesian Clustering by Dynamics

Marco F. Ramoni; Paola Sebastiani; Paul R. Cohen

This paper introduces a Bayesian method for clustering dynamic processes. The method models dynamics as Markov chains and then applies an agglomerative clustering procedure to discover the most probable set of clusters capturing different dynamics. To increase efficiency, the method uses an entropy-based heuristic search strategy. A controlled experiment suggests that the method is very accurate when applied to artificial time series in a broad range of conditions and, when applied to clustering sensor data from mobile robots, it produces clusters that are meaningful in the domain of application.

Explore More