Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Luke V. Rasmussen is active.

Publication


Featured researches published by Luke V. Rasmussen.


Nature Biotechnology | 2013

Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data

Joshua C. Denny; Marylyn D. Ritchie; Robert J. Carroll; Raquel Zink; Jonathan D. Mosley; Julie R. Field; Jill M. Pulley; Andrea H. Ramirez; Erica Bowton; Melissa A. Basford; David Carrell; Peggy L. Peissig; Abel N. Kho; Jennifer A. Pacheco; Luke V. Rasmussen; David R. Crosslin; Paul K. Crane; Jyotishman Pathak; Suzette J. Bielinski; Sarah A. Pendergrass; Hua Xu; Lucia A. Hindorff; Rongling Li; Teri A. Manolio; Christopher G. Chute; Rex L. Chisholm; Eric B. Larson; Gail P. Jarvik; Murray H. Brilliant; Catherine A. McCarty

Candidate gene and genome-wide association studies (GWAS) have identified genetic variants that modulate risk for human disease; many of these associations require further study to replicate the results. Here we report the first large-scale application of the phenome-wide association study (PheWAS) paradigm within electronic medical records (EMRs), an unbiased approach to replication and discovery that interrogates relationships between targeted genotypes and multiple phenotypes. We scanned for associations between 3,144 single-nucleotide polymorphisms (previously implicated by GWAS as mediators of human traits) and 1,358 EMR-derived phenotypes in 13,835 individuals of European ancestry. This PheWAS replicated 66% (51/77) of sufficiently powered prior GWAS associations and revealed 63 potentially pleiotropic associations with P < 4.6 × 10−6 (false discovery rate < 0.1); the strongest of these novel associations were replicated in an independent cohort (n = 7,406). These findings validate PheWAS as a tool to allow unbiased interrogation across multiple phenotypes in EMR-based cohorts and to enhance analysis of the genomic basis of human disease.


Science Translational Medicine | 2011

Electronic Medical Records for Genetic Research: Results of the eMERGE Consortium

Abel N. Kho; Jennifer A. Pacheco; Peggy L. Peissig; Luke V. Rasmussen; Katherine M. Newton; Noah Weston; Paul K. Crane; Jyotishman Pathak; Christopher G. Chute; Suzette J. Bielinski; Iftikhar J. Kullo; Rongling Li; Teri A. Manolio; Rex L. Chisholm; Joshua C. Denny

Clinical data captured in electronic medical records accurately identify cases and controls for genome-wide association studies. Where Electronic Records and Genomics Meet There has been a surge of interest in using electronic medical records in hospitals and clinics to capture information about patients that is normally buried in doctors’ handwritten notes. Indeed, the U.S. government has made the implementation of electronic medical records a priority area and has instigated standards for the recording and use of these records. The clinical data captured in electronic medical records including diagnoses, medical tests, and medications provide accurate clinical information that will improve patient care. With the ability to sequence the genomes of individuals faster and cheaper than ever before, it may be possible in the future to include the genome sequences of patients in their electronic medical records. A consortium called the Electronic Medical Records and Genomics Network (eMERGE) has set out to investigate whether clinical data captured in electronic medical records could be used to accurately identify patients with particular diseases for inclusion in genome-wide association studies (GWAS). GWAS scrutinize the genomes of individuals with particular diseases to identify tiny genetic variations that are associated with the risk of developing that disease. Here, the eMERGE consortium reports its study of the electronic medical records from five clinical centers and how accurately it identified patients with one of five diseases: dementia, cataracts, peripheral arterial disease, type 2 diabetes, and cardiac conduction defects. The investigators show that even though the electronic medical records were of different types and did not all use natural language processing to extract information from the records, they were able to obtain robust positive and negative values for identifying patients with these diseases with sufficient accuracy for use in GWAS. They conclude that widespread adoption of electronic medical records will provide real-world clinical data that will be valuable for GWAS and other types of genetic research. Clinical data in electronic medical records (EMRs) are a potential source of longitudinal clinical data for research. The Electronic Medical Records and Genomics Network (eMERGE) investigates whether data captured through routine clinical care using EMRs can identify disease phenotypes with sufficient positive and negative predictive values for use in genome-wide association studies (GWAS). Using data from five different sets of EMRs, we have identified five disease phenotypes with positive predictive values of 73 to 98% and negative predictive values of 98 to 100%. Most EMRs captured key information (diagnoses, medications, laboratory tests) used to define phenotypes in a structured format. We identified natural language processing as an important tool to improve case identification rates. Efforts and incentives to increase the implementation of interoperable EMRs will markedly improve the availability of clinical data for genomics research.


Journal of the American Medical Informatics Association | 2013

Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network

Katherine M. Newton; Peggy L. Peissig; Abel N. Kho; Suzette J. Bielinski; Richard L. Berg; Vidhu Choudhary; Melissa A. Basford; Christopher G. Chute; Iftikhar J. Kullo; Rongling Li; Jennifer A. Pacheco; Luke V. Rasmussen; Leslie Spangler; Joshua C. Denny

BACKGROUND Genetic studies require precise phenotype definitions, but electronic medical record (EMR) phenotype data are recorded inconsistently and in a variety of formats. OBJECTIVE To present lessons learned about validation of EMR-based phenotypes from the Electronic Medical Records and Genomics (eMERGE) studies. MATERIALS AND METHODS The eMERGE network created and validated 13 EMR-derived phenotype algorithms. Network sites are Group Health, Marshfield Clinic, Mayo Clinic, Northwestern University, and Vanderbilt University. RESULTS By validating EMR-derived phenotypes we learned that: (1) multisite validation improves phenotype algorithm accuracy; (2) targets for validation should be carefully considered and defined; (3) specifying time frames for review of variables eases validation time and improves accuracy; (4) using repeated measures requires defining the relevant time period and specifying the most meaningful value to be studied; (5) patient movement in and out of the health plan (transience) can result in incomplete or fragmented data; (6) the review scope should be defined carefully; (7) particular care is required in combining EMR and research data; (8) medication data can be assessed using claims, medications dispensed, or medications prescribed; (9) algorithm development and validation work best as an iterative process; and (10) validation by content experts or structured chart review can provide accurate results. CONCLUSIONS Despite the diverse structure of the five EMRs of the eMERGE sites, we developed, validated, and successfully deployed 13 electronic phenotype algorithms. Validation is a worthwhile process that not only measures phenotype performance but also strengthens phenotype algorithm definitions and enhances their inter-institutional sharing.


Current protocols in human genetics | 2011

Quality Control Procedures for Genome‐Wide Association Studies

Stephen D. Turner; Loren L. Armstrong; Yuki Bradford; Christopher S. Carlson; Dana C. Crawford; Andrew Crenshaw; Mariza de Andrade; Kimberly F. Doheny; Jonathan L. Haines; Geoffrey Hayes; Gail P. Jarvik; Lan Jiang; Iftikhar J. Kullo; Rongling Li; Hua Ling; Teri A. Manolio; Martha E. Matsumoto; Catherine A. McCarty; Andrew McDavid; Daniel B. Mirel; Justin Paschall; Elizabeth W. Pugh; Luke V. Rasmussen; Russell A. Wilke; Rebecca L. Zuvich; Marylyn D. Ritchie

Genome‐wide association studies (GWAS) are being conducted at an unprecedented rate in population‐based cohorts and have increased our understanding of the pathophysiology of complex disease. Regardless of context, the practical utility of this information will ultimately depend upon the quality of the original data. Quality control (QC) procedures for GWAS are computationally intensive, operationally challenging, and constantly evolving. Here we enumerate some of the challenges in QC of GWAS data and describe the approaches that the electronic MEdical Records and Genomics (eMERGE) network is using for quality assurance in GWAS data, thereby minimizing potential bias and error in GWAS results. We discuss common issues associated with QC of GWAS data, including data file formats, software packages for data manipulation and analysis, sex chromosome anomalies, sample identity, sample relatedness, population substructure, batch effects, and marker quality. We propose best practices and discuss areas of ongoing and future research. Curr. Protoc. Hum. Genet. 68:1.19.1‐1.19.18


Clinical Pharmacology & Therapeutics | 2014

Design and anticipated outcomes of the eMERGE-PGx project: a multicenter pilot for preemptive pharmacogenomics in electronic health record systems.

Laura J. Rasmussen-Torvik; Sarah Stallings; Adam S. Gordon; Berta Almoguera; Melissa A. Basford; Suzette J. Bielinski; Ariel Brautbar; Murray H. Brilliant; David Carrell; John J. Connolly; David R. Crosslin; Kimberly F. Doheny; Carlos J. Gallego; Omri Gottesman; Daniel Seung Kim; Kathleen A. Leppig; Rongling Li; Simon Lin; Shannon Manzi; Ana R. Mejia; Jennifer A. Pacheco; Vivian Pan; Jyotishman Pathak; Cassandra Perry; Josh F. Peterson; Cynthia A. Prows; James D. Ralston; Luke V. Rasmussen; Marylyn D. Ritchie; Senthilkumar Sadhasivam

We describe here the design and initial implementation of the eMERGE‐PGx project. eMERGE‐PGx, a partnership of the Electronic Medical Records and Genomics Network and the Pharmacogenomics Research Network, has three objectives: (i) to deploy PGRNseq, a next‐generation sequencing platform assessing sequence variation in 84 proposed pharmacogenes, in nearly 9,000 patients likely to be prescribed drugs of interest in a 1‐ to 3‐year time frame across several clinical sites; (ii) to integrate well‐established clinically validated pharmacogenetic genotypes into the electronic health record with associated clinical decision support and to assess process and clinical outcomes of implementation; and (iii) to develop a repository of pharmacogenetic variants of unknown significance linked to a repository of electronic health record–based clinical phenotype data for ongoing pharmacogenomics discovery. We describe site‐specific project implementation and anticipated products, including genetic variant and phenotype data repositories, novel variant association studies, clinical decision support modules, clinical and process outcomes, approaches to managing incidental findings, and patient and clinician education methods.


Circulation | 2013

Genome- and Phenome-Wide Analyses of Cardiac Conduction Identifies Markers of Arrhythmia Risk

Marylyn D. Ritchie; Joshua C. Denny; Rebecca L. Zuvich; Dana C. Crawford; Jonathan S. Schildcrout; Andrea H. Ramirez; Jonathan D. Mosley; Jill M. Pulley; Melissa A. Basford; Yuki Bradford; Luke V. Rasmussen; Jyotishman Pathak; Christopher G. Chute; Iftikhar J. Kullo; Catherine A. McCarty; Rex L. Chisholm; Abel N. Kho; Christopher S. Carlson; Eric B. Larson; Gail P. Jarvik; Nona Sotoodehnia; Teri A. Manolio; Rongling Li; Daniel R. Masys; Jonathan L. Haines; Dan M. Roden

Background— ECG QRS duration, a measure of cardiac intraventricular conduction, varies ≈2-fold in individuals without cardiac disease. Slow conduction may promote re-entrant arrhythmias. Methods and Results— We performed a genome-wide association study to identify genomic markers of QRS duration in 5272 individuals without cardiac disease selected from electronic medical record algorithms at 5 sites in the Electronic Medical Records and Genomics (eMERGE) network. The most significant loci were evaluated within the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium QRS genome-wide association study meta-analysis. Twenty-three single-nucleotide polymorphisms in 5 loci, previously described by CHARGE, were replicated in the eMERGE samples; 18 single-nucleotide polymorphisms were in the chromosome 3 SCN5A and SCN10A loci, where the most significant single-nucleotide polymorphisms were rs1805126 in SCN5A with P=1.2×10−8 (eMERGE) and P=2.5×10−20 (CHARGE) and rs6795970 in SCN10A with P=6×10−6 (eMERGE) and P=5×10−27 (CHARGE). The other loci were in NFIA, near CDKN1A, and near C6orf204. We then performed phenome-wide association studies on variants in these 5 loci in 13859 European Americans to search for diagnoses associated with these markers. Phenome-wide association study identified atrial fibrillation and cardiac arrhythmias as the most common associated diagnoses with SCN10A and SCN5A variants. SCN10A variants were also associated with subsequent development of atrial fibrillation and arrhythmia in the original 5272 “heart-healthy” study population. Conclusions— We conclude that DNA biobanks coupled to electronic medical records not only provide a platform for genome-wide association study but also may allow broad interrogation of the longitudinal incidence of disease associated with genetic variants. The phenome-wide association study approach implicated sodium channel variants modulating QRS duration in subjects without cardiac disease as predictors of subsequent arrhythmias.


Journal of the American Medical Informatics Association | 2012

Importance of multi-modal approaches to effectively identify cataract cases from electronic health records

Peggy L. Peissig; Luke V. Rasmussen; Richard L. Berg; James G. Linneman; Catherine A. McCarty; Carol Waudby; Lin Chen; Joshua C. Denny; Russell A. Wilke; Jyotishman Pathak; David Carrell; Abel N. Kho; Justin Starren

OBJECTIVE There is increasing interest in using electronic health records (EHRs) to identify subjects for genomic association studies, due in part to the availability of large amounts of clinical data and the expected cost efficiencies of subject identification. We describe the construction and validation of an EHR-based algorithm to identify subjects with age-related cataracts. MATERIALS AND METHODS We used a multi-modal strategy consisting of structured database querying, natural language processing on free-text documents, and optical character recognition on scanned clinical images to identify cataract subjects and related cataract attributes. Extensive validation on 3657 subjects compared the multi-modal results to manual chart review. The algorithm was also implemented at participating electronic MEdical Records and GEnomics (eMERGE) institutions. RESULTS An EHR-based cataract phenotyping algorithm was successfully developed and validated, resulting in positive predictive values (PPVs) >95%. The multi-modal approach increased the identification of cataract subject attributes by a factor of three compared to single-mode approaches while maintaining high PPV. Components of the cataract algorithm were successfully deployed at three other institutions with similar accuracy. DISCUSSION A multi-modal strategy incorporating optical character recognition and natural language processing may increase the number of cases identified while maintaining similar PPVs. Such algorithms, however, require that the needed information be embedded within clinical documents. CONCLUSION We have demonstrated that algorithms to identify and characterize cataracts can be developed utilizing data collected via the EHR. These algorithms provide a high level of accuracy even when implemented across multiple EHRs and institutional boundaries.


Preventing Chronic Disease | 2012

Construction of a Multisite DataLink Using Electronic Health Records for the Identification, Surveillance, Prevention, and Management of Diabetes Mellitus: The SUPREME-DM Project

Gregory A. Nichols; Jay Desai; Jennifer Elston Lafata; Jean M. Lawrence; Patrick J. O'Connor; Ram D. Pathak; Marsha A. Raebel; Robert J. Reid; Joseph V. Selby; Barbara G. Silverman; John F. Steiner; W. F. Stewart; Suma Vupputuri; Beth Waitzfelder; Christina L. Clarke; William T. Donahoo; Glenn K. Goodrich; Andrea R. Paolino; Emily B. Schroeder; Michael Shainline; Stan Xu; Lora Bounds; Gabrielle Gundersen; Katherine M. Newton; Eileen Rillamas-Sun; Brandon Geise; Ronald Harris; Rebecca Stametz; Xiaowei Sherry Yan; Nonna Akkerman

Introduction Electronic health record (EHR) data enhance opportunities for conducting surveillance of diabetes. The objective of this study was to identify the number of people with diabetes from a diabetes DataLink developed as part of the SUPREME-DM (SUrveillance, PREvention, and ManagEment of Diabetes Mellitus) project, a consortium of 11 integrated health systems that use comprehensive EHR data for research. Methods We identified all members of 11 health care systems who had any enrollment from January 2005 through December 2009. For these members, we searched inpatient and outpatient diagnosis codes, laboratory test results, and pharmaceutical dispensings from January 2000 through December 2009 to create indicator variables that could potentially identify a person with diabetes. Using this information, we estimated the number of people with diabetes and among them, the number of incident cases, defined as indication of diabetes after at least 2 years of continuous health system enrollment. Results The 11 health systems contributed 15,765,529 unique members, of whom 1,085,947 (6.9%) met 1 or more study criteria for diabetes. The nonstandardized proportion meeting study criteria for diabetes ranged from 4.2% to 12.4% across sites. Most members with diabetes (88%) met multiple criteria. Of the members with diabetes, 428,349 (39.4%) were incident cases. Conclusion The SUPREME-DM DataLink is a unique resource that provides an opportunity to conduct comparative effectiveness research, epidemiologic surveillance including longitudinal analyses, and population-based care management studies of people with diabetes. It also provides a useful data source for pragmatic clinical trials of prevention or treatment interventions.


Genetic Epidemiology | 2011

Pitfalls of Merging GWAS Data: Lessons Learned in the eMERGE Network and Quality Control Procedures to Maintain High Data Quality

Rebecca L. Zuvich; Loren L. Armstrong; Suzette J. Bielinski; Yuki Bradford; Christopher S. Carlson; Dana C. Crawford; Andrew Crenshaw; Mariza de Andrade; Kimberly F. Doheny; Jonathan L. Haines; M. Geoffrey Hayes; Gail P. Jarvik; Lan Jiang; Iftikhar J. Kullo; Rongling Li; Hua Ling; Teri A. Manolio; Martha E. Matsumoto; Catherine A. McCarty; Andrew McDavid; Daniel B. Mirel; Lana M. Olson; Justin Paschall; Elizabeth W. Pugh; Luke V. Rasmussen; Laura J. Rasmussen-Torvik; Stephen D. Turner; Russell A. Wilke; Marylyn D. Ritchie

Genome‐wide association studies (GWAS) are a useful approach in the study of the genetic components of complex phenotypes. Aside from large cohorts, GWAS have generally been limited to the study of one or a few diseases or traits. The emergence of biobanks linked to electronic medical records (EMRs) allows the efficient reuse of genetic data to yield meaningful genotype–phenotype associations for multiple phenotypes or traits. Phase I of the electronic MEdical Records and GEnomics (eMERGE‐I) Network is a National Human Genome Research Institute‐supported consortium composed of five sites to perform various genetic association studies using DNA repositories and EMR systems. Each eMERGE site has developed EMR‐based algorithms to comprise a core set of 14 phenotypes for extraction of study samples from each sites DNA repository. Each eMERGE site selected samples for a specific phenotype, and these samples were genotyped at either the Broad Institute or at the Center for Inherited Disease Research using the Illumina Infinium BeadChip technology. In all, approximately 17,000 samples from across the five sites were genotyped. A unified quality control (QC) pipeline was developed by the eMERGE Genomics Working Group and used to ensure thorough cleaning of the data. This process includes examination of sample and marker quality and various batch effects. Upon completion of the genotyping and QC analyses for each sites primary study, eMERGE Coordinating Center merged the datasets from all five sites. This larger merged dataset reentered the established eMERGE QC pipeline. Based on lessons learned during the process, additional analyses and QC checkpoints were added to the pipeline to ensure proper merging. Here, we explore the challenges associated with combining datasets from different genotyping centers and describe the expansion to eMERGE QC pipeline for merged datasets. These additional steps will be useful as the eMERGE project expands to include additional sites in eMERGE‐II, and also serve as a starting point for investigators merging multiple genotype datasets accessible through the National Center for Biotechnology Information in the database of Genotypes and Phenotypes. Our experience demonstrates that merging multiple datasets after additional QC can be an efficient use of genotype data despite new challenges that appear in the process. Genet. Epidemiol. 35:887–898, 2011.


Journal of the American Medical Informatics Association | 2016

PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability

Jacqueline Kirby; Peter Speltz; Luke V. Rasmussen; Melissa A. Basford; Omri Gottesman; Peggy L. Peissig; Jennifer A. Pacheco; Gerard Tromp; Jyotishman Pathak; David Carrell; Stephen Ellis; Todd Lingren; William K. Thompson; Guergana Savova; Jonathan L. Haines; Dan M. Roden; Paul A. Harris; Joshua C. Denny

OBJECTIVE Health care generated data have become an important source for clinical and genomic research. Often, investigators create and iteratively refine phenotype algorithms to achieve high positive predictive values (PPVs) or sensitivity, thereby identifying valid cases and controls. These algorithms achieve the greatest utility when validated and shared by multiple health care systems.Materials and Methods We report the current status and impact of the Phenotype KnowledgeBase (PheKB, http://phekb.org), an online environment supporting the workflow of building, sharing, and validating electronic phenotype algorithms. We analyze the most frequent components used in algorithms and their performance at authoring institutions and secondary implementation sites. RESULTS As of June 2015, PheKB contained 30 finalized phenotype algorithms and 62 algorithms in development spanning a range of traits and diseases. Phenotypes have had over 3500 unique views in a 6-month period and have been reused by other institutions. International Classification of Disease codes were the most frequently used component, followed by medications and natural language processing. Among algorithms with published performance data, the median PPV was nearly identical when evaluated at the authoring institutions (n = 44; case 96.0%, control 100%) compared to implementation sites (n = 40; case 97.5%, control 100%). DISCUSSION These results demonstrate that a broad range of algorithms to mine electronic health record data from different health systems can be developed with high PPV, and algorithms developed at one site are generally transportable to others. CONCLUSION By providing a central repository, PheKB enables improved development, transportability, and validity of algorithms for research-grade phenotypes using health care generated data.

Collaboration


Dive into the Luke V. Rasmussen's collaboration.

Top Co-Authors

Avatar

Joshua C. Denny

Vanderbilt University Medical Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Abel N. Kho

Northwestern University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge