Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Daniel R. Harris is active.

Publication


Featured researches published by Daniel R. Harris.


PLOS Genetics | 2013

Plant-symbiotic fungi as chemical engineers: multi-genome analysis of the clavicipitaceae reveals dynamics of alkaloid loci

Christopher L. Schardl; Carolyn A. Young; Uljana Hesse; Stefan G. Amyotte; Kalina Andreeva; Patrick J. Calie; Damien J. Fleetwood; David Haws; Neil Moore; Birgitt Oeser; Daniel G. Panaccione; Kathryn Schweri; Christine R. Voisey; Mark L. Farman; Jerzy W. Jaromczyk; Bruce A. Roe; Donal M. O'Sullivan; Barry Scott; Paul Tudzynski; Zhiqiang An; Elissaveta G. Arnaoudova; Charles T. Bullock; Nikki D. Charlton; Li Chen; Murray P. Cox; Randy D. Dinkins; Simona Florea; Anthony E. Glenn; Anna Gordon; Ulrich Güldener

The fungal family Clavicipitaceae includes plant symbionts and parasites that produce several psychoactive and bioprotective alkaloids. The family includes grass symbionts in the epichloae clade (Epichloë and Neotyphodium species), which are extraordinarily diverse both in their host interactions and in their alkaloid profiles. Epichloae produce alkaloids of four distinct classes, all of which deter insects, and some—including the infamous ergot alkaloids—have potent effects on mammals. The exceptional chemotypic diversity of the epichloae may relate to their broad range of host interactions, whereby some are pathogenic and contagious, others are mutualistic and vertically transmitted (seed-borne), and still others vary in pathogenic or mutualistic behavior. We profiled the alkaloids and sequenced the genomes of 10 epichloae, three ergot fungi (Claviceps species), a morning-glory symbiont (Periglandula ipomoeae), and a bamboo pathogen (Aciculosporium take), and compared the gene clusters for four classes of alkaloids. Results indicated a strong tendency for alkaloid loci to have conserved cores that specify the skeleton structures and peripheral genes that determine chemical variations that are known to affect their pharmacological specificities. Generally, gene locations in cluster peripheries positioned them near to transposon-derived, AT-rich repeat blocks, which were probably involved in gene losses, duplications, and neofunctionalizations. The alkaloid loci in the epichloae had unusual structures riddled with large, complex, and dynamic repeat blocks. This feature was not reflective of overall differences in repeat contents in the genomes, nor was it characteristic of most other specialized metabolism loci. The organization and dynamics of alkaloid loci and abundant repeat blocks in the epichloae suggested that these fungi are under selection for alkaloid diversification. We suggest that such selection is related to the variable life histories of the epichloae, their protective roles as symbionts, and their associations with the highly speciose and ecologically diverse cool-season grasses.


canadian conference on artificial intelligence | 2013

Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques

Ramakanth Kavuluru; Sifei Han; Daniel R. Harris

Diagnosis codes are extracted from medical records for billing and reimbursement and for secondary uses such as quality control and cohort identification. In the US, these codes come from the standard terminology ICD-9-CM derived from the international classification of diseases (ICD). ICD-9 codes are generally extracted by trained human coders by reading all artifacts available in a patients medical record following specific coding guidelines. To assist coders in this manual process, this paper proposes an unsupervised ensemble approach to automatically extract ICD-9 diagnosis codes from textual narratives included in electronic medical records (EMRs). Earlier attempts on automatic extraction focused on individual documents such as radiology reports and discharge summaries. Here we use a more realistic dataset and extract ICD-9 codes from EMRs of 1000 inpatient visits at the University of Kentucky Medical Center. Using named entity recognition (NER), graph-based concept-mapping of medical concepts, and extractive text summarization techniques, we achieve an example based average recall of 0.42 with average precision 0.47; compared with a baseline of using only NER, we notice a 12% improvement in recall with the graph-based approach and a 7% improvement in precision using the extractive text summarization approach. Although diagnosis codes are complex concepts often expressed in text with significant long range non-local dependencies, our present work shows the potential of unsupervised methods in extracting a portion of codes. As such, our findings are especially relevant for code extraction tasks where obtaining large amounts of training data is difficult.


Molecular Plant-microbe Interactions | 2017

Host Tissue Environment Directs Activities of an Epichloë Endophyte, While It Induces Systemic Hormone and Defense Responses in Its Native Perennial Ryegrass Host

Jan Schmid; Robert B. Day; Ningxin Zhang; Pierre-Yves Dupont; Murray P. Cox; Christopher L. Schardl; Niki Minards; Mauro Truglio; Neil Moore; Daniel R. Harris; Yanfei Zhou

Increased resilience of pasture grasses mediated by fungal Epichloë endophytes is crucial to pastoral industries. The underlying mechanisms are only partially understood and likely involve very different activities of the endophyte in different plant tissues and responses of the plant to these. We analyzed the transcriptomes of Epichloë festucae and its host, Lolium perenne, in host tissues of different function and developmental stages. The endophyte contributed approximately 10× more to the transcriptomes than to the biomass of infected tissues. Proliferating mycelium in growing host tissues highly expressed genes involved in hyphal growth. Nonproliferating mycelium in mature plant tissues, transcriptionally equally active, highly expressed genes involved in synthesizing antiherbivore compounds. Transcripts from the latter accounted for 4% of fungal transcripts. Endophyte infection systemically but moderately increased transcription of L. perenne genes with roles in hormone biosynthesis and perception as well as stress and pathogen resistance while reducing expression of genes involved in photosynthesis. There was a good correlation between transcriptome-based observations and physiological observations. Our data indicate that the fitness-enhancing effects of the endophyte are based both on its biosynthetic activities, predominantly in mature host tissues, and also on systemic alteration of the hosts hormonal responses and induction of stress response genes. [Formula: see text] Copyright


Information Systems Frontiers | 2016

Foundations of reusable and interoperable facet models using category theory

Daniel R. Harris

Faceted browsing has become ubiquitous with modern digital libraries and online search engines, yet the process is still difficult to abstractly model in a manner that supports the development of interoperable and reusable interfaces. We propose category theory as a theoretical foundation for faceted browsing and demonstrate how the interactive process can be mathematically abstracted. Existing efforts in facet modeling are based upon set theory, formal concept analysis, and light-weight ontologies, but in many regards, they are implementations of faceted browsing rather than a specification of the basic, underlying structures and interactions. We will demonstrate that category theory allows us to specify faceted objects and study the relationships and interactions within a faceted browsing system. Resulting implementations can then be constructed through a category-theoretic lens using these models, allowing abstract comparison and communication that naturally support interoperability and reuse.


information reuse and integration | 2015

Modeling Reusable and Interoperable Faceted Browsing Systems with Category Theory

Daniel R. Harris

Faceted browsing has become ubiquitous with modern digital libraries and online search engines, yet the process is still difficult to abstractly model in a manner that supports the development of interoperable and reusable interfaces. We propose category theory as a theoretical foundation for faceted browsing and demonstrate how the interactive process can be mathematically abstracted. Existing efforts in facet modeling are based upon set theory, formal concept analysis, and light-weight ontologies, but in many regards, they are implementations of faceted browsing rather than a specification of the basic, underlying structures and interactions. We will demonstrate that category theory allows us to specify faceted objects and study the relationships and interactions within a faceted browsing system. Implementations can then be constructed through a category-theoretic lens using these models, allowing abstract comparison and communication that naturally support interoperability and reuse.


IEEE Journal of Biomedical and Health Informatics | 2014

Using common table expressions to build a scalable Boolean query generator for clinical data warehouses.

Daniel R. Harris; Darren W. Henderson; Ramakanth Kavuluru; Arnold J. Stromberg; Todd R. Johnson

We present a custom, Boolean query generator utilizing common-table expressions (CTEs) that is capable of scaling with big datasets. The generator maps user-defined Boolean queries, such as those interactively created in clinical-research and general-purpose healthcare tools, into SQL. We demonstrate the effectiveness of this generator by integrating our study into the Informatics for Integrating Biology and the Bedside (i2b2) query tool and show that it is capable of scaling. Our custom generator replaces and outperforms the default query generator found within the Clinical Research Chart cell of i2b2. In our experiments, 16 different types of i2b2 queries were identified by varying four constraints: date, frequency, exclusion criteria, and whether selected concepts occurred in the same encounter. We generated nontrivial, random Boolean queries based on these 16 types; the corresponding SQL queries produced by both generators were compared by execution times. The CTE-based solution significantly outperformed the default query generator and provided a much more consistent response time across all query types (M = 2.03, SD = 6.64 versus M = 75.82, SD = 238.88 s). Without costly hardware upgrades, we provide a scalable solution based on CTEs with very promising empirical results centered on performance gains. The evaluation methodology used for this provides a means of profiling clinical data warehouse performance.


information reuse and integration | 2016

Modeling Integration and Reuse of Heterogeneous Terminologies in Faceted Browsing Systems

Daniel R. Harris

We integrate heterogeneous terminologies into our category-theoretic model of faceted browsing and show that existing terminologies and vocabularies can be reused as facets in a cohesive, interactive system. Commonly found in online search engines and digital libraries, faceted browsing systems depend upon one or more taxonomies which outline the structure and content of the facets available for user interaction. Controlled vocabularies or terminologies are often externally curated and are available as a reusable resource across systems. We demonstrated previously that category theory can abstractly model faceted browsing in a way that supports the development of interfaces capable of reusing and integrating multiple models of faceted browsing. We extend this model by illustrating that terminologies can be reused and integrated as facets across systems with examples from the biomedical domain.


BMC Bioinformatics | 2017

Proceedings of the 16th Annual UT-KBRIN Bioinformatics Summit 2016: bioinformatics

Eric C. Rouchka; Julia H. Chariker; David Tieri; Juw Won Park; Shreedharkumar Rajurkar; Vikas K. Singh; Nishchal K. Verma; Yan Cui; Mark L. Farman; Bradford Condon; Neil Moore; Jerzy W. Jaromczyk; Jolanta Jaromczyk; Daniel R. Harris; Patrick J. Calie; Eun Kyong Shin; Robert L. Davis; Arash Shaban-Nejad; Joshua M. Mitchell; Robert M. Flight; Qing Jun Wang; Richard M. Higashi; Teresa W.-M. Fan; Andrew N. Lane; Hunter N. B. Moseley; Liangqun Lu; Bernie J. Daigle; Xi Chen; Andrey Smelter; Li Chen

I1 Proceedings of the Sixteenth Annual UTKBRIN Bioinformatics Summit 2017 Eric C Rouchka, Julia H Chariker, David A Tieri, Juw Won Park Department of Computer Engineering and Computer Science, University of Louisville, Duthie Center for Engineering, Louisville, KY 40292, USA; Kentucky Biomedical Research Infrastructure (KBRIN) Bioinformatics Core, 522 East Gray Street, Louisville, KY 40292, USA; Department of Psychological and Brain Sciences, University of Louisville, Louisville, KY 40292, USA; Department of Anatomical Sciences and Neurobiology, University of Louisville, Louisville, KY 40292, USA Correspondence: Eric C Rouchka ([email protected]) BMC Bioinformatics 2017, 18(Suppl 9):I1


ieee embs international conference on biomedical and health informatics | 2016

Informatics-based challenges of building collaborative healthcare research and analysis networks from rural community health centers

Daniel R. Harris; Tamela J. Harper; Darren W. Henderson; Keith W. Henry; Jeffrey Talbert

We discuss informatics-based challenges of constructing large-scale collaborative networks for healthcare research and analysis from rural community health centers. These types of networks provide data access and analytic insights across multiple heterogeneous health centers for both healthcare professionals and biomedical researchers. Challenges fall into three general categories: data access, data integration, and technical infrastructure. Data access issues arise in balancing patient privacy, security, and utility; data integration issues persist from each site independently operating its desired electronic medical record; technical infrastructure challenges include creating an analysis and reporting hub capable of scaling across a large collaborative network. Other challenges, such as the difficulty of site recruitment, are important to discuss, but cannot be solved directly through informatics alone. We discuss these challenges and their potential solutions in the context of our implementation of the Kentucky Diabetes and Obesity Collaborative (KDOC). KDOC is a network of Federally-Qualified Community Health Centers (FQHCs) that established a collaborative infrastructure for research and analysis of obesity and diabetes in rural and under-served communities.


BMC Bioinformatics | 2010

Experimenting with database segmentation size vs time performance for mpiBLAST on an IBM HS21 blade cluster

Daniel R. Harris; Jerzy W. Jaromczyk; Christopher L. Schardl

Background Large-scale genomic projects such as the Epichloe festucae Genome Project require regular use of bioinformatic tools. When using BLAST in conjunction with larger databases, processing complex sequences often uses substantial computation time. Parallelization is considered a standard method of curbing extensive computing requirements and parallel implementations of BLAST, such as mpiBLAST, are freely available.

Collaboration


Dive into the Daniel R. Harris's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Neil Moore

University of Kentucky

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Patrick J. Calie

Eastern Kentucky University

View shared research outputs
Top Co-Authors

Avatar

Todd R. Johnson

University of Texas Health Science Center at Houston

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge