David Ervin
Ohio State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by David Ervin.
Journal of the American Medical Informatics Association | 2008
Scott Oster; Stephen Langella; Shannon Hastings; David Ervin; Ravi K. Madduri; Joshua Phillips; Tahsin M. Kurç; Frank Siebenlist; Peter A. Covitz; Krishnakant Shanbhag; Ian T. Foster; Joel H. Saltz
OBJECTIVE To develop software infrastructure that will provide support for discovery, characterization, integrated access, and management of diverse and disparate collections of information sources, analysis methods, and applications in biomedical research. DESIGN An enterprise Grid software infrastructure, called caGrid version 1.0 (caGrid 1.0), has been developed as the core Grid architecture of the NCI-sponsored cancer Biomedical Informatics Grid (caBIG) program. It is designed to support a wide range of use cases in basic, translational, and clinical research, including 1) discovery, 2) integrated and large-scale data analysis, and 3) coordinated study. MEASUREMENTS The caGrid is built as a Grid software infrastructure and leverages Grid computing technologies and the Web Services Resource Framework standards. It provides a set of core services, toolkits for the development and deployment of new community provided services, and application programming interfaces for building client applications. RESULTS The caGrid 1.0 was released to the caBIG community in December 2006. It is built on open source components and caGrid source code is publicly and freely available under a liberal open source license. The core software, associated tools, and documentation can be downloaded from the following URL: https://cabig.nci.nih.gov/workspaces/Architecture/caGrid. CONCLUSIONS While caGrid 1.0 is designed to address use cases in cancer research, the requirements associated with discovery, analysis and integration of large scale data, and coordinated studies are common in other biomedical fields. In this respect, caGrid 1.0 is the realization of a framework that can benefit the entire biomedical community.
Journal of Grid Computing | 2007
Shannon Hastings; Scott Oster; Stephen Langella; David Ervin; Tahsin M. Kurç; Joel H. Saltz
Service-oriented architectures and applications have gained wide acceptance in the Grid computing community. A number of tools and middleware systems have been developed to support application development using Grid Services architectures. Most of these efforts, however, have focused on low-level support for management and execution of Grid services, management of Grid-enabled resources, and deployment and execution of applications that make use of Grid services. Simple-to-use service development tools, which would allow a Grid service developer to leverage Grid technologies without needing to know low-level details, are becoming increasingly important for wider application of the Grid. In this paper, we describe an open-source, extensible toolkit, called Introduce, that supports easy development and deployment of Web Services Resource Framework (WSRF) compliant services. Introduce is designed to reduce the service development and deployment effort by hiding low level details of the Globus Toolkit and to enable the implementation of strongly typed services. In strongly typed services, a service produces and consumes data types that are well-defined and published in the Grid. This enables data-level syntactic interoperability so that clients and services can access and consume data elements programmatically and correctly. We expect that enabling strongly typed Grid services while lowering the difficulty of entry to the Grid via toolkits like Introduce will have a major impact to the success of the Grid and its wider adoption as a viable technology of choice in the commercial sector as well as in academic, medical, and government research.
Journal of the American Medical Informatics Association | 2008
Stephen Langella; Shannon Hastings; Scott Oster; Tony Pan; Ashish Sharma; Justin Permar; David Ervin; Berkant Barla Cambazoglu; Tahsin M. Kurç; Joel H. Saltz
OBJECTIVES To develop a security infrastructure to support controlled and secure access to data and analytical resources in a biomedical research Grid environment, while facilitating resource sharing among collaborators. DESIGN A Grid security infrastructure, called Grid Authentication and Authorization with Reliably Distributed Services (GAARDS), is developed as a key architecture component of the NCI-funded cancer Biomedical Informatics Grid (caBIG). The GAARDS is designed to support in a distributed environment 1) efficient provisioning and federation of user identities and credentials; 2) group-based access control support with which resource providers can enforce policies based on community accepted groups and local groups; and 3) management of a trust fabric so that policies can be enforced based on required levels of assurance. MEASUREMENTS GAARDS is implemented as a suite of Grid services and administrative tools. It provides three core services: Dorian for management and federation of user identities, Grid Trust Service for maintaining and provisioning a federated trust fabric within the Grid environment, and Grid Grouper for enforcing authorization policies based on both local and Grid-level groups. RESULTS The GAARDS infrastructure is available as a stand-alone system and as a component of the caGrid infrastructure. More information about GAARDS can be accessed at http://www.cagrid.org. CONCLUSIONS GAARDS provides a comprehensive system to address the security challenges associated with environments in which resources may be located at different sites, requests to access the resources may cross institutional boundaries, and user credentials are created, managed, revoked dynamically in a de-centralized manner.
IEEE Computer | 2008
Joel H. Saltz; Tahsin M. Kurç; Shannon Hastings; Stephen Langella; Scott Oster; David Ervin; Ashish Sharma; Tony Pan; Metin N. Gurcan; Justin Permar; Renato Ferreira; Philip R. O. Payne; E. Caserta; G. Leone; M.C. Ostrowski; Ravi K. Madduri; Ian T. Foster; Subha Madhavan
Translational research projects target a wide variety of diseases, test many different kinds of biomedical hypotheses, and employ a large assortment of experimental methodologies. Diverse data, complex execution environments, and demanding security and reliability requirements make the implementation of these projects extremely challenging and require novel e-Science technologies.
ieee international conference on high performance computing data and analytics | 2009
Tahsin M. Kurç; Shannon Hastings; Vijay Kumar; Stephen Langella; Ashish Sharma; Tony Pan; Scott Oster; David Ervin; Justin Permar; Sivaramakrishnan Narayanan; Yolanda Gil; Ewa Deelman; Mary W. Hall; Joel H. Saltz
Integrative biomedical research projects query, analyze, and integrate many different data types and make use of datasets obtained from measurements or simulations of structure and function at multiple biological scales. With the increasing availability of high-throughput and high-resolution instruments, the integrative biomedical research imposes many challenging requirements on software middleware systems. In this paper, we look at some of these requirements using example research pattern templates. We then discuss how middleware systems, which incorporate Grid and high-performance computing, could be employed to address the requirements.
Journal of the American Medical Informatics Association | 2006
Tahsin M. Kurç; Daniel Janies; Andrew D. Johnson; Stephen Langella; Scott Oster; Shannon Hastings; Farhat Habib; Terry Camerlengo; David Ervin; Joel H. Saltz
Abstract Diverse data sets have become key building blocks of translational biomedical research. Data types captured and referenced by sophisticated research studies include high throughput genomic and proteomic data, laboratory data, data from imagery, and outcome data. In this paper, the authors present the application of an XML-based data management system to support integration of data from disparate data sources and large data sets. This system facilitates management of XML schemas and on-demand creation and management of XML databases that conform to these schemas. They illustrate the use of this system in an application for genotype–phenotype correlation analyses. This application implements a method of phenotype–genotype correlation based on phylogenetic optimization of large data sets of mouse SNPs and phenotypic data. The application workflow requires the management and integration of genomic information and phenotypic data from external data repositories and from the results of phenotype–genotype correlation analyses. Our implementation supports the process of carrying out a complex workflow that includes large-scale phylogenetic tree optimizations and application of Maddisons concentrated changes test to large phylogenetic tree data sets. The data management system also allows collaborators to share data in a uniform way and supports complex queries that target data sets.
international parallel and distributed processing symposium | 2008
Joel H. Saltz; Scott Oster; Shannon Hastings; Stephen Langella; Renato Ferreira; Justin Permar; Ashish Sharma; David Ervin; Tony Pan; Tahsin M. Kurç
Design templates that involve discovery, analysis, and integration of information resources commonly occur in many scientific research projects. In this paper we present examples of design templates from the biomedical translational research domain and discuss the requirements imposed on Grid middleware infrastructures by them. Using caGrid, which is a Grid middleware system based on the model driven architecture (MDA) and the service oriented architecture (SOA) paradigms, as a starting point, we discuss architecture directions for MDA and SOA based systems like caGrid to support common design templates.
Archive | 2010
Tahsin M. Kurç; Ashish Sharma; Scott Oster; Tony Pan; Shannon Hastings; Stephen Langella; David Ervin; Justin Permar; Daniel J. Brat; Thomas J. Fitzgerald; James A. Purdy; Walter R. Bosch; Joel H. Saltz
As our ability to capture and generate large biomedical datasets improves, researchers increasingly need to synthesize information using a variety of data types, data systems, and analysis tools. The need for informatics support to facilitate coordinated and federated access to disparate data and analysis resources is more pronounced in collaborative basic, clinical, and translational research studies spanning multiple institutions. This chapter presents a high-level overview of several middleware architecture frameworks and technologies and discusses how these approaches can be employed to address the informatics requirements of large-scale and collaborative cancer research.
american medical informatics association annual symposium | 2007
Stephen Langella; Scott Oster; Shannon Hastings; Frank Siebenlist; Joshua Phillips; David Ervin; Justin Permar; Tahsin M. Kurç; Joel H. Saltz
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science | 2010
Rob Wynden; Mark G. Weiner; Ida Sim; Davera Gabriel; Marco Casale; Simona Carini; Shannon Hastings; David Ervin; Samson W. Tu; John H. Gennari; Nicholas R. Anderson; Ketty Mobed; Prakash Lakshminarayanan; Maggie Massary; Russell J. Cucina