Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Vincent A. Fusaro is active.

Publication


Featured researches published by Vincent A. Fusaro.


BMC Bioinformatics | 2010

Cloud computing for comparative genomics

Dennis P. Wall; Parul Kudtarkar; Vincent A. Fusaro; Rimma Pivovarov; Prasad Patil; Peter J. Tonellato

BackgroundLarge comparative genomics studies and tools are becoming increasingly more compute-expensive as the number of available genome sequences continues to rise. The capacity and cost of local computing infrastructures are likely to become prohibitive with the increase, especially as the breadth of questions continues to rise. Alternative computing architectures, in particular cloud computing environments, may help alleviate this increasing pressure and enable fast, large-scale, and cost-effective comparative genomics strategies going forward. To test this, we redesigned a typical comparative genomics algorithm, the reciprocal smallest distance algorithm (RSD), to run within Amazons Elastic Computing Cloud (EC2). We then employed the RSD-cloud for ortholog calculations across a wide selection of fully sequenced genomes.ResultsWe ran more than 300,000 RSD-cloud processes within the EC2. These jobs were farmed simultaneously to 100 high capacity compute nodes using the Amazon Web Service Elastic Map Reduce and included a wide mix of large and small genomes. The total computation time took just under 70 hours and cost a total of


PLOS Computational Biology | 2011

Biomedical Cloud Computing With Amazon Web Services

Vincent A. Fusaro; Prasad Patil; Erik Gafni; Dennis P. Wall; Peter J. Tonellato

6,302 USD.ConclusionsThe effort to transform existing comparative genomics algorithms from local compute infrastructures is not trivial. However, the speed and flexibility of cloud computing environments provides a substantial boost with manageable cost. The procedure designed to transform the RSD algorithm into a cloud-ready application is readily adaptable to similar comparative genomics problems.


Translational Psychiatry | 2012

Use of machine learning to shorten observation-based screening and diagnosis of autism

Dennis P. Wall; Jack A. Kosmicki; Todd DeLuca; Elizabeth Borges Harstad; Vincent A. Fusaro

In this overview to biomedical computing in the cloud, we discussed two primary ways to use the cloud (a single instance or cluster), provided a detailed example using NGS mapping, and highlighted the associated costs. While many users new to the cloud may assume that entry is as straightforward as uploading an application and selecting an instance type and storage options, we illustrated that there is substantial up-front effort required before an application can make full use of the clouds vast resources. Our intention was to provide a set of best practices and to illustrate how those apply to a typical application pipeline for biomedical informatics, but also general enough for extrapolation to other types of computational problems. Our mapping example was intended to illustrate how to develop a scalable project and not to compare and contrast alignment algorithms for read mapping and genome assembly. Indeed, with a newer aligner such as Bowtie, it is possible to map the entire African genome using one m2.2xlarge instance in 48 hours for a total cost of approximately


BMC Medical Genomics | 2010

Genotator: A disease-agnostic tool for genetic annotation of disease

Dennis P. Wall; Rimma Pivovarov; Mark Y. Tong; Jae-Yoon Jung; Vincent A. Fusaro; Todd DeLuca; Peter J. Tonellato

48 in computation time. In our example, we were not concerned with data transfer rates, which are heavily influenced by the amount of available bandwidth, connection latency, and network availability. When transferring large amounts of data to the cloud, bandwidth limitations can be a major bottleneck, and in some cases it is more efficient to simply mail a storage device containing the data to AWS (http://aws.amazon.com/importexport/). More information about cloud computing, detailed cost analysis, and security can be found in references.


Evolutionary Bioinformatics | 2010

Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup:

Parul Kudtarkar; Todd DeLuca; Vincent A. Fusaro; Peter J. Tonellato; Dennis P. Wall

The Autism Diagnostic Observation Schedule-Generic (ADOS) is one of the most widely used instruments for behavioral evaluation of autism spectrum disorders. It is composed of four modules, each tailored for a specific group of individuals based on their language and developmental level. On average, a module takes between 30 and 60 min to deliver. We used a series of machine-learning algorithms to study the complete set of scores from Module 1 of the ADOS available at the Autism Genetic Resource Exchange (AGRE) for 612 individuals with a classification of autism and 15 non-spectrum individuals from both AGRE and the Boston Autism Consortium (AC). Our analysis indicated that 8 of the 29 items contained in Module 1 of the ADOS were sufficient to classify autism with 100% accuracy. We further validated the accuracy of this eight-item classifier against complete sets of scores from two independent sources, a collection of 110 individuals with autism from AC and a collection of 336 individuals with autism from the Simons Foundation. In both cases, our classifier performed with nearly 100% sensitivity, correctly classifying all but two of the individuals from these two resources with a diagnosis of autism, and with 94% specificity on a collection of observed and simulated non-spectrum controls. The classifier contained several elements found in the ADOS algorithm, demonstrating high test validity, and also resulted in a quantitative score that measures classification confidence and extremeness of the phenotype. With incidence rates rising, the ability to classify autism effectively and quickly requires careful design of assessment and diagnostic tools. Given the brevity, accuracy and quantitative nature of the classifier, results from this study may prove valuable in the development of mobile tools for preliminary evaluation and clinical prioritization—in particular those focused on assessment of short home videos of children—that speed the pace of initial evaluation and broaden the reach to a significantly larger percentage of the population at risk.


PLOS ONE | 2014

The potential of accelerating early detection of autism through content analysis of YouTube videos.

Vincent A. Fusaro; Jena Daniels; Marlena Duda; Todd DeLuca; Olivia D’Angelo; Jenna Tamburello; James Maniscalco; Dennis P. Wall

BackgroundDisease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources differ dramatically in their disease coverage and genetic depth. With few exceptions, researchers must manually search a variety of sites to assemble a complete set of genetic evidence for a particular disease of interest, a process that is both time-consuming and error-prone.MethodsWe designed a real-time aggregation tool that provides both comprehensive coverage and reliable gene-to-disease rankings for any disease. Our tool, called Genotator, automatically integrates data from 11 externally accessible clinical genetics resources and uses these data in a straightforward formula to rank genes in order of disease relevance. We tested the accuracy of coverage of Genotator in three separate diseases for which there exist specialty curated databases, Autism Spectrum Disorder, Parkinsons Disease, and Alzheimer Disease. Genotator is freely available at http://genotator.hms.harvard.edu.ResultsGenotator demonstrated that most of the 11 selected databases contain unique information about the genetic composition of disease, with 2514 genes found in only one of the 11 databases. These findings confirm that the integration of these databases provides a more complete picture than would be possible from any one database alone. Genotator successfully identified at least 75% of the top ranked genes for all three of our use cases, including a 90% concordance with the top 40 ranked candidates for Alzheimer Disease.ConclusionsAs a meta-query engine, Genotator provides high coverage of both historical genetic research as well as recent advances in the genetic understanding of specific diseases. As such, Genotator provides a real-time aggregation of ranked data that remains current with the pace of research in the disease fields. Genotators algorithm appropriately transforms query terms to match the input requirements of each targeted databases and accurately resolves named synonyms to ensure full coverage of the genetic results with official nomenclature. Genotator generates an excel-style output that is consistent across disease queries and readily importable to other applications.


Circulation | 2013

A Systems Approach to Designing Effective Clinical Trials Using Simulations

Vincent A. Fusaro; Prasad Patil; Chih Lin Chi; Charles F. Contant; Peter J. Tonellato

Background Comparative genomics resources, such as ortholog detection tools and repositories are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable alternative for large computational tools in bioinformatics. In the present manuscript, we optimize the computation of a large-scale comparative genomics resource—Roundup—using cloud computing, describe the proper operating principles required to achieve computational efficiency on the cloud, and detail important procedures for improving cost-effectiveness to ensure maximal computation at minimal costs. Methods Utilizing the comparative genomics tool, Roundup, as a case study, we computed orthologs among 902 fully sequenced genomes on Amazons Elastic Compute Cloud. For managing the ortholog processes, we designed a strategy to deploy the web service, Elastic MapReduce, and maximize the use of the cloud while simultaneously minimizing costs. Specifically, we created a model to estimate cloud runtime based on the size and complexity of the genomes being compared that determines in advance the optimal order of the jobs to be submitted. Results We computed orthologous relationships for 245,323 genome-to-genome comparisons on Amazons computing cloud, a computation that required just over 200 hours and cost


Journal of the American Medical Informatics Association | 2017

Creating a scalable clinical pharmacogenomics service with automated interpretation and medical record result integration - experience from a pediatric tertiary care facility.

Shannon Manzi; Vincent A. Fusaro; Laura Chadwick; Catherine A. Brownstein; Catherine Clinton; Kenneth D. Mandl; Wendy A. Wolf; Jared B. Hawkins

8,000 USD, at least 40% less than expected under a strategy in which genome comparisons were submitted to the cloud randomly with respect to runtime. Our cost savings projections were based on a model that not only demonstrates the optimal strategy for deploying RSD to the cloud, but also finds the optimal cluster size to minimize waste and maximize usage. Our cost-reduction model is readily adaptable for other comparative genomics tools and potentially of significant benefit to labs seeking to take advantage of the cloud as an alternative to local computing infrastructure.


PLOS ONE | 2013

Early Detection of Poor Adherers to Statins: Applying Individualized Surveillance to Pay for Performance

Andrew J. Zimolzak; Claire M. Spettell; Joaquim Fernandes; Vincent A. Fusaro; Nathan Palmer; Suchi Saria; Isaac S. Kohane; Magdalena Jonikas; Kenneth D. Mandl

Abstract Autism is on the rise, with 1 in 88 children receiving a diagnosis in the United States, yet the process for diagnosis remains cumbersome and time consuming. Research has shown that home videos of children can help increase the accuracy of diagnosis. However the use of videos in the diagnostic process is uncommon. In the present study, we assessed the feasibility of applying a gold-standard diagnostic instrument to brief and unstructured home videos and tested whether video analysis can enable more rapid detection of the core features of autism outside of clinical environments. We collected 100 public videos from YouTube of children ages 1–15 with either a self-reported diagnosis of an ASD (N = 45) or not (N = 55). Four non-clinical raters independently scored all videos using one of the most widely adopted tools for behavioral diagnosis of autism, the Autism Diagnostic Observation Schedule-Generic (ADOS). The classification accuracy was 96.8%, with 94.1% sensitivity and 100% specificity, the inter-rater correlation for the behavioral domains on the ADOS was 0.88, and the diagnoses matched a trained clinician in all but 3 of 22 randomly selected video cases. Despite the diversity of videos and non-clinical raters, our results indicate that it is possible to achieve high classification accuracy, sensitivity, and specificity as well as clinically acceptable inter-rater reliability with nonclinical personnel. Our results also demonstrate the potential for video-based detection of autism in short, unstructured home videos and further suggests that at least a percentage of the effort associated with detection and monitoring of autism may be mobilized and moved outside of traditional clinical environments.


cairo international biomedical engineering conference | 2010

An approach to optimal individualized warfarin treatment through clinical trial simulations

Chih Lin Chi; Vincent A. Fusaro; Prasad Patil; Matthew A. Crawford; Charles F. Contant; Peter J. Tonellato

Background— Pharmacogenetics in warfarin clinical trials have failed to show a significant benefit in comparison with standard clinical therapy. This study demonstrates a computational framework to systematically evaluate preclinical trial design of target population, pharmacogenetic algorithms, and dosing protocols to optimize primary outcomes. Methods and Results— We programmatically created an end-to-end framework that systematically evaluates warfarin clinical trial designs. The framework includes options to create a patient population, multiple dosing strategies including genetic-based and nongenetic clinical-based, multiple-dose adjustment protocols, pharmacokinetic/pharmacodynamics modeling and international normalization ratio prediction, and various types of outcome measures. We validated the framework by conducting 1000 simulations of the Applying Pharmacogenetic Algorithms to Individualize Dosing of Warfarin (CoumaGen) clinical trial primary end points. The simulation predicted a mean time in therapeutic range of 70.6% and 72.2% (P=0.47) in the standard and pharmacogenetic arms, respectively. Then, we evaluated another dosing protocol under the same original conditions and found a significant difference in the time in therapeutic range between the pharmacogenetic and standard arm (78.8% versus 73.8%; P=0.0065), respectively. Conclusions— We demonstrate that this simulation framework is useful in the preclinical assessment phase to study and evaluate design options and provide evidence to optimize the clinical trial for patient efficacy and reduced risk.

Collaboration


Dive into the Vincent A. Fusaro's collaboration.

Top Co-Authors

Avatar

Peter J. Tonellato

University of Wisconsin–Milwaukee

View shared research outputs
Top Co-Authors

Avatar

Prasad Patil

Johns Hopkins University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Chih Lin Chi

University of Minnesota

View shared research outputs
Top Co-Authors

Avatar

Kenneth D. Mandl

Boston Children's Hospital

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Catherine Clinton

Boston Children's Hospital

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge