Eric Perlman | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Eric Perlman is active.

Explore More

Publication

Featured researches published by Eric Perlman.

conference on high performance computing (supercomputing) | 2007

Data exploration of turbulence simulations using a database cluster

Eric Perlman; Randal C. Burns; Yi Li; Charles Meneveau

We describe a new environment for the exploration of turbulent flows that uses a cluster of databases to store complete histories of Direct Numerical Simulation (DNS) results. This allows for spatial and temporal exploration of high-resolution data that were traditionally too large to store and too computationally expensive to produce on demand. We perform analysis of these data directly on the databases nodes, which minimizes the volume of network traffic. The low network demands enable us to provide public access to this experimental platform and its datasets through Web services. This paper details the system design and implementation. Specifically, we focus on hierarchical spatial indexing, cache-sensitive spatial scheduling of batch workloads, localizing computation through data partitioning, and load balancing techniques that minimize data movement. We provide real examples of how scientists use the system to perform high-resolution turbulence research from standard desktop computing environments.

statistical and scientific database management | 2013

The open connectome project data cluster: scalable analysis and vision for high-throughput neuroscience

Randal C. Burns; Kunal Lillaney; Daniel R. Berger; Logan Grosenick; Karl Deisseroth; R. Clay Reid; William Gray Roncal; Priya Manavalan; Davi Bock; Narayanan Kasthuri; Michael M. Kazhdan; Stephen J. Smith; Dean M. Kleissas; Eric Perlman; Kwanghun Chung; Nicholas C. Weiler; Jeff W. Lichtman; Alexander S. Szalay; Joshua T. Vogelstein; R. Jacob Vogelstein

We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily for workloads that build connectomes---neural connectivity maps of the brain---using the parallel execution of computer vision algorithms on high-performance compute clusters. These services and open-science data sets are publicly available at openconnecto.me. The system design inherits much from NoSQL scale-out and data-intensive computing architectures. We distribute data to cluster nodes by partitioning a spatial index. We direct I/O to different systems---reads to parallel disk arrays and writes to solid-state storage---to avoid I/O interference and maximize throughput. All programming interfaces are RESTful Web services, which are simple and stateless, improving scalability and usability. We include a performance evaluation of the production system, highlighting the effectiveness of spatial data organization.

Cell | 2018

A Complete Electron Microscopy Volume of the Brain of Adult Drosophila melanogaster

Zhihao Zheng; J. Scott Lauritzen; Eric Perlman; Camenzind G. Robinson; Matthew Nichols; Daniel E. Milkie; Omar N. Torrens; John H. Price; Corey B. Fisher; Nadiya Sharifi; Steven A. Calle-Schuler; Lucia Kmecova; Iqbal J. Ali; Bill Karsh; Eric T. Trautman; John A. Bogovic; Philipp Hanslovsky; Gregory S.X.E. Jefferis; Michael M. Kazhdan; Khaled Khairy; Stephan Saalfeld; Richard D. Fetter; Davi Bock

Summary Drosophila melanogaster has a rich repertoire of innate and learned behaviors. Its 100,000-neuron brain is a large but tractable target for comprehensive neural circuit mapping. Only electron microscopy (EM) enables complete, unbiased mapping of synaptic connectivity; however, the fly brain is too large for conventional EM. We developed a custom high-throughput EM platform and imaged the entire brain of an adult female fly at synaptic resolution. To validate the dataset, we traced brain-spanning circuitry involving the mushroom body (MB), which has been extensively studied for its role in learning. All inputs to Kenyon cells (KCs), the intrinsic neurons of the MB, were mapped, revealing a previously unknown cell type, postsynaptic partners of KC dendrites, and unexpected clustering of olfactory projection neurons. These reconstructions show that this freely available EM volume supports mapping of brain-spanning circuits, which will significantly accelerate Drosophila neuroscience. Video Abstract

statistical and scientific database management | 2010

Organization of data in non-convex spatial domains

Eric Perlman; Randal C. Burns; Michael M. Kazhdan; Rebecca R. Murphy; William P. Ball; Nina Amenta

We present a technique for organizing data in spatial databases with non-convex domains based on an automatic characterization using the medial-axis transform (MAT). We define a tree based on the MAT and enumerate its branches to partition space and define a linear order on the partitions. This ordering clusters data in a manner that respects the complex shape of the domain. The ordering has the property that all data down any branch of the medial axis, regardless of the geometry of the sub-region, are contiguous on disk. Using this data organization technique, we build a system to provide efficient data discovery and analysis of the observational and model data sets of the Chesapeake Bay Environmental Observatory (CBEO). On typical CBEO workloads in which scientists query contiguous substructures of the Chesapeake Bay, we improve query processing performance by a factor of two when compared with orderings derived from space filling curves.

very large data bases | 2008

Organizing and indexing non-convex regions

Eric Perlman; Randal C. Burns; Michael M. Kazhdan

We demonstrate data indexing and query processing techniques that improve the efficiency of comparing, correlating, and joining data contained in non-convex regions. We use computational geometry techniques to automatically characterize the region of space from which data are drawn, partition the region based on that characterization, and create an index from the partitions. Our motivating application performs distributed data analysis queries among federated database sites that store scientific data sets from the Chesapeake Bay. Our preliminary findings indicate that these techniques often reduce the number of I/Os needed to serve a query by a factor of five---depending on the geometry of the query region.

conference on high performance computing (supercomputing) | 2006

Engineering the 100 terabyte turbulence database (or how to track particles at home)

Eric Perlman; Randal C. Burns

We describe a new environment for large-scale turbulence simulations that uses a cluster of database nodes to store the complete space-time history of fluid velocities. This allows for rapid access to high resolution data that were traditionally too large to store and too computationally expensive to produce on demand.We perform the actual experimental analysis inside the database nodes, which allows for data-intensive computations to be performed across a large number of nodes with relatively little network traffic.We currently have a limited-scale prototype system running actual turbulence simulations and are in the process of establishing a production cluster with high-resolution data. We will discuss our design choices and initial results with load balancing a data-intensive, migratory workload.

ieee international conference on high performance computing data and analytics | 2011