Nolan Li | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Nolan Li is active.

Explore More

Publication

Featured researches published by Nolan Li.

Computing in Science and Engineering | 2008

CasJobs and MyDB: A Batch Query Workbench

Nolan Li; Ani Thakar

Catalog archive server jobs (CasJobs) is an asynchronous query workbench service that lets users run unrestricted SQL queries against scientific catalog archives. After running queries in batch mode, users can save their results to a personal database called MyDB before downloading them, letting users manage their query workloads, results, and histories without causing network overloads.

statistical and scientific database management | 2012

SkyQuery: an implementation of a parallel probabilistic join engine for cross-identification of multiple astronomical databases

László Dobos; Tamas Budavari; Nolan Li; Alexander S. Szalay; István Csabai

Multi-wavelength astronomical studies require cross-identification of detections of the same celestial objects in multiple catalogs based on spherical coordinates and other properties. Because of the large data volumes and spherical geometry, the symmetric N-way association of astronomical detections is a computationally intensive problem, even when sophisticated indexing schemes are used to exclude obviously false candidates. Legacy astronomical catalogs already contain detections of more than a hundred million objects while ongoing and future surveys will produce catalogs of billions of objects with multiple detections of each at different times. One time, pair-wise cross-identification of these large catalogs is not sufficient for many astronomical scenarios. Consequently, a novel system is necessary that can cross-identify multiple catalogs on-demand, efficiently and reliably. In this paper, we present our solution based on a cluster of commodity servers and ordinary relational databases. The cross-identification problems are formulated in a language based on SQL, but extended with special clauses. These special queries are partitioned spatially by coordinate ranges and compiled into a complex workflow of ordinary SQL queries. Workflows are then executed in a parallel framework using a cluster of servers hosting identical mirrors of the same data sets.

statistical and scientific database management | 2013

Graywulf: a platform for federated scientific databases and services

László Dobos; István Csabai; Alexander S. Szalay; Tamas Budavari; Nolan Li

Many fields of science rely on relational database management systems to analyze, publish and share data. Since RDBMS are originally designed for, and their development directions are primarily driven by, business use cases they often lack features very important for scientific applications. Horizontal scalability is probably the most important missing feature which makes it challenging to adapt traditional relational database systems to the ever growing data sizes. Due to the limited support of array data types and metadata management, successful application of RDBMS in science usually requires the development of custom extensions. While some of these extensions are specific to the field of science, the majority of them could easily be generalized and reused in other disciplines. With the Graywulf project we intend to target several goals. We are building a generic platform that offers reusable components for efficient storage, transformation, statistical analysis and presentation of scientific data stored in Microsoft SQL Server. Graywulf also addresses the distributed computational issues arising from current RDBMS technologies. The current version supports load balancing of simple queries and parallel execution of partitioned queries over a set of mirrored databases. Uniform user access to the data is provided through a web based query interface and a data surface for software clients. Queries are formulated in a slightly modified syntax of SQL that offers a transparent view of the distributed data. The software library consists of several components that can be reused to develop complex scientific data warehouses: a system registry, administration tools to manage entire database server clusters, a sophisticated workflow execution framework, and a SQL parser library.

workflows in support of large-scale science | 2008

CASJobs: A workflow environment designed for large scientific catalogs

Nolan Li; Alexander S. Szalay

As some scientific data resources grow well past the terabyte mark, so do the technical challenges of analyzing and retrieving such data. When presented with such a large volume, prior data delivery techniques, such as the time-honored system of simply copying all data to local storage prior to analysis, become impractical. We present CASJobs, an online workbench that focuses on enabling server-side exploratory and analytical tasks on large sets of data.

international conference on web services | 2005

Batch is back: CasJobs, serving multi-TB data on the Web

William O'Mullane; Nolan Li; Maria A. Nieto-santisteban; Alexander S. Szalay; Ani Thakar; Jim Gray

hawaii international conference on system sciences | 2009

GrayWulf: Scalable Software Architecture for Data Intensive Computing

Yogesh Simmhan; Roger S. Barga; Catharine van Ingen; Maria A. Nieto-santisteban; Laszlo Dobos; Nolan Li; Michael Shipway; Alexander S. Szalay; Sue Werner; J. N. Heasley

Archive | 2004

Astronomical Data Query Language: Simple Query Protocol for the Virtual Observatory

Norihito Yasuda; Yoshihiko Mizumoto; Masatoshi Ohishi; William O'Mullane; Tamas Budavari; V. Haridas; Nolan Li; Tanu Malik; Alexander S. Szalay; Michael R. S. Hill; Tony Linde; Barbara J. Mann; Clive G. Page

Archive | 2003

Batch Query System with Interactive Local Storage for SDSS and the VO

William O'Mullane; Jim Gray; Nolan Li; Tamas Budavari; Maria Nieto Santisteban; Alexander S. Szalay

Archive | 2008

Pan-STARRS: Learning to Ride the Data Tsunami

Maria A. Nieto-santisteban; Yogesh Simmhan; Roger S. Barga; LaszloDobos; James N. Heasley; Conrad Holmberg; Nolan Li; MichaelShipway; Alexander S. Szalay; Catharine van Ingen; Suzanne Werner

Archive | 2012