Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Maria A. Nieto-santisteban is active.

Publication


Featured researches published by Maria A. Nieto-santisteban.


international conference on management of data | 2005

Scientific data management in the coming decade

Jim Gray; David T. Liu; Maria A. Nieto-santisteban; Alexander S. Szalay; David J. DeWitt; Gerd Heber

Scientific instruments and computer simulations are creating vast data stores that require new scientific methods to analyze and organize the data. Data volumes are approximately doubling each year. Since these new instruments have extraordinary precision, the data quality is also rapidly improving. Analyzing this data to find the subtle effects missed by previous studies requires algorithms that can simultaneously deal with huge datasets and that can find very subtle effects --- finding both needles in the haystack and finding very small haystacks that were undetected in previous measurements.


hawaii international conference on system sciences | 2009

GrayWulf: Scalable Clustered Architecture for Data Intensive Computing

Alexander S. Szalay; Gordon Bell; Jan Vandenberg; Alainna Wonders; Randal C. Burns; Dan Fay; J. N. Heasley; Tony Hey; Maria A. Nieto-santisteban; Ani Thakar; Richard Wilton

Data intensive computing presents a significant challenge for traditional supercomputing architectures that maximize FLOPS since CPU speed has surpassed IO capabilities of HPC systems and BeoWulf clusters. We present the architecture for a three tier commodity component cluster designed for a range of data intensive computations operating on petascale data sets named GrayWulf. The design goal is a balanced system in terms of IO performance and memory size, according to Amdahls Laws. The hardware currently installed at JHU exceeds one petabyte of storage and has 0.5 bytes/sec of I/O and 1 byte of memory for each CPU cycle. The GrayWulf provides almost an order of magnitude better balance than existing systems. The paper covers its architecture and reference applications. The software design is presented in a companion paper.


The Astrophysical Journal | 2009

GALEX-SDSS CATALOGS FOR STATISTICAL STUDIES

Tamas Budavari; S. Heinis; Alexander S. Szalay; Maria A. Nieto-santisteban; Jayant Gupchup; Bernie Shiao; Myron A. Smith; R. X. Chang; Guinevere Kauffmann; Patrick Morrissey; David Schiminovich; Bruno Milliard; Ted K. Wyder; D. Christopher Martin; Tom A. Barlow; Mark Seibert; Karl Forster; Luciana Bianchi; Jose Donas; Peter G. Friedman; Timothy M. Heckman; Young-Wook Lee; Barry F. Madore; Susan G. Neff; R. Michael Rich; Barry Y. Welsh

We present a detailed study of the Galaxy Evolution Explorer’s (GALEX) photometric catalogs with special focus on the statistical properties of the All-sky and Medium Imaging Surveys. We introduce the concept of primaries to resolve the issue of multiple detections and follow a geometric approach to define clean catalogs with well understood selection functions. We cross-identify the GALEX sources (GR2+3) with Sloan Digital Sky Survey (SDSS; DR6) observations, which indirectly provides an invaluable insight into the astrometric model of the UV sources and allows us to revise the band merging strategy. We derive the formal description of the GALEX footprints as well as their intersections with the SDSS coverage along with analytic calculations of their areal coverage. The crossmatch catalogs are made available for the public. We conclude by illustrating the implementation of typical selection criteria in SQL for catalog subsets geared toward statistical analyses, e.g., correlation and luminosity function studies.


Publications of the Astronomical Society of the Pacific | 2000

Cosmic-Ray Rejection and Readout Efficiency for Large-Area Arrays

D. J. Fixsen; J. D. Offenberg; Robert J. Hanisch; John C. Mather; Maria A. Nieto-santisteban; R. Sengupta; Hervey S. Stockman

We present an algorithm to optimally process uniformly sampled array image data obtained with a nondestructive readout. The algorithm discards full wells, removes cosmic-ray (particle) hits and other glitches, and makes a nearly optimum estimate of the signal on each pixel. The algorithm also compresses the data. The computer requirements are modest, and the results are robust. The results are shown and compared to results of Fowler-sampled and -processed data. Nonideal detector performance may require some additional code, but this is not expected to cost much processing time. Known types of detector faults are addressed.


Proceedings of SPIE | 2006

Designing a Multi-Petabyte Database for LSST

Jacek Becla; Andrew Hanushevsky; Sergei Nikolaev; Ghaleb Abdulla; Alexander S. Szalay; Maria A. Nieto-santisteban; Ani Thakar; Jim Gray

The 3.2 giga-pixel LSST camera will produce approximately half a petabyte of archive images every month. These data need to be reduced in under a minute to produce real-time transient alerts, and then added to the cumulative catalog for further analysis. The catalog is expected to grow about three hundred terabytes per year. The data volume, the real-time transient alerting requirements of the LSST, and its spatio-temporal aspects require innovative techniques to build an efficient data access system at reasonable cost. As currently envisioned, the system will rely on a database for catalogs and metadata. Several database systems are being evaluated to understand how they perform at these data rates, data volumes, and access patterns. This paper describes the LSST requirements, the challenges they impose, the data access philosophy, results to date from evaluating available database technologies against LSST requirements, and the proposed database architecture to meet the data challenges.


Publications of the Astronomical Society of the Pacific | 2001

Validation of Up-the-Ramp Sampling with Cosmic-Ray Rejection on Infrared Detectors

Joel D. Offenberg; Dale J. Fixsen; Bernard J. Rauscher; W. J. Forrest; Robert J. Hanisch; John C. Mather; M. E. McKelvey; R. E. McMurray; Maria A. Nieto-santisteban; Judith L. Pipher; R. Sengupta; Hervey S. Stockman

We examine cosmic-ray rejection methodology on data collected from InSb and Si:As detectors. The application of an up-the-ramp sampling technique with cosmic-ray identification and mitigation is the focus of this study. This technique is valuable for space-based observatories which are exposed to high-radiation environments. We validate the up-the-ramp approach on radiation-test data sets with InSb and Si:As detectors which were generated for SIRTF. The up-the-ramp sampling method studied in this paper is over 99.9% effective at removing cosmic rays and preserves the structure and photometric quality of the image to well within the measurement error.


Proceedings of SPIE | 2012

The JWST data management system engineering database

Maria A. Nieto-santisteban

The Engineering Data Processing (EDP) component of the James Webb Telescope (JWST) Data Management System (DMS) will collect calibrated engineering values for about 15,000 parameters, 300 million samples per day, with a potential daily database growth of 14 GB, 5 TB per year, 50 TB for a 10-year mission. While data will be mostly received in (time, parameter) order, fast access requires translation into (parameter, time) organization and sorting. Organization and indexing of the data will affect storage requirements as well as ingest and access efficiency. Fast access is critical to pipelines processing and calibrating science data.


data compression conference | 1999

Data compression for the next-generation space telescope

Maria A. Nieto-santisteban; Dale J. Fixsen; Joel D. Offenberg; Robert J. Hanisch; Hervey S. Stockman

Summary form only given. The next-generation space telescope (NGST) will produce about 600 GB/day, assuming we use the NASA yardstick 8k/spl times/8k NIR camera (16 bits/pixel), save and transmit 64 non-destructive read-outs per image, and the camera is in continuous use (about 80 observations/day, 10/sup 3/ s each). However, with an L2 halo orbit, the NASA NGST study estimates a downlink rate of 5.35 GB/day using X-band. Clearly the volume of data to downlink must be reduced by at least a factor of 100. Astronomical images are noisy. This fact makes them difficult to compress by lossless compression algorithms such as Huffman, Lempel-Ziv, run-length, or arithmetic code. However, they also have the virtue of showing similar values among adjacent pixels. Techniques such as Rices algorithm (Rice et al., 1993) and derivatives (White and Becker, 1998; Stiavelli and White, 1997) can take advantage of this. We present the way in which some of these compression techniques would work with NGST images. Unfortunately, these lossless algorithms give us compression ratios that still exceed the telemetry guidelines. We have also looked into the feasibility of doing lossy compression by scaling the original image prior to the lossless compression. Under this scheme, we find substantial data reduction with a negligible effect on the data quality.


conference on innovative data systems research | 2005

When Database Systems Meet the Grid.

Maria A. Nieto-santisteban; Alexander S. Szalay; Aniruddha R. Thakar; William O'Mullane; Jim Gray; James Timothy Annis


arXiv: Instrumentation and Methods for Astrophysics | 2016

The Pan-STARRS1 Database and Data Products

H. A. Flewelling; E. A. Magnier; Ken Chambers; J. N. Heasley; C. Holmberg; M. E. Huber; W. E. Sweeney; C. Waters; Xiangjun Chen; Daniel J. Farrow; G. Hasinger; R. Henderson; K. S. Long; N. Metcalfe; Maria A. Nieto-santisteban; Peder Norberg; R. P. Saglia; Alexander S. Szalay; A. Rest; Aniruddha R. Thakar; John L. Tonry; J. Valenti; S. Werner; Richard L. White; Larry Denneau; P. W. Draper; K. W. Hodapp; Robert Jedicke; Nick Kaiser; R. P. Kudritzki

Collaboration


Dive into the Maria A. Nieto-santisteban's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Robert J. Hanisch

Space Telescope Science Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Joel D. Offenberg

Goddard Space Flight Center

View shared research outputs
Top Co-Authors

Avatar

Hervey S. Stockman

Space Telescope Science Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

John C. Mather

Goddard Space Flight Center

View shared research outputs
Top Co-Authors

Avatar

Nolan Li

Johns Hopkins University

View shared research outputs
Researchain Logo
Decentralizing Knowledge