Stefan Liess
University of Minnesota
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Stefan Liess.
advances in geographic information systems | 2011
Xun Zhou; Shashi Shekhar; Pradeep Mohan; Stefan Liess; Peter K. Snyder
Given a spatiotemporal (ST) dataset and a path in its embedding spatiotemporal framework, the goal is to to identify all interesting sub-paths defined by an interest measure. Sub-path discovery is of fundamental importance for understanding climate changes, agriculture, and many other application. However, this problem is computationally challenging due to the massive volume of data, the varying length of sub-paths and non-monotonicity of interestingness throughout a sub-path. Previous approaches find interesting unit sub-paths (e.g., unit time interval) or interesting points. By contrast, we propose a Sub-path Enumeration and Pruning (SEP) approach that finds collections of long interesting sub-paths. Two case studies using climate change datasets show that SEP can find long interesting sub-paths which represent abrupt climate change. We provide theoretical analyses of correctness, completeness and computational complexity of the proposed approach. We also provide experimental evaluation of two traversal strategies for enumerating and pruning candidate sub-paths.
conference on intelligent data understanding | 2012
James H. Faghmous; Luke Styles; Varun Mithal; Shyam Boriah; Stefan Liess; Vipin Kumar; Frode Vikebø; Michel D. S. Mesquita
Rotating coherent structures of water known as ocean eddies are the oceanic analog of storms in the atmosphere and a crucial component of ocean dynamics. In addition to dominating the oceans kinetic energy, eddies play a significant role in the transport of water, salt, heat, and nutrients. Therefore, understanding current and future eddy activity is a central challenge to address future sustainability of marine ecosystems. The emergence of sea surface height observations from satellite radar altimeter has recently enabled researchers to track eddies at a global scale. The majority of studies that identify eddies from observational data employ highly parametrized connected component algorithms using expert filtered data, effectively making reproducibility and scalability challenging. In this paper, we improve upon the state-of-the-art connected component eddy monitoring algorithms to track eddies globally. This work makes three main contributions: first, we do not pre-process the data therefore minimizing the risk of wiping out important signals within the data. Second, we employ a physically-consistent convexity requirement on eddies based on theoretical and empirical studies to improve the accuracy and computational complexity of our method from quadratic to linear time in the size of each eddy. Finally, we accurately separate eddies that are in close spatial proximity, something existing methods cannot accomplish. We compare our results to those of the state of the art and discuss the impact of our improvements on the difference in results.
knowledge discovery and data mining | 2012
Jaya Kawale; Snigdhansu Chatterjee; Dominick Ormsby; Karsten Steinhaeuser; Stefan Liess; Vipin Kumar
Dipoles represent long distance connections between the pressure anomalies of two distant regions that are negatively correlated with each other. Such dipoles have proven important for understanding and explaining the variability in climate in many regions of the world, e.g., the El Nino climate phenomenon is known to be responsible for precipitation and temperature anomalies over large parts of the world. Systematic approaches for dipole detection generate a large number of candidate dipoles, but there exists no method to evaluate the significance of the candidate teleconnections. In this paper, we present a novel method for testing the statistical significance of the class of spatio-temporal teleconnection patterns called as dipoles. One of the most important challenges in addressing significance testing in a spatio-temporal context is how to address the spatial and temporal dependencies that show up as high autocorrelation. We present a novel approach that uses the wild bootstrap to capture the spatio-temporal dependencies, in the special use case of teleconnections in climate data. Our approach to find the statistical significance takes into account the autocorrelation, the seasonality and the trend in the time series over a period of time. This framework is applicable to other problems in spatio-temporal data mining to assess the significance of the patterns.
Journal of Climate | 2014
Stefan Liess; Arjun Kumar; Peter K. Snyder; Jaya Kawale; Karsten Steinhaeuser; Frederick H. M. Semazzi; Auroop R. Ganguly; Nagiza F. Samatova; Vipin Kumar
AbstractA new approach is used to detect atmospheric teleconnections without being bound by orthogonality (such as empirical orthogonal functions). This method employs negative correlations in a global dataset to detect potential teleconnections. One teleconnection occurs between the Tasman Sea and the Southern Ocean. It is related to El Nino–Southern Oscillation (ENSO), the Indian Ocean dipole (IOD), and the southern annular mode (SAM). This teleconnection is significantly correlated with SAM during austral summer, fall, and winter, with IOD during spring, and with ENSO in summer. It can thus be described as a hybrid between these modes. Given previously found relationships between IOD and ENSO, and IOD’s proximity to the teleconnection centers, correlations to IOD are generally stronger than to ENSO.Increasing pressure over the Tasman Sea leads to higher (lower) surface temperature over eastern Australia (the southwestern Pacific) in all seasons and is related to reduced surface temperature over Wilkes ...
conference on intelligent data understanding | 2012
Jaya Kawale; Stefan Liess; Vipin Kumar; Upmanu Lall; Auroop R. Ganguly
Time series data in climate are often characterized by a delayed relationship between two variables, for example precipitation and temperature anomalies occurring at a place might also occur at another place after some time. These lagged relations generally signify the time lag between the cause and the effect or the spread of a common cause and are important to study and understand as they can aid in prediction. Identifying lagged relationships in climate data is challenging due to the various complex dependencies present in the data like spatial and temporal auto-correlation, seasonality, trends and long distance teleconnections. In this paper, we present a general framework for finding all pairs of lagged positive and negative relations that can exist in a given spatio-temporal dataset. We use a graph based approach based upon the concept of shared reciprocal nearest neighbor to generate cluster pairs of locations sharing similar or opposing behavior for every time lag. Our framework can be generalized to extract multivariate lagged relationships across different variables thus can be used to understand the lagged response of one variable on another. We show the utility of our approach by extracting some of the known delayed relationships like the Madden Julian Oscillation (MJO) and the Pacific North American (PNA) pattern at different lags using the sea level pressure dataset provided by the NCEP/NCAR. Our approach can be broadly applied to other problems in spatio-temporal domain to extract lagged relationships.
Journal of Climate | 2017
Stefan Liess; Saurabh Agrawal; Snigdhansu Chatterjee; Vipin Kumar
AbstractThe Walker circulation is linked to extratropical waves that are deflected from the Northern Hemisphere polar regions and travel southeastward over central Asia toward the western Pacific warm pool during northern winter. The wave pattern resembles the east Atlantic–west Russia pattern and influences the El Nino–Southern Oscillation (ENSO) region. A tripole pattern between the West Siberian Plain and the two centers of action of ENSO indicates that the background state of ENSO with respect to global sea level pressure (SLP) has a significant negative correlation to the West Siberian Plain. The correlation with the background state, which is defined by the sum of the two centers of action of ENSO, is higher than each of the pairwise correlations with either of the ENSO centers alone. The centers are defined with a clustering algorithm that detects regions with similar characteristics. The normalized monthly SLP time series for the two centers of ENSO (around Darwin, Australia, and Tahiti) are area ...
Journal of Climate | 2016
Mengqian Lu; Upmanu Lall; Jaya Kawale; Stefan Liess; Vipin Kumar
AbstractCorrelation networks identified from financial, genomic, ecological, epidemiological, social, and climatic data are being used to provide useful topological insights into the structure of high-dimensional data. Strong convection over the oceans and the atmospheric moisture transport and flow convergence indicated by atmospheric pressure fields may determine where and when extreme precipitation occurs. Here, the spatiotemporal relationship among sea surface temperature (SST), sea level pressure (SLP), and extreme global precipitation is explored using a graph-based approach that uses the concept of reciprocity to generate cluster pairs of locations with similar spatiotemporal patterns at any time lag. A global time-lagged relationship between pentad SST anomalies and pentad SLP anomalies is investigated to understand the linkages and influence of the slowly changing oceanic boundary conditions on the development of the global atmospheric circulation. This study explores the use of this correlation ...
Computing in Science and Engineering | 2015
Stefan Liess
Recent growth in the scale and variety of Earth science data has provided unprecedented opportunities to big data analytics research for understanding the Earths physical processes. An upsurge of Earth science datasets in the past few decades are being continually collected using various modes of acquisition, at different scales of observation, and in diverse data types and formats. Earth science datasets, however, exhibit some unique characteristics (such as adherence to physical properties and spatiotemporal constraints) that present challenges to traditional data-centric approaches. In this article, the authors briefly introduce the different categories of Earth science datasets and further describe some of the major data-centric challenges in analyzing Earth science data.
Managing and Mining Sensor Data | 2013
James H. Faghmous; Jaya Kawale; Luke Styles; Mace Blank; Varun Mithal; Xi C. Chen; Ankush Khandelwal; Shyam Boriah; Karsten Steinhaeuser; Michael Steinbach; Vipin Kumar; Stefan Liess
Advances in earth observation technologies have led to the acquisition of vast volumes of accurate, timely and reliable environmental data which encompass a multitude of information about the land, ocean and atmosphere of the planet. Earth science sensor datasets capture multiple facets of information about natural processes and human activities that shape the physical landscape and environmental quality of our planet, and thus, offer an opportunity to monitor and understand the diverse phenomena affecting earth’s complex system. The monitoring, analysis and understanding of these rich sensor datasets is thus of prime importance for the efficient planning and management of critical resources, since the societal costs of mitigation or adaptation decisions for natural or human-induced adverse events are significant. Hence, a thorough understanding of earth science sensor datasets has a direct impact on a range of societally relevant issues. Moreover, earth science sensor datasets possess unique domain-specific properties that distinguish them from sensor datasets used in other domains, and thus demand the need for novel tools and techniques to be developed for their analysis, adhering to their characteristic issues and challenges.
knowledge discovery and data mining | 2017
Saurabh Agrawal; Gowtham Atluri; William Haltom; Stefan Liess; Snigdhansu Chatterjee; Vipin Kumar
Mining relationships in time series data is of immense interest to several disciplines such as neuroscience, climate science, and transportation. Traditional approaches for mining relationships focus on discovering pair-wise relationships in the data. In this work, we define a novel relationship pattern involving three interacting time series, which we refer to as a tripole. We show that tripoles capture interesting relationship patterns in the data that are not possible to be captured using traditionally studied pair-wise relationships. We demonstrate the utility of tripoles in multiple real-world datasets from various domains including climate science and neuroscience. In particular, our approach is able to discover tripoles that are statistically significant, reproducible across multiple independent data sets, and lead to novel domain insights.