Samuel Gratzl
Johannes Kepler University of Linz
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Samuel Gratzl.
IEEE Transactions on Visualization and Computer Graphics | 2013
Samuel Gratzl; Alexander Lex; Nils Gehlenborg; Hanspeter Pfister; Marc Streit
Rankings are a popular and universal approach to structuring otherwise unorganized collections of items by computing a rank for each item based on the value of one or more of its attributes. This allows us, for example, to prioritize tasks or to evaluate the performance of products relative to each other. While the visualization of a ranking itself is straightforward, its interpretation is not, because the rank of an item represents only a summary of a potentially complicated relationship between its attributes and those of the other items. It is also common that alternative rankings exist which need to be compared and analyzed to gain insight into how multiple heterogeneous attributes affect the rankings. Advanced visual exploration tools are needed to make this process efficient. In this paper we present a comprehensive analysis of requirements for the visualization of multi-attribute rankings. Based on these considerations, we propose LineUp - a novel and scalable visualization technique that uses bar charts. This interactive technique supports the ranking of items based on multiple heterogeneous attributes with different scales and semantics. It enables users to interactively combine attributes and flexibly refine parameters to explore the effect of changes in the attribute combination. This process can be employed to derive actionable insights as to which attributes of an item need to be modified in order for its rank to change. Additionally, through integration of slope graphs, LineUp can also be used to compare multiple alternative rankings on the same set of items, for example, over time or across different attribute combinations. We evaluate the effectiveness of the proposed multi-attribute visualization technique in a qualitative study. The study shows that users are able to successfully solve complex ranking tasks in a short period of time.
IEEE Transactions on Visualization and Computer Graphics | 2014
Thomas Mühlbacher; Harald Piringer; Samuel Gratzl; Michael Sedlmair; Marc Streit
An increasing number of interactive visualization tools stress the integration with computational software like MATLAB and R to access a variety of proven algorithms. In many cases, however, the algorithms are used as black boxes that run to completion in isolation which contradicts the needs of interactive data exploration. This paper structures, formalizes, and discusses possibilities to enable user involvement in ongoing computations. Based on a structured characterization of needs regarding intermediate feedback and control, the main contribution is a formalization and comparison of strategies for achieving user involvement for algorithms with different characteristics. In the context of integration, we describe considerations for implementing these strategies either as part of the visualization tool or as part of the algorithm, and we identify requirements and guidelines for the design of algorithmic APIs. To assess the practical applicability, we provide a survey of frequently used algorithm implementations within R regarding the fulfillment of these guidelines. While echoing previous calls for analysis modules which support data exploration more directly, we conclude that a range of pragmatic options for enabling user involvement in ongoing computations exists on both the visualization and algorithm side and should be used.
IEEE Transactions on Visualization and Computer Graphics | 2013
Alexander Lex; Christian Partl; Denis Kalkofen; Marc Streit; Samuel Gratzl; Anne Mai Wassermann; Dieter Schmalstieg; Hanspeter Pfister
Biological pathway maps are highly relevant tools for many tasks in molecular biology. They reduce the complexity of the overall biological network by partitioning it into smaller manageable parts. While this reduction of complexity is their biggest strength, it is, at the same time, their biggest weakness. By removing what is deemed not important for the primary function of the pathway, biologists lose the ability to follow and understand cross-talks between pathways. Considering these cross-talks is, however, critical in many analysis scenarios, such as judging effects of drugs. In this paper we introduce Entourage, a novel visualization technique that provides contextual information lost due to the artificial partitioning of the biological network, but at the same time limits the presented information to what is relevant to the analysts task. We use one pathway map as the focus of an analysis and allow a larger set of contextual pathways. For these context pathways we only show the contextual subsets, i.e., the parts of the graph that are relevant to a selection. Entourage suggests related pathways based on similarities and highlights parts of a pathway that are interesting in terms of mapped experimental data. We visualize interdependencies between pathways using stubs of visual links, which we found effective yet not obtrusive. By combining this approach with visualization of experimental data, we can provide domain experts with a highly valuable tool. We demonstrate the utility of Entourage with case studies conducted with a biochemist who researches the effects of drugs on pathways. We show that the technique is well suited to investigate interdependencies between pathways and to analyze, understand, and predict the effect that drugs have on different cell types.
BMC Bioinformatics | 2014
Marc Streit; Samuel Gratzl; Michael Gillhofer; Andreas Mayr; Andreas Mitterecker; Sepp Hochreiter
BackgroundCluster analysis is widely used to discover patterns in multi-dimensional data. Clustered heatmaps are the standard technique for visualizing one-way and two-way clustering results. In clustered heatmaps, rows and/or columns are reordered, resulting in a representation that shows the clusters as contiguous blocks. However, for biclustering results, where clusters can overlap, it is not possible to reorder the matrix in this way without duplicating rows and/or columns.ResultsWe present Furby, an interactive visualization technique for analyzing biclustering results. Our contribution is twofold. First, the technique provides an overview of a biclustering result, showing the actual data that forms the individual clusters together with the information which rows and columns they share. Second, for fuzzy clustering results, the proposed technique additionally enables analysts to interactively set the thresholds that transform the fuzzy (soft) clustering into hard clusters that can then be investigated using heatmaps or bar charts. Changes in the membership value thresholds are immediately reflected in the visualization. We demonstrate the value of Furby by loading biclustering results applied to a multi-tissue dataset into the visualization.ConclusionsThe proposed tool allows analysts to assess the overall quality of a biclustering result. Based on this high-level overview, analysts can then interactively explore the individual biclusters in detail. This novel way of handling fuzzy clustering results also supports analysts in finding the optimal thresholds that lead to the best clusters.
IEEE Transactions on Visualization and Computer Graphics | 2014
Samuel Gratzl; Nils Gehlenborg; Alexander Lex; Hanspeter Pfister; Marc Streit
Answering questions about complex issues often requires analysts to take into account information contained in multiple interconnected datasets. A common strategy in analyzing and visualizing large and heterogeneous data is dividing it into meaningful subsets. Interesting subsets can then be selected and the associated data and the relationships between the subsets visualized. However, neither the extraction and manipulation nor the comparison of subsets is well supported by state-of-the-art techniques. In this paper we present Domino, a novel multiform visualization technique for effectively representing subsets and the relationships between them. By providing comprehensive tools to arrange, combine, and extract subsets, Domino allows users to create both common visualization techniques and advanced visualizations tailored to specific use cases. In addition to the novel technique, we present an implementation that enables analysts to manage the wide range of options that our approach offers. Innovative interactive features such as placeholders and live previews support rapid creation of complex analysis setups. We introduce the technique and the implementation using a simple example and demonstrate scalability and effectiveness in a use case from the field of cancer genomics.
ieee vgtc conference on visualization | 2016
Samuel Gratzl; Alexander Lex; Nils Gehlenborg; Nicola Cosgrove; Marc Streit
The primary goal of visual data exploration tools is to enable the discovery of new insights. To justify and reproduce insights, the discovery process needs to be documented and communicated. A common approach to documenting and presenting findings is to capture visualizations as images or videos. Images, however, are insufficient for telling the story of a visual discovery, as they lack full provenance information and context. Videos are difficult to produce and edit, particularly due to the non‐linear nature of the exploratory process. Most importantly, however, neither approach provides the opportunity to return to any point in the exploration in order to review the state of the visualization in detail or to conduct additional analyses. In this paper we present CLUE (Capture, Label, Understand, Explain), a model that tightly integrates data exploration and presentation of discoveries. Based on provenance data captured during the exploration process, users can extract key steps, add annotations, and author “Vistories”, visual stories based on the history of the exploration. These Vistories can be shared for others to view, but also to retrace and extend the original analysis. We discuss how the CLUE approach can be integrated into visualization tools and provide a prototype implementation. Finally, we demonstrate the general applicability of the model in two usage scenarios: a Gapminder‐inspired visualization to explore public health data and an example from molecular biology that illustrates how Vistories could be used in scientific journals.
Nature Methods | 2014
Marc Streit; Alexander Lex; Samuel Gratzl; Christian Partl; Dieter Schmalstieg; Hanspeter Pfister; Peter J. Park; Nils Gehlenborg
To the editor: Cancer is a heterogeneous disease, and molecular profiling of tumors from large cohorts has enabled characterization of new tumor subtypes. This is a prerequisite for improving personalized treatment and ultimately better patient outcomes. Potential tumor subtypes can be identified with methods such as unsupervised clustering1 or network-based stratification2, which assign patients to sets based on high-dimensional molecular profiles. Detailed characterization of identified sets and their interpretation, however, remain a time-consuming exploratory process. To address these challenges, we combine ‘StratomeX’3, an interactive visualization tool, freely available at http://www.caleydo.org, with exploration tools to efficiently compare multiple patient stratifications, to correlate patient sets with clinical information or genomic alterations, and to view the differences between molecular profiles across patient sets. Although we focus on cancer genomics here, StratomeX can also be applied in other disease cohorts. Thousands of patient stratifications can be derived from large cancer genomics datasets. This space of patient stratifications—which we call the ‘stratome’—contains stratifications based on, for example, clustering of mRNA, microRNA, or protein expression matrices; the mutation or copy number status of genes; or on clinical variables. Due to the size of the stratome and the heterogeneity of the underlying datasets, integration of computational and visual approaches is indispensable to the analyst in identifying biologically or clinically meaningful stratifications, as well as clinical parameters and pathways that together provide a comprehensive view of each patient set. StratomeX complements the network viewers, heat maps, and genome browsers typically used in cancer genomics4 (Supplementary Discussion and Supplementary Table 1). To visualize the relationships between multiple patient stratifications as well as other data (Fig. 1 and Supplementary Fig. 1), stratifications are represented as columns of stacked blocks where each block corresponds to a patient set. Blocks contain visualizations of the data associated with those patients, such as heat maps, pathway maps overlaid with expression data, or survival plots (Supplementary Fig. 2). Bands connecting the blocks show the pairwise overlap of sets in adjacent stratifications, with the width of the bands representing the size of the overlap relative to the size of the patient sets (Supplementary Fig. 3). This visualization is an efficient tool to confirm hypotheses about gene functions or subtypes defined by molecular profiles. Figure 1 Seamless integration of visual and computational components in the extended StratomeX tool StratomeX also integrates a computational framework for query-based guided exploration of the stratome directly into the visualization (Fig. 1), enabling discovery of novel relationships between patient sets and efficient generation and refinement of hypotheses about tumor subtypes. A ‘query wizard’ provides step-by-step instructions (Supplementary Fig. 1 and 4) for defining queries, and a range of computational methods are used to generate rankings (Supplementary Methods). Queries score stratifications, for example, based on their overlap with a particular patient set, or based on their overall similarity to a selected stratification. Furthermore, the analyst can query the collection for stratifications that contain patient sets that exhibit differences in survival or differential regulation of pathways. We use ‘LineUp’5, a multi-attribute ranking technique, to visualize the results of these queries and to show which stratifications or pathways score high (Fig. 1 and Supplementary Fig. 5). The tight integration between the StratomeX and LineUp views, as well as the dynamic computation of scores, is essential for rapid identification of meaningful relationships between stratifications, clinical parameters, and pathways. We demonstrate the effectiveness of StratomeX in a case study (Supplementary Note, Supplementary Figs. 6-18, Supplementary Tables 2 and 3, and Supplementary Video 1) in which we explored molecular and clinical data to characterize tumor subtypes in a cohort of over 400 clear cell renal cell carcinoma cases reported by The Cancer Genome Atlas consortium6.
ieee pacific visualization symposium | 2015
Holger Stitz; Samuel Gratzl; Michael T. Krieger; Marc Streit
With the rise of virtualization and cloud-based networks of various scales and degrees of complexity, new approaches to managing such infrastructures are required. In these networks, relationships among components can be of arbitrary cardinality (1:1, 1:n, n:m), making it challenging for administrators to investigate which components influence others. In this paper we present CloudGazer, a scalable visualization system that allows users to monitor and optimize cloud-based networks effectively to reduce energy consumption and to increase the quality of service. Instead of visualizing the overall network, we split the graph into semantic perspectives that provide a much simpler view of the network. CloudGazer is a multiple coordinated view system that visualizes either static or live status information about the components of a perspective while reintroducing lost inter-perspective relationships on demand using dynamically created inlays. We demonstrate the effectiveness of CloudGazer in two usage scenarios: The first is based on a real-world network of our domain partners where static performance parameters are used to find an optimal design. In the second scenario we use the VAST 2013 Challenge dataset to demonstrate how the system can be employed with live streaming data.
ieee vgtc conference on visualization | 2016
Christian Partl; Samuel Gratzl; Marc Streit; Anne Mai Wassermann; Hanspeter Pfister; Dieter Schmalstieg; Alexander Lex
The analysis of paths in graphs is highly relevant in many domains. Typically, path‐related tasks are performed in node‐link layouts. Unfortunately, graph layouts often do not scale to the size of many real world networks. Also, many networks are multivariate, i.e., contain rich attribute sets associated with the nodes and edges. These attributes are often critical in judging paths, but directly visualizing attributes in a graph layout exacerbates the scalability problem. In this paper, we present visual analysis solutions dedicated to path‐related tasks in large and highly multivariate graphs. We show that by focusing on paths, we can address the scalability problem of multivariate graph visualization, equipping analysts with a powerful tool to explore large graphs. We introduce Pathfinder, a technique that provides visual methods to query paths, while considering various constraints. The resulting set of paths is visualized in both a ranked list and as a node‐link diagram. For the paths in the list, we display rich attribute data associated with nodes and edges, and the node‐link diagram provides topological context. The paths can be ranked based on topological properties, such as path length or average node degree, and scores derived from attribute data. Pathfinder is designed to scale to graphs with tens of thousands of nodes and edges by employing strategies such as incremental query results. We demonstrate Pathfinders fitness for use in scenarios with data from a coauthor network and biological pathways.
bioRxiv | 2018
Marc Streit; Samuel Gratzl; Holger Stitz; Andreas Wernitznig; Thomas Zichner; Christian Haslinger
Summary Ordino is a web-based analysis tool for cancer genomics that allows users to flexibly rank, filter, and explore genes, cell lines, and tissue samples based on pre-loaded data, including The Cancer Genome Atlas (TCGA), the Cancer Cell Line Encyclopedia (CCLE), and manually uploaded information. Interactive tabular data visualization that facilitates the user-driven prioritization process forms a core component of Ordino. Detail views of selected items complement the exploration. Findings can be stored, shared, and reproduced via the integrated session management. Availability and Implementation Ordino is publicly available at https://ordino.caleydoapp.org. The source code is released at https://github.com/Caleydo/ordino under the Mozilla Public License 2.0. Contact [email protected]