Dianne Cook
Iowa State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Dianne Cook.
Journal of Computational and Graphical Statistics | 1996
Andreas Buja; Dianne Cook
Abstract We propose a rudimentary taxonomy of interactive data visualization based on a triad of data analytic tasks: finding Gestalt, posing queries, and making comparisons. These tasks are supported by three classes of interactive view manipulations: focusing, linking, and arranging views. This discussion extends earlier work on the principles of focusing and linking and sets them on a firmer base. Next, we give a high-level introduction to a particular system for multivariate data visualization—XGobi. This introduction is not comprehensive but emphasizes XGobi tools that are examples of focusing, linking, and arranging views; namely, high-dimensional projections, linked scatterplot brushing, and matrices of conditional plots. Finally, in a series of case studies in data visualization, we show the powers and limitations of particular focusing, linking, and arranging tools. The discussion is dominated by high-dimensional projections that form an extremely well-developed part of XGobi. Of particular inter...
Journal of Computational and Graphical Statistics | 1998
Deborah F. Swayne; Dianne Cook; Andreas Buja
Abstract XGobi is a data visualization system with state-of-the-art interactive and dynamic methods for the manipulation of views of data. It implements 2-D displays of projections of points and lines in high-dimensional spaces, as well as parallel coordinate displays and textual views thereof. Projection tools include dotplots of single variables, plots of pairs of variables, 3-D data rotations, various grand tours, and interactive projection pursuit. Views of the data can be reshaped. Points can be labeled and brushed with glyphs and colors. Lines can be edited and colored. Several XGobi processes can be run simultaneously and linked for labeling, brushing, and sharing of projections. Missing data are accommodated and their patterns can be examined; multiple imputations can be given to XGobi for rapid visual diagnostics. XGobi includes an extensive online help facility. XGobi can be integrated in other software systems, as has been done for the data analysis language S, the geographic information system...
Computational Statistics & Data Analysis | 2003
Deborah F. Swayne; Duncan Temple Lang; Andreas Buja; Dianne Cook
GGobi is a direct descendent of a data visualization system called XGobi that has been around since the early 1990s. GGobis new features include multiple plotting windows, a color lookup table manager, and an Extensible Markup Language file format for data. Perhaps the biggest advance is that GGobi can be easily extended, either by being embedded in other software or by the addition of plugins; either way, it can be controlled using an Application Programming Interface. An illustration of its extensibility is that it can be embedded in R. The result is a full marriage between GGobis direct manipulation graphical environment and Rs familiar extensible environment for statistical data analysis.
Journal of Computational and Graphical Statistics | 1995
Dianne Cook; Andreas Buja; Javier Cabrera; Catherine B. Hurley
Abstract The grand tour and projection pursuit are two methods for exploring multivariate data. We show how to combine them into a dynamic graphical tool for exploratory data analysis, called a projection pursuit guided tour. This tool assists in clustering data when clusters are oddly shaped and in finding general low-dimensional structure in high-dimensional, and in particular, sparse data. An example shows that the method, which is projection-based, can be quite powerful in situations that may cause grief for methods based on kernel smoothing. The projection pursuit guided tour is also useful for comparing and developing projection pursuit indexes and illustrating some types of asymptotic results.
Genome Biology | 2012
Tengfei Yin; Dianne Cook; Michael Lawrence
We introduce ggbio, a new methodology to visualize and explore genomics annotationsand high-throughput data. The plots provide detailed views of genomic regions,summary views of sequence alignments and splicing patterns, and genome-wide overviewswith karyogram, circular and grand linear layouts. The methods leverage thestatistical functionality available in R, the grammar of graphics and the datahandling capabilities of the Bioconductor project. The plots are specified within amodular framework that enables users to construct plots in a systematic way, and aregenerated directly from Bioconductor data structures. The ggbio R package isavailable athttp://www.bioconductor.org/packages/2.11/bioc/html/ggbio.html.
Archive | 2007
Dianne Cook; Deborah F. Swayne
This richly illustrated book describes the use of interactive and dynamic graphics as part of multidimensional data analysis. Chapters include clustering, supervised classification, and working with missing values. A variety of plots and interaction methods are used in each analysis, often starting with brushing linked low-dimensional views and working up to manual manipulation of tours of several variables. The role of graphical methods is shown at each step of the analysis, not only in the early exploratory phase, but in the later stages, too, when comparing and evaluating models. All examples are based on freely available software: GGobi for interactive graphics and R for static graphics, modeling, and programming. The printed book is augmented by a wealth of material on the web, encouraging readers follow the examples themselves. The web site has all the data and code necessary to reproduce the analyses in the book, along with movies demonstrating the examples. The book may be used as a text in a class on statistical graphics or exploratory data analysis, for example, or as a guide for the independent learner. Each chapter ends with a set of exercises. The authors are both Fellows of the American Statistical Association, past chairs of the Section on Statistical Graphics, and co-authors of the GGobi software. Dianne Cook is Professor of Statistics at Iowa State University. Deborah Swayne is a member of the Statistics Research Department at AT&T Labs.
Nucleic Acids Research | 2004
Lishuang Shen; Jian Gong; Rico A. Caldo; Dan Nettleton; Dianne Cook; Roger P. Wise; Julie A. Dickerson
BarleyBase (BB) (www.barleybase.org) is an online database for plant microarrays with integrated tools for data visualization and statistical analysis. BB houses raw and normalized expression data from the two publicly available Affymetrix genome arrays, Barley1 and Arabidopsis ATH1 with plans to include the new Affymetrix 61K wheat, maize, soybean and rice arrays, as they become available. BB contains a broad set of query and display options at all data levels, ranging from experiments to individual hybridizations to probe sets down to individual probes. Users can perform cross-experiment queries on probe sets based on observed expression profiles and/or based on known biological information. Probe set queries are integrated with visualization and analysis tools such as the R statistical toolbox, data filters and a large variety of plot types. Controlled vocabularies for gene and plant ontologies, as well as interconnecting links to physical or genetic map and other genomic data in PlantGDB, Gramene and GrainGenes, allow users to perform EST alignments and gene function prediction using Barley1 exemplar sequences, thus, enhancing cross-species comparison.
Philosophical Transactions of the Royal Society A | 2009
Andreas Buja; Dianne Cook; Heike Hofmann; Michael S. Lawrence; Eun-Kyung Lee; Deborah F. Swayne; Hadley Wickham
We propose to furnish visual statistical methods with an inferential framework and protocol, modelled on confirmatory statistical testing. In this framework, plots take on the role of test statistics, and human cognition the role of statistical tests. Statistical significance of ‘discoveries’ is measured by having the human viewer compare the plot of the real dataset with collections of plots of simulated datasets. A simple but rigorous protocol that provides inferential validity is modelled after the ‘lineup’ popular from criminal legal procedures. Another protocol modelled after the ‘Rorschach’ inkblot test, well known from (pop-)psychology, will help analysts acclimatize to random variability before being exposed to the plot of the real data. The proposed protocols will be useful for exploratory data analysis, with reference datasets simulated by using a null assumption that structure is absent. The framework is also useful for model diagnostics in which case reference datasets are simulated from the model in question. This latter point follows up on previous proposals. Adopting the protocols will mean an adjustment in working procedures for data analysts, adding more rigour, and teachers might find that incorporating these protocols into the curriculum improves their students’ statistical thinking.
Comparative and Functional Genomics | 2003
Eve Syrkin Wurtele; Jie Li; Lixia Diao; Hailong Zhang; Carol M. Foster; Beth Fatland; Julie A. Dickerson; Andrew W. Brown; Zach Cox; Dianne Cook; Eun Lee; Heike Hofmann
MetNet (http://www.botany.iastate.edu/∼mash/metnetex/metabolicnetex.html) is publicly available software in development for analysis of genome-wide RNA, protein and metabolite profiling data. The software is designed to enable the biologist to visualize, statistically analyse and model a metabolic and regulatory network map of Arabidopsis, combined with gene expression profiling data. It contains a JAVA interface to an interactions database (MetNetDB) containing information on regulatory and metabolic interactions derived from a combination of web databases (TAIR, KEGG, BRENDA) and input from biologists in their area of expertise. FCModeler captures input from MetNetDB in a graphical form. Sub-networks can be identified and interpreted using simple fuzzy cognitive maps. FCModeler is intended to develop and evaluate hypotheses, and provide a modelling framework for assessing the large amounts of data captured by high-throughput gene expression experiments. FCModeler and MetNetDB are currently being extended to three-dimensional virtual reality display. The MetNet map, together with gene expression data, can be viewed using multivariate graphics tools in GGobi linked with the data analytic tools in R. Users can highlight different parts of the metabolic network and see the relevant expression data highlighted in other data plots. Multi-dimensional expression data can be rotated through different dimensions. Statistical analysis can be computed alongside the visual. MetNet is designed to provide a framework for the formulation of testable hypotheses regarding the function of specific genes, and in the long term provide the basis for identification of metabolic and regulatory networks that control plant composition and development.
Computers & Geosciences | 1997
Dianne Cook; Jürgen Symanzik; James J. Majure; Noel A Cressie
Abstract This document describes the linking of two software packages to provide exploratory dynamic graphical tools directly from within a geographic information system (GIS). The GIS we have used is ArcView 2.1, which is a widely used package for examining maps and images. XGobi is the dynamic graphics package that is publicly available and also widely used for exploring multivariate data. The link involves cross-referencing each location in the map view provided by ArcView with a multitude of plot types in XGobi. This cross-referencing allows a user to interact with either the map view or the XGobi plots by painting (that is, brushing) groups of points with different colors or glyph (shape) types, identifying points, or erasing them, and have the similar changes made automatically in the other view. Examples of the different types of plots and interactions are given in the paper in both image and video format. ArcView is a registered trademark of Environmental Systems Research Institute, Inc., Redlands, CA, U.S.A.