Dynamic Visualization of Coexpression in Systems Genetics Data
Biologists hope to address grand scientific challenges by exploring
the abundance of data made available through microarray
analysis and other high-throughput techniques.
However, the impact of this large volume of data is limited unless
researchers can effectively assimilate the entirety of this complex
information and integrate it into their daily research;
interactive visualization tools are called for to support the effort.
Specifically, typical studies of gene coexpression can make use of
novel visualization tools that enable the dynamic formulation and
fine-tuning of hypotheses to aid the process of evaluating
sensitivity of key parameters and achieving data reduction.
These tools should allow biologists to develop an intuitive
understanding of the structure of biological networks and discover
genes that reside in critical positions in networks and pathways.
By using a graph as a universal data representation of correlation
in gene expression data, our novel visualization tool employs several
techniques that when used in an integrated manner provide innovative
analytical capabilities.
Our tool for interacting with gene coexpression data integrates
techniques such as graph layout, qualitative subgraph extraction
through a novel 2D user interface, quantitative data selection
using graph-theoretic algorithms or by querying an optimized B-tree,
dynamic level-of-detail graph abstraction, and template-based fuzzy
classification using neural networks.
We demonstrate our system using a real-world workflow from a
large-scale systems genetics study of mammalian gene coexpression.
This work has been published in IEEE Transactions on Visualization and Computer
Graphics, Vol. 14, No. 5, 2008.
pdf
In the screenshot above, two gene networks (bottom left and right) have
been discovered with a single putatively coregulating gene as a potential
target of knock-out study (center) with proximity information for other
potential regulatory genes (top left) undergoing further study. This
illustrates the discovery of candidate genes, which can affect expression
of several genes throughout the genome that play a role in the locomotor
response of mice exposed to methamphetamine and cocaine.
Credits:
The systems genetics data was provided by Dr. Elissa Chesler
et al. under the auspices of Oak Ridge National Lab's Life Sciences Division
while subsequent paraclique extraction and data processing was performed by
Dr. Michael Langston et al. as part of the ongoing collaboration between
The University of Tennessee and Oak Ridge National Laboratory.
Funding:
The work is funded in part by US National Science Foundation
(NSF) CNS-0437508 and through the US Department of
Energy (DOE) SciDAC Institute of Ultra-Scale Visualization
under DOE DE-FC02-06ER25778. Elissa J. Chesler is supported
by NIH/NIAAA Integrative Neuroscience Initiative
on Alcoholism (U01AA13499, U24AA13513, U01AA016662),
NIH/NIDA R01DA020677, and NICHD R01HD052472-01.
Back to Research Highlights