Skip Navigation


Bioinformatics Advance Access originally published online on February 1, 2008
Bioinformatics 2008 24(6):878-879; doi:10.1093/bioinformatics/btn021
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
24/6/878    most recent
btn021v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (3)
Google Scholar
Right arrow Articles by Sarkar, D.
Right arrow Articles by Gentleman, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sarkar, D.
Right arrow Articles by Gentleman, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Using flowViz to visualize flow cytometry data

D. Sarkar *, N. Le Meur and R. Gentleman

Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DATA STRUCTURES
 3 VISUALIZATION
 ACKNOWLEDGEMENT
 REFERENCES
 

Summary: Automated analysis of flow cytometry (FCM) data is essential for it to become successful as a high throughput technology. We believe that the principles of Trellis graphics can be adapted to provide useful visualizations that can aid such automation. In this article, we describe the R/Bioconductor package flowViz that implements such visualizations.

Availability: flowViz is available as an R package from the Bioconductor project: http://bioconductor.org

Contact: dsarkar{at}fhcrc.org


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DATA STRUCTURES
 3 VISUALIZATION
 ACKNOWLEDGEMENT
 REFERENCES
 
Traditionally, flow cytometry (FCM) has been a tube-based technique limited to small-scale laboratory studies. High throughput methods have recently been developed and are now used in both basic and clinical research. One of the long-standing challenges in the analysis of FCM data has been calibration, or normalization, as the measurements made on each cell vary by non-biological factors such as machine, experimenter and date. The traditional approach has been to manually curate each sample, usually through the process of hand-gating. This approach is untenable in the high-throughput paradigm, and research on automating analysis of FCM data is ongoing. However, any automated method, however efficient, is bound to fail on occasion. Thus, it is important to have diagnostic tools that can quickly identify such failures so that they can be dealt with appropriately. In this article, we give examples of graphical diagnostics and quality assessment applications using the Bioconductor package flowViz, which adapts principles of Trellis graphics (Becker et al., 1996; Cleveland, 1993) to FCM data. The primary challenges in implementation arise from the need to handle the high volume of data typical in FCM experiments and the multivariate nature of the data.


    2 DATA STRUCTURES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DATA STRUCTURES
 3 VISUALIZATION
 ACKNOWLEDGEMENT
 REFERENCES
 
flowViz uses data structures defined in the flowCore package, also available from Bioconductor. FCM data are typically stored in the form of FCS files, which are represented as flowFrame objects in flowCore. Experiments, usually consisting of multiple FCS files, are organized using the flowSet class, which can efficiently host multiple flowFrames. Like most Bioconductor organizational classes, a flowSet also contains experimental meta-data.

As an example, we use the GvHD dataset available in flowCore. The data are a subset of an experiment (Brinkman et al., 2007) that originated from a collection of weekly peripheral blood samples from patients following allogenic blood and marrow transplant. The goal of the study was to identify cellular markers that would predict the development of Graft versus Host Disease (GvHD). Samples were taken at various time points and labeled with four different fluorescent markers whose intensities were determined in addition to the usual forward and side scatter measurements.

Transforming the measured fluorescent intensities is often helpful, especially for visualization. flowCore provides a number of commonly used parameterized transforms in an abstract form that can be applied to all flowFrames in a flowSet:

> data(GvHD)

> GvHD.trans < –

    transform("FSC – H" = asinh, "SSC – H" = asinh,

        "FL1 – H" = asinh, "FL2 – H" = asinh,

        "FL4 – H" = asinh) %on% GvHD


    3 VISUALIZATION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DATA STRUCTURES
 3 VISUALIZATION
 ACKNOWLEDGEMENT
 REFERENCES
 
Figure 1 A plots the empirical CDF of the FL2-H channel for all samples, using one panel for every patient. Noting that patient 10 seems to have unusual samples, Figure 1 B has one panel per visit, just for patient 10. These plots are implemented in the flowViz package using the infrastructure from the R package lattice, which also provides the model for the deceptively simple formula interface.


Figure 1
View larger version (29K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. (A) ECDF plots of FL2-H for five patients; each line represents a different visit. Patient 10 has two samples (visits) that look different from the others. (B) ECDF plots of FL2-H restricted to Patient 10. We can now see that the samples taken 6 and 13 days after transplant were the unusual ones. (C) Smoothed scatter plots of SSC-H and FL2-H for patient 10. The result of a data-driven filter is superposed, and we can see that a different population has been identified for the sample taken at day 6.

 
A common task in the analysis of FCM data is some form of filtering (gating), either to obtain summary statistics about the number of events that meet a certain criteria or to perform further analysis on a subset of the data. flowCore implements some data-driven filters not usually found in flow cytometry software; for example, norm2Filter implements a robust method for finding a region that most resembles a bivariate Normal distribution. Such filters can be applied to a dataset with the intent of defining a population of ‘live cells’, e.g. and restricting further analysis to it. Filters can also be supplied to certain visualization methods that include it in the display, as we do in Figure 1C. As we can see, the samples previously noted as ‘unusual’ (Days 6 and 13) seem to have two distinct populations, and the automated filter has chosen a different one in each. Depending on the purpose of the analysis, the user may wish to manually intervene at this point. These plots are complemented by numerical summaries, such as the IQR values in Table 1. Another useful piece of information is the time associated with each observation. Figure 2 plots side scatter values over time for each sample from patient 10, showing not only different distributions across samples, but also an unusual artifact for the sample taken 6 days before transplant.


Figure 2
View larger version (69K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 2. Side scatter values over time (scaled separately for each sample) for patient 10. The first sample (labeled –6) possibly had problems (e.g., bubbles, clogs, etc.) in the initial period of collection.

 

View this table:
[in this window]
[in a new window]

 
Table 1. Interquartile range of all channels for patient 10

 

    ACKNOWLEDGEMENT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DATA STRUCTURES
 3 VISUALIZATION
 ACKNOWLEDGEMENT
 REFERENCES
 
The work on this manuscript was supported by NIH 1 R01 EB005034, Bioinformatics Standards for Flow Cytometry.

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Olga Troyanskaya

Received on September 18, 2007; revised on December 18, 2007; accepted on January 11, 2008

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DATA STRUCTURES
 3 VISUALIZATION
 ACKNOWLEDGEMENT
 REFERENCES
 

    Becker RA, et al. The visual design and control of trellis display. JCGS (1996) 5:123–155.

    Brinkman RR, et al. High- content flow cytometry and temporal data analysis for defining a cellular signature of graft-versus-host disease. BBMT (2007) 13:671–700.

    Cleveland WS. Visualizing Data (1993) Summit, New Jersey: Hobart Press.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
24/6/878    most recent
btn021v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (3)
Google Scholar
Right arrow Articles by Sarkar, D.
Right arrow Articles by Gentleman, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sarkar, D.
Right arrow Articles by Gentleman, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?