Bioinformatics Advance Access originally published online on November 5, 2004
Bioinformatics 2005 21(6):821-822; doi:10.1093/bioinformatics/bti113
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
CGH-Explorer: a program for analysis of array-CGH data
1Department of Informatics, University of Oslo PO Box 1080 Blindern, N-0316 Oslo, Norway
2Department of Genetics, Institute for Cancer Research, Norwegian Radium Hospital Montebello, N-0310 Oslo, Norway
3Department of Mathematics, University of Oslo PO Box 1053 Blindern, N-0316 Oslo, Norway
*To whom correspondence should be addressed.
| Abstract |
|---|
|
|
|---|
Summary: CGH-Explorer is a program for visualization and statistical analysis of microarray-based comparative genomic hybridization (array-CGH) data. The program has preprocessing facilities, tools for graphical exploration of individual arrays or groups of arrays, and tools for statistical identification of regions of amplification and deletion.
Availability: The program is available as Java class files that runs on any platform with the Java 2 runtime environment (J2SE JRE) installed, and as a Windows executable. Java source files are also available. See http://www.ifi.uio.no/bioinf/Papers/CGH/
Contact: ole{at}ifi.uio.no
| INTRODUCTION |
|---|
|
|
|---|
Genomic DNA copy number changes are key genetic events in the development of many cancers. Copy number changes vary widely with respect to genomic location, magnitude of change and size of the affected genomic region. Changes may cause dysregulation of gene expression for genes that are critical to tumorigenesis and the progression of cancer. The detection and mapping of these genes is important for understanding cancer progression and for associating aberrations with cancer phenotypes (Lengauer et al., 1998).
In array-CGH, differentially labeled tumor and normal DNA are cohybridized to a microarray of mapped sequences, and fluorescence intensities are measured. Intensity ratios are roughly proportional to the copy number ratios of the corresponding DNA samples, providing a means to quantitatively measure genomic copy number for thousands of genes at a time and to map these changes to chromosomal regions (Solinas-Toldo et al., 1997; Pinkel et al., 1998).
Earlier reportings of software for analysis of array-CGH data include a MATLAB toolbox for plotting the data genome-wide or chromosome-by-chromosome and for detecting change points (Autio et al., 2003). Their approach does not include any assessment of the statistical significance of the reported copy number changes. Also, Frankenberger et al. (2003) reported on the development of a data analysis tool for array-CGH data.
In this paper, we report on a comprehensive menu-driven stand-alone program for graphical exploration and statistical analysis of array-CGH data. The program is open-source and portable. The program supports import of text files and Excel files, data preprocessing (including missing data imputation, centering, various data transformations and smoothing), graphical exploration of individual arrays or groups of arrays on arbitrary genomic scales, and detection of statistically significant changes in copy number ratio. The program offers the user many choices to modify settings and customize views. Microarray data measuring mRNA expression levels may also be imported and displayed, either separately or together with copy number data.
| GRAPHICAL ANALYSIS |
|---|
|
|
|---|
Basic plotting functionality includes point plots, frequency plots and checkerboard plots of copy numbers as a function of genomic position. The program supports data views on arbitrary genomic scales, ranging from the whole genome to part of a chromosome. The user can zoom in on any part of the present view using the mouse to indicate the region of interest. Data from different arrays can be plotted together in the same coordinate system, with layout (color and plotting symbol) chosen independently for each array (Fig. 1).
|
Smoothing filters may be applied and the effect instantly seen on a plot. The user may choose between a symmetric moving average filter and a filter specifically designed for discontinuous signals that identifies change points (i.e. starts and ends of all amplified and deleted regions) and thus provides an estimate of the copy number ratio between each pair of consecutive change points.
Information about individual genes in a plot can be obtained in several ways. One may search for genes with names that contain a particular phrase and highlight these genes in a plot. Alternatively, pointing with the mouse at a measurement in a plot shows the name of the gene. Finally, using the mouse the user can select a collection of measurements and obtain a table with textual information about the corresponding genes and links for each gene to a genomic gateway such as Entrez or Ensembl.
| DETECTION OF SIGNIFICANT COPY NUMBER CHANGES |
|---|
|
|
|---|
Several methods for detecting copy number changes are implemented in CGH-Explorer. The most basic functionality is to manually specify an upper and lower threshold and the number of hits required within a region of specified width in order to count the region as an amplification or deletion. Thresholds may also be found automatically, using either a normal DNA reference sample or a novel bootstrap-based method implemented in CGH-Explorer. In addition, a novel method called analysis of copy errors (ACE), not based on thresholding the copy number ratios, is implemented. See the papers website for information about the above methods.
Received on April 14, 2004; revised on October 24, 2004; accepted on October 25, 2004
| REFERENCES |
|---|
|
|
|---|
Autio, R., Hautaniemi, S., Kauraniemi, P., Yli-Harja, O., Astola, J., Wolf, M., Kallioniemi, A. (2003) CGH-Plotter: MATLAB toolbox for CGH-data analysis. Bioinformatics, 19, 17141715
Frankenberger, C., Urzúa, U., Church, D., Powell, J., Burgess, T., Tawady, T., Gangi, L., Munroe, D. (2003) aCGH explorer: a tool for high-resolution analysis of chromosomal imbalances using cDNA and oligo DNA arrays (poster abstract). Biomedical Information Science and Technology Initiative (BISTI) 2003 Symposium, NIH, , Bethesda, MD November 67.
Lengauer, C., Kinzler, K.W., Vogelstein, B. (1998) Genetic instabilities in human cancers. Nature, 396, , pp. 643649[CrossRef][Medline].
Pinkel, D., Segraves, R., Sudar, D., Clark, S., Poole, I., Kowbel, D., Collins, C., Kuo, W.L., Chen, C., Zhai, Y., et al. (1998) High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nature Genetics, 20, 207211[CrossRef][ISI][Medline].
Solinas-Toldo, S., Lampel, S., Stilgenbauer, S., Nickolenko, J., Benner, A., Dohner, H., Cremer, T., Lichter, P. (1997) Matrix-based comparative genomic hybridization: biochips to screen for genomic imbalances. Genes Chromosomes Cancer, 20, 399407[CrossRef][ISI][Medline].
This article has been cited by other articles:
![]() |
I. Scheinin, S. Myllykangas, I. Borze, T. Bohling, S. Knuutila, and J. Saharinen CanGEM: mining gene copy number changes in cancer Nucleic Acids Res., January 11, 2008; 36(suppl_1): D830 - D835. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Melchor, E. Honrado, J. Huang, S. Alvarez, T. L. Naylor, M. J. Garcia, A. Osorio, D. Blesa, M. R. Stratton, B. L. Weber, et al. Estrogen Receptor Status Could Modulate the Genomic Pattern in Familial and Sporadic Breast Cancer Clin. Cancer Res., December 15, 2007; 13(24): 7305 - 7313. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lapointe, C. Li, C. P. Giacomini, K. Salari, S. Huang, P. Wang, M. Ferrari, T. Hernandez-Boussard, J. D. Brooks, and J. R. Pollack Genomic Profiling Reveals Alternative Genetic Pathways of Prostate Tumorigenesis Cancer Res., September 15, 2007; 67(18): 8504 - 8510. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Davidsson, A. Andersson, K. Paulsson, M. Heidenblad, M. Isaksson, A. Borg, J. Heldrup, M. Behrendtz, I. Panagopoulos, T. Fioretos, et al. Tiling resolution array comparative genomic hybridization, expression and methylation analyses of dup(1q) in Burkitt lymphomas and pediatric high hyperdiploid acute lymphoblastic leukemias reveal clustered near-centromeric breakpoints and overexpression of genes in 1q22-32.3 Hum. Mol. Genet., September 15, 2007; 16(18): 2215 - 2225. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Diaz-Uriarte, A. Alibes, E. R. Morrissey, A. Canada, O. M. Rueda, and M. L. Neves Asterias: integrated analysis of expression and aCGH data using an open-source, web-based, parallelized software suite Nucleic Acids Res., July 13, 2007; 35(suppl_2): W75 - W80. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, J.-B. Gao, Y. Cao, E. Bottinger, and W. Zhang Exploiting noise in array CGH data to improve detection of DNA copy number change Nucleic Acids Res., March 12, 2007; 35(5): e35 - e35. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Meza-Zepeda, S. H. Kresse, A. H. Barragan-Polania, B. Bjerkehagen, H. O. Ohnstad, H. M. Namlos, J. Wang, B. E. Kristiansen, and O. Myklebost Array Comparative Genomic Hybridization Reveals Distinct DNA Copy Number Differences between Gastrointestinal Stromal Tumors and Leiomyosarcomas. Cancer Res., September 15, 2006; 66(18): 8984 - 8993. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Lai, M. D. Johnson, R. Kucherlapati, and P. J. Park Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data Bioinformatics, October 1, 2005; 21(19): 3763 - 3770. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Y. Kim, S. W. Nam, S. H. Lee, W. S. Park, N. J. Yoo, J. Y. Lee, and Y.-J. Chung ArrayCyGHt: a web application for analysis and visualization of array-CGH data Bioinformatics, May 15, 2005; 21(10): 2554 - 2555. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





