Bioinformatics Advance Access originally published online on August 4, 2005
Bioinformatics 2005 21(19):3763-3770; doi:10.1093/bioinformatics/bti611
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data
1Harvard-Partners Center for Genetics and Genomics 77 Avenue Louis Pasteur, Boston, MA 02115, USA
2Department of Neurological Surgery, Brigham and Women's Hospital and Harvard Medical School 75 Francis Street, Boston, MA 02115, USA
3Children's Hospital Informatics Program 300 Longwood Ave, Boston, MA 02115, USA
*To whom correspondence should be addressed.
Motivation: Array Comparative Genomic Hybridization (CGH) can reveal chromosomal aberrations in the genomic DNA. These amplifications and deletions at the DNA level are important in the pathogenesis of cancer and other diseases. While a large number of approaches have been proposed for analyzing the large array CGH datasets, the relative merits of these methods in practice are not clear.
Results: We compare 11 different algorithms for analyzing array CGH data. These include both segment detection methods and smoothing methods, based on diverse techniques such as mixture models, Hidden Markov Models, maximum likelihood, regression, wavelets and genetic algorithms. We compute the Receiver Operating Characteristic (ROC) curves using simulated data to quantify sensitivity and specificity for various levels of signal-to-noise ratio and different sizes of abnormalities. We also characterize their performance on chromosomal regions of interest in a real dataset obtained from patients with Glioblastoma Multiforme. While comparisons of this type are difficult due to possibly sub-optimal choice of parameters in the methods, they nevertheless reveal general characteristics that are helpful to the biological investigator.
Contact: peter_park{at}harvard.edu
Received on June 17, 2005; revised on July 12, 2005; accepted on August 2, 2005
This article has been cited by other articles:
![]() |
L. Winchester, C. Yau, and J. Ragoussis Comparing CNV detection methods for SNP arrays Brief Funct Genomic Proteomic, September 8, 2009; (2009) elp017v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Bengtsson, P. Wirapati, and T. P. Speed A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 Bioinformatics, September 1, 2009; 25(17): 2149 - 2156. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. M. Rueda and R. Diaz-Uriarte RJaCGH: Bayesian analysis of aCGH arrays for detecting copy number changes and recurrent regions Bioinformatics, August 1, 2009; 25(15): 1959 - 1960. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. LaFramboise Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances Nucleic Acids Res., July 1, 2009; (2009) gkp552v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Barutcuoglu, E. M. Airoldi, V. Dumeaux, R. E. Schapire, and O. G. Troyanskaya Aneuploidy prediction and tumor classification with heterogeneous hidden conditional random fields Bioinformatics, May 15, 2009; 25(10): 1307 - 1313. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Pique-Regi, A. Ortega, and S. Asgharzadeh Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA Bioinformatics, May 15, 2009; 25(10): 1223 - 1230. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Nilsson, M. Johansson, F. Al-Shahrour, A. E. Carpenter, and B. L. Ebert Ultrasome: efficient aberration caller for copy number studies of ultra-high resolution Bioinformatics, April 15, 2009; 25(8): 1078 - 1079. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Budinska, E. Gelnarova, and M. G. Schimek MSMAD: a computationally efficient method for the analysis of noisy array CGH data Bioinformatics, March 15, 2009; 25(6): 703 - 713. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-y. Wang, A. Abyzov, J. O. Korbel, M. Snyder, and M. Gerstein MSB: A mean-shift-based approach for the analysis of structural variation in the genome Genome Res., January 1, 2009; 19(1): 106 - 117. [Abstract] [Full Text] [PDF] |
||||
![]() |
P.-A. Chen, H.-F. Liu, and K.-M. Chao CNVDetector: locating copy number variations using array CGH data Bioinformatics, December 1, 2008; 24(23): 2773 - 2775. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Erdman and J. W. Emerson A fast Bayesian change point analysis for the segmentation of microarray data Bioinformatics, October 1, 2008; 24(19): 2143 - 2148. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-I H. Chen, F.-H. Hsu, Y. Jiang, M.-H. Tsai, P.-C. Yang, P. S. Meltzer, E. Y. Chuang, and Y. Chen A probe-density-based analysis method for array CGH data: simulation, normalization and centralization Bioinformatics, August 15, 2008; 24(16): 1749 - 1756. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Ben-Yaacov and Y. C. Eldar A fast and flexible method for the segmentation of aCGH data Bioinformatics, August 15, 2008; 24(16): i139 - i145. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Cahan, L. E. Godfrey, P. S. Eis, T. A. Richmond, R. R. Selzer, M. Brent, H. L. McLeod, T. J. Ley, and T. A. Graubert wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data Nucleic Acids Res., April 1, 2008; 36(7): e41 - e41. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. L. Lai, H. Xing, and N. Zhang Stochastic segmentation models for array-based comparative genomic hybridization data analysis Biostat., April 1, 2008; 9(2): 290 - 307. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Lee, S. W. Kong, and P. J. Park Integrative analysis reveals the direct and indirect interactions between DNA copy number aberrations and gene expression changes Bioinformatics, April 1, 2008; 24(7): 889 - 896. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Lai, V. Choudhary, and P. J. Park CGHweb: a tool for comparing DNA copy number segmentations from multiple algorithms Bioinformatics, April 1, 2008; 24(7): 1014 - 1015. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Andersson, C. E. G. Bruder, A. Piotrowski, U. Menzel, H. Nord, J. Sandgren, T. R. Hvidsten, T. Diaz de Stahl, J. P. Dumanski, and J. Komorowski A segmental maximum a posteriori approach to genome-wide copy number profiling Bioinformatics, March 15, 2008; 24(6): 751 - 758. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Klijn, H. Holstege, J. de Ridder, X. Liu, M. Reinders, J. Jonkers, and L. Wessels Identification of cancer genes using a statistical framework for multiexperiment analysis of nondiscretized array CGH data Nucleic Acids Res., February 2, 2008; 36(2): e13 - e13. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Pique-Regi, J. Monso-Varona, A. Ortega, R. C. Seeger, T. J. Triche, and S. Asgharzadeh Sparse representation and Bayesian detection of genome copy number alterations from microarray data Bioinformatics, February 1, 2008; 24(3): 309 - 318. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Li and J. Zhu Analysis of array CGH data for cancer studies using fused quantile regression Bioinformatics, September 15, 2007; 23(18): 2470 - 2476. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Huang, A. Gusnanto, K. O'Sullivan, J. Staaf, A. Borg, and Y. Pawitan Robust smooth segmentation approach for array CGH data analysis Bioinformatics, September 15, 2007; 23(18): 2463 - 2469. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Conde, D. Montaner, J. Burguet-Castell, J. Tarraga, I. Medina, F. Al-Shahrour, and J. Dopazo ISACGH: a web-based environment for the analysis of Array CGH and gene expression which includes functional profiling Nucleic Acids Res., July 13, 2007; 35(suppl_2): W81 - W85. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Diaz-Uriarte, A. Alibes, E. R. Morrissey, A. Canada, O. M. Rueda, and M. L. Neves Asterias: integrated analysis of expression and aCGH data using an open-source, web-based, parallelized software suite Nucleic Acids Res., July 13, 2007; 35(suppl_2): W75 - W80. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Xing, C. M. T. Greenwood, and S. B. Bull A hierarchical clustering method for estimating copy number variation Biostat., July 1, 2007; 8(3): 632 - 653. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Stjernqvist, T. Ryden, M. Skold, and J. Staaf Continuous-index hidden Markov modelling of array CGH copy number data Bioinformatics, April 15, 2007; 23(8): 1006 - 1014. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Park Integrated Analysis of Array-CGH and Expression Data Am. Assoc. Cancer Res. Educ. Book, April 14, 2007; 2007(1): 209 - 212. [Full Text] [PDF] |
||||
![]() |
E. S. Venkatraman and A. B. Olshen A faster circular binary segmentation algorithm for the analysis of array CGH data Bioinformatics, March 15, 2007; 23(6): 657 - 663. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Oosting, E. H. Lips, R. van Eijk, P. H.C. Eilers, K. Szuhai, C. Wijmenga, H. Morreau, and T. van Wezel High-resolution copy number analysis of paraffin-embedded archival tissue using SNP BeadArrays Genome Res., March 1, 2007; 17(3): 368 - 376. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, J.-B. Gao, Y. Cao, E. Bottinger, and W. Zhang Exploiting noise in array CGH data to improve detection of DNA copy number change Nucleic Acids Res., February 1, 2007; (2007) gkl730v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Liu, P. J. Park, W. Lai, E. Maher, A. Chakravarti, L. Durso, X. Jiang, Y. Yu, A. Brosius, M. Thomas, et al. A Genome-Wide Screen Reveals Functional Gene Clusters in the Cancer Genome and Identifies EphA2 as a Mitogen in Glioblastoma Cancer Res., November 15, 2006; 66(22): 10815 - 10823. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Peiffer, J. M. Le, F. J. Steemers, W. Chang, T. Jenniges, F. Garcia, K. Haden, J. Li, C. A. Shaw, J. Belmont, et al. High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping Genome Res., September 1, 2006; 16(9): 1136 - 1148. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ji, J. Kumm, M. Zhang, K. Farnam, K. Salari, M. Faham, J. M. Ford, and R. W. Davis Molecular Inversion Probe Analysis of Gene Copy Alterations Reveals Distinct Categories of Colorectal Carcinoma Cancer Res., August 15, 2006; 66(16): 7910 - 7919. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pounds and C. Cheng Robust estimation of the false discovery rate Bioinformatics, August 15, 2006; 22(16): 1979 - 1987. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Marioni, N. P. Thorne, and S. Tavare BioHMM: a heterogeneous hidden Markov model for segmenting array CGH data Bioinformatics, May 1, 2006; 22(9): 1144 - 1146. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Le Caignec, C. Spits, K. Sermon, M. De Rycke, B. Thienpont, S. Debrock, C. Staessen, Y. Moreau, J.-P. Fryns, A. Van Steirteghem, et al. Single-cell chromosomal imbalances detection by array CGH. Nucleic Acids Res., January 1, 2006; 34(9): e68 - e68. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. van den IJssel, M. Tijssen, S.-F. Chin, P. Eijk, B. Carvalho, E. Hopmans, H. Holstege, D. K. Bangarusamy, J. Jonkers, G. A. Meijer, et al. Human and mouse oligonucleotide-based array CGH Nucleic Acids Res., December 16, 2005; 33(22): e192 - e192. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Willenbrock and J. Fridlyand A comparison study: applying segmentation to array CGH data for downstream analyses Bioinformatics, November 15, 2005; 21(22): 4084 - 4091. [Abstract] [Full Text] [PDF] |
||||






