Skip Navigation


Bioinformatics Advance Access originally published online on July 29, 2004
Bioinformatics 2004 20(18):3533-3543; doi:10.1093/bioinformatics/bth440
This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow All Versions of this Article:
20/18/3533    most recent
bth440v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Myers, C. L.
Right arrow Articles by Troyanskaya, O. G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Myers, C. L.
Right arrow Articles by Troyanskaya, O. G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics vol. 20 issue 18 © Oxford University Press 2004; all rights reserved.

Accurate detection of aneuploidies in array CGH and gene expression microarray data

Chad L. Myers 1,3, Maitreya J. Dunham 1, S. Y. Kung 2 and Olga G. Troyanskaya 1,3,*

1 Lewis-Sigler Institute for Integrative Genomics, Carl Icahn Laboratory, 2 Department of Electrical Engineering and 3 Department of Computer Science, Princeton University, Princeton, NJ 08544, USA

Received on May 28, 2004; revised on July 12, 2004; accepted on July 24, 2004
Advance Access Publication July 29, 2004

Motivation: Chromosomal copy number changes (aneuploidies) are common in cell populations that undergo multiple cell divisions including yeast strains, cell lines and tumor cells. Identification of aneuploidies is critical in evolutionary studies, where changes in copy number serve an adaptive purpose, as well as in cancer studies, where amplifications and deletions of chromosomal regions have been identified as a major pathogenetic mechanism. Aneuploidies can be studied on whole-genome level using array CGH (a microarray-based method that measures the DNA content), but their presence also affects gene expression. In gene expression microarray analysis, identification of copy number changes is especially important in preventing aberrant biological conclusions based on spurious gene expression correlation or masked phenotypes that arise due to aneuploidies. Previously suggested approaches for aneuploidy detection from microarray data mostly focus on array CGH, address only whole-chromosome or whole-arm copy number changes, and rely on thresholds or other heuristics, making them unsuitable for fully automated general application to gene expression datasets. There is a need for a general and robust method for identification of aneuploidies of any size from both array CGH and gene expression microarray data.

Results: We present ChARM (Chromosomal Aberration Region Miner), a robust and accurate expectation–maximization based method for identification of segmental aneuploidies (partial chromosome changes) from gene expression and array CGH microarray data. Systematic evaluation of the algorithm on synthetic and biological data shows that the method is robust to noise, aneuploidal segment size and P-value cutoff. Using our approach, we identify known chromosomal changes and predict novel potential segmental aneuploidies in commonly used yeast deletion strains and in breast cancer. ChARM can be routinely used to identify aneuploidies in array CGH datasets and to screen gene expression data for aneuploidies or array biases. Our methodology is sensitive enough to detect statistically significant and biologically relevant aneuploidies even when expression or DNA content changes are subtle as in mixed populations of cells.

Availability: Code available by request from the authors and on Web supplement at http://function.cs.princeton.edu/ChARM/

Contact: ogt{at}cs.princeton.edu

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
S. Sindi, E. Helman, A. Bashir, and B. J. Raphael
A geometric approach for classification and comparison of structural variants
Bioinformatics, June 15, 2009; 25(12): i222 - i230.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Z. Barutcuoglu, E. M. Airoldi, V. Dumeaux, R. E. Schapire, and O. G. Troyanskaya
Aneuploidy prediction and tumor classification with heterogeneous hidden conditional random fields
Bioinformatics, May 15, 2009; 25(10): 1307 - 1313.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
C. Belloch, R. Perez-Torrado, S. S. Gonzalez, J. E. Perez-Ortin, J. Garcia-Martinez, A. Querol, and E. Barrio
Chimeric Genomes of Natural Hybrids of Saccharomyces cerevisiae and Saccharomyces kudriavzevii
Appl. Envir. Microbiol., April 15, 2009; 75(8): 2534 - 2544.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
L.-y. Wang, A. Abyzov, J. O. Korbel, M. Snyder, and M. Gerstein
MSB: A mean-shift-based approach for the analysis of structural variation in the genome
Genome Res., January 1, 2009; 19(1): 106 - 117.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. Erdman and J. W. Emerson
A fast Bayesian change point analysis for the segmentation of microarray data
Bioinformatics, October 1, 2008; 24(19): 2143 - 2148.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Andersson, C. E. G. Bruder, A. Piotrowski, U. Menzel, H. Nord, J. Sandgren, T. R. Hvidsten, T. Diaz de Stahl, J. P. Dumanski, and J. Komorowski
A segmental maximum a posteriori approach to genome-wide copy number profiling
Bioinformatics, March 15, 2008; 24(6): 751 - 758.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. Rigaill, P. Hupe, A. Almeida, P. La Rosa, J.-P. Meyniel, C. Decraene, and E. Barillot
ITALICS: an algorithm for normalization and DNA copy number calling for Affymetrix SNP arrays
Bioinformatics, March 15, 2008; 24(6): 768 - 774.
[Abstract] [Full Text] [PDF]


Home page
BiostatisticsHome page
R. Tibshirani and P. Wang
Spatial smoothing and hot spot detection for CGH data using the fused lasso
Biostat., January 1, 2008; 9(1): 18 - 29.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Buness, R. Kuner, M. Ruschhaupt, A. Poustka, H. Sultmann, and A. Tresch
Identification of aberrant chromosomal regions from gene expression microarray studies applied to human breast cancer
Bioinformatics, September 1, 2007; 23(17): 2273 - 2280.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Stjernqvist, T. Ryden, M. Skold, and J. Staaf
Continuous-index hidden Markov modelling of array CGH copy number data
Bioinformatics, April 15, 2007; 23(8): 1006 - 1014.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Hu, J.-B. Gao, Y. Cao, E. Bottinger, and W. Zhang
Exploiting noise in array CGH data to improve detection of DNA copy number change
Nucleic Acids Res., March 12, 2007; 35(5): e35 - e35.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
C. Morrison, M. Radmacher, N. Mohammed, D. Suster, H. Auer, S. Jones, J. Riggenbach, N. Kelbick, G. Bos, and J. Mayerson
MYC Amplification and Polysomy 8 in Chondrosarcoma: Array Comparative Genomic Hybridization, Fluorescent In Situ Hybridization, and Association With Outcome
J. Clin. Oncol., December 20, 2005; 23(36): 9369 - 9376.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Willenbrock and J. Fridlyand
A comparison study: applying segmentation to array CGH data for downstream analyses
Bioinformatics, November 15, 2005; 21(22): 4084 - 4091.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
W. R. Lai, M. D. Johnson, R. Kucherlapati, and P. J. Park
Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data
Bioinformatics, October 1, 2005; 21(19): 3763 - 3770.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. A. Margolin, J. Greshock, T. L. Naylor, Y. Mosse, J. M. Maris, G. Bignell, A. I. Saeed, J. Quackenbush, and B. L. Weber
CGHAnalyzer: a stand-alone software package for cancer genome analysis using array-based DNA copy number data
Bioinformatics, August 1, 2005; 21(15): 3308 - 3311.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.