Skip Navigation

Bioinformatics 2006 22(14):e431-e439; doi:10.1093/bioinformatics/btl238
This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Shah, S. P.
Right arrow Articles by Murphy, K. P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shah, S. P.
Right arrow Articles by Murphy, K. P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@oxfordjournals.org

Integrating copy number polymorphisms into array CGH analysis using a robust HMM

Sohrab P. Shah 1,*, Xiang Xuan 1, Ron J. DeLeeuw 2, Mehrnoush Khojasteh 2, Wan L. Lam 2, Raymond Ng 1 and Kevin P. Murphy 1

1 Department of Computer Science, University of British Columbia 201-2366 Main Mall Vancouver, BC V6T 1Z4 Canada
2 British Columbia Cancer Research Centre 675 West 10th Avenue Vancouver, BC V5Z 1L3 Canada

*To whom correspondence should be addressed.

Motivation: Array comparative genomic hybridization (aCGH) is a pervasive technique used to identify chromosomal aberrations in human diseases, including cancer. Aberrations are defined as regions of increased or decreased DNA copy number, relative to a normal sample. Accurately identifying the locations of these aberrations has many important medical applications. Unfortunately, the observed copy number changes are often corrupted by various sources of noise, making the boundaries hard to detect. One popular current technique uses hidden Markov models (HMMs) to divide the signal into regions of constant copy number called segments; a subsequent classification phase labels each segment as a gain, a loss or neutral. Unfortunately, standard HMMs are sensitive to outliers, causing over-segmentation, where segments erroneously span very short regions.

Results: We propose a simple modification that makes the HMM robust to such outliers. More importantly, this modification allows us to exploit prior knowledge about the likely location of "outliers", which are often due to copy number polymorphisms (CNPs). By "explaining away" these outliers with prior knowledge about the locations of CNPs, we can focus attention on the more clinically relevant aberrated regions. We show significant improvements over the current state of the art technique (DNAcopy with MergeLevels) on previously published data from mantle cell lymphoma cell lines, and on published benchmark synthetic data augmented with outliers.

Availability: Source code written in Matlab is available from http://www.cs.ubc.ca/~sshah/acgh.

Contact: sshah{at}cs.ubc.ca



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BiostatisticsHome page
C. D. Greenman, G. Bignell, A. Butler, S. Edkins, J. Hinton, D. Beare, S. Swamy, T. Santarius, L. Chen, S. Widaa, et al.
PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data
Biostat., October 15, 2009; (2009) kxp045v1.
[Abstract] [Full Text] [PDF]


Home page
BiostatisticsHome page
S. Stjernqvist and T. Ryden
A continuous-index hidden Markov jump process for modeling DNA copy number data
Biostat., October 1, 2009; 10(4): 773 - 778.
[Abstract] [Full Text] [PDF]


Home page
Endocr Relat CancerHome page
F. H van Nederveen, E. Korpershoek, R. J deLeeuw, A. A Verhofstad, J. W Lenders, W. N M Dinjens, W. L Lam, and R. R de Krijger
Array-comparative genomic hybridization in sporadic benign pheochromocytomas
Endocr. Relat. Cancer, June 1, 2009; 16(2): 505 - 513.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
K.-J. J. Cheung, S. P. Shah, C. Steidl, N. Johnson, T. Relander, A. Telenius, B. Lai, K. P. Murphy, W. Lam, A. J. Al-Tourah, et al.
Genome-wide profiling of follicular lymphoma by array comparative genomic hybridization reveals prognostically significant DNA copy number imbalances
Blood, January 1, 2009; 113(1): 137 - 148.
[Abstract] [Full Text] [PDF]


Home page
Cancer Prevention ResearchHome page
I. F.L. Tsui, M. P. Rosin, L. Zhang, R. T. Ng, and W. L. Lam
Multiple Aberrations of Chromosome 3p Detected in Oral Premalignant Lesions
Cancer Prevention Research, November 1, 2008; 1(6): 424 - 429.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
D. T. Starczynowski, S. Vercauteren, A. Telenius, S. Sung, K. Tohyama, A. Brooks-Wilson, J. J. Spinelli, C. J. Eaves, A. C. Eaves, D. E. Horsman, et al.
High-resolution whole genome tiling path array CGH analysis of CD34+ cells from patients with low-risk myelodysplastic syndromes reveals cryptic copy number alterations and predicts overall and leukemia-free survival
Blood, October 15, 2008; 112(8): 3412 - 3424.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H.-I H. Chen, F.-H. Hsu, Y. Jiang, M.-H. Tsai, P.-C. Yang, P. S. Meltzer, E. Y. Chuang, and Y. Chen
A probe-density-based analysis method for array CGH data: simulation, normalization and centralization
Bioinformatics, August 15, 2008; 24(16): 1749 - 1756.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Pique-Regi, J. Monso-Varona, A. Ortega, R. C. Seeger, T. J. Triche, and S. Asgharzadeh
Sparse representation and Bayesian detection of genome copy number alterations from microarray data
Bioinformatics, February 1, 2008; 24(3): 309 - 318.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.