Bioinformatics Advance Access published online on January 19, 2008
Bioinformatics, doi:10.1093/bioinformatics/btn003
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A Segmental Maximum A Posteriori Approach to Genome-wide Copy Number Profiling
The Linnaeus Centre for Bioinformatics, Uppsala University, 751 24 Uppsala, Sweden
Department of Genetics, University of Alabama at Birmingham, Birmingham AL 35294-0024, USA
Department of Genetics and Pathology, Rudbeck Laboratory, Uppsala University, 751 85 Uppsala,Sweden
Department of Surgical Sciences, Uppsala University Hospital, 751 85 Uppsala, Sweden
Interdisciplinary Center for Mathematical and Computational Modelling, Warsaw University, 02-106Warsaw, Poland
*To whom correspondence should be addressed. Jan Komorowski, E-mail: Jan.Komorowski{at}lcb.uu.se
| Abstract |
|---|
Motivation: Copy number profiling methods aim at assigning DNA copy numbers to chromosomal regions using measurements from microarray-based comparative genomic hybridizations. Among the proposed methods to this end, Hidden Markov Model (HMM)-based approaches seem promising since DNA copy number transitions are naturally captured in the model. Current discrete-index HMM-based approaches do not, however, take into account heterogeneous information regarding the genomic overlap between clones. Moreover, the majority of existing methods are restricted to chromosome-wise analysis.
Results: We introduce a novel Segmental Maximum A Posteriori approach,SMAP, for DNA copy number profiling. Our method is based on discrete-index Hidden Markov Modeling and incorporates genomic distance and overlap between clones. We exploit a priori information through user-controllable parameterization that enables the identification of copy number deviations of various lengths and amplitudes. The model parameters may be inferred at a genome-wide scale to avoid over-fitting of model parameters often resulting from chromosomewise model inference. We report superior performances of SMAP on synthetic data when compared with two recent methods. When applied on our new experimental data, SMAP readily recognizes already known genetic aberrations including both large-scale regions with aberrant DNA copy number and changes affecting only single features on the array. We highlight the differences between the prediction of SMAP and the compared methods and show that SMAP accurately determines copy number changes and benefits from overlap consideration.
Availability: SMAP is available from Bioconductor (Gentleman et al., 2004) and within the Linnaeus Centre for Bioinformatics Data Warehouse (Ameur et al., 2006).
Contact: Jan.Komorowski{at}lcb.uu.se
Supplementary Information: http://www.lcb.uu.se/papers/r_andersson/SMAP/
Associate Editor: Dr. Alex Bateman
Received on December 19, 2007; revised on December 19, 2007; accepted on January 2, 2008
This article has been cited by other articles:
![]() |
S. Stjernqvist and T. Ryden A continuous-index hidden Markov jump process for modeling DNA copy number data Biostat., October 1, 2009; 10(4): 773 - 778. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bicciato, R. Spinelli, M. Zampieri, E. Mangano, F. Ferrari, L. Beltrame, I. Cifola, C. Peano, A. Solari, and C. Battaglia A computational procedure to identify significant overlap of differentially expressed and genomic imbalanced regions in cancer datasets Nucleic Acids Res., August 1, 2009; 37(15): 5057 - 5070. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Budinska, E. Gelnarova, and M. G. Schimek MSMAD: a computationally efficient method for the analysis of noisy array CGH data Bioinformatics, March 15, 2009; 25(6): 703 - 713. [Abstract] [Full Text] [PDF] |
||||


