Bioinformatics Advance Access originally published online on February 19, 2007
Bioinformatics 2007 23(8):1006-1014; doi:10.1093/bioinformatics/btm059
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Continuous-index hidden Markov modelling of array CGH copy number data
1Centre for Mathematical Sciences, Lund University, Box 118, 221 00 Lund and 2Department of Oncology, University Hospital, 221 85 Lund, Sweden
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: In recent years, a range of techniques for analysis and segmentation of array comparative genomic hybridization (aCGH) data have been proposed. For array designs in which clones are of unequal lengths, are unevenly spaced or overlap, the discrete-index view typically adopted by such methods may be questionable or improved.
Results: We describe a continuous-index hidden Markov model for aCGH data as well as a Monte Carlo EM algorithm to estimate its parameters. It is shown that for a dataset from the BT-474 cell line analysed on 32K BAC tiling microarrays, this model yields considerably better model fit in terms of lag-1 residual autocorrelations compared to a discrete-index HMM, and it is also shown how to use the model for e.g. estimation of change points on the base-pair scale and for estimation of conditional state probabilities across the genome. In addition, the model is applied to the Glioblastoma Multiforme data used in the comparative study by Lai et al. (Lai,W.R. et al. (2005) Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data. Bioinformatics, 21, 3763–3370.) giving result similar to theirs but with certain features highlighted in the continuous-index setting.
Contact: susann.stjernqvist{at}matstat.lu.se
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Keith Crandall
Received on November 8, 2006; revised on December 20, 2006; accepted on February 12, 2007
This article has been cited by other articles:
![]() |
C. D. Greenman, G. Bignell, A. Butler, S. Edkins, J. Hinton, D. Beare, S. Swamy, T. Santarius, L. Chen, S. Widaa, et al. PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data Biostat., October 15, 2009; (2009) kxp045v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Stjernqvist and T. Ryden A continuous-index hidden Markov jump process for modeling DNA copy number data Biostat., October 1, 2009; 10(4): 773 - 778. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Nicolas, A. Leduc, S. Robin, S. Rasmussen, H. Jarmer, and P. Bessieres Transcriptional landscape estimation from tiling array data using a model of signal shift and drift Bioinformatics, September 15, 2009; 25(18): 2341 - 2347. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Budinska, E. Gelnarova, and M. G. Schimek MSMAD: a computationally efficient method for the analysis of noisy array CGH data Bioinformatics, March 15, 2009; 25(6): 703 - 713. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-y. Wang, A. Abyzov, J. O. Korbel, M. Snyder, and M. Gerstein MSB: A mean-shift-based approach for the analysis of structural variation in the genome Genome Res., January 1, 2009; 19(1): 106 - 117. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Andersson, C. E. G. Bruder, A. Piotrowski, U. Menzel, H. Nord, J. Sandgren, T. R. Hvidsten, T. Diaz de Stahl, J. P. Dumanski, and J. Komorowski A segmental maximum a posteriori approach to genome-wide copy number profiling Bioinformatics, March 15, 2008; 24(6): 751 - 758. [Abstract] [Full Text] [PDF] |
||||


