Bioinformatics Advance Access published online on May 14, 2009
Bioinformatics, doi:10.1093/bioinformatics/btp312
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Hierarchical Hidden Markov Model with Application to Joint Analysis of ChIP-chip and ChIP-seq data

1Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
2Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA
3Departments of Statistics and Public Health Sciences, Penn State University, University Park, PA 16802, USA
4Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
*To whom correspondence should be addressed. Dr. Hyungwon Choi, E-mail: hwchoi{at}umich.edu
| Abstract |
|---|
Motivation: Chromatin immunoprecipitation (ChIP) experiments followed by array hybridization, or ChIP-chip, is a powerful approach for identifying transcription factor binding sites (TFBS) and has been widely used. Recently, massively parallel sequencing coupled with ChIP experiments (ChIP-seq) has been increasingly used as an alternative to ChIP-chip, offering cost-effective genomewide coverage and resolution up to a single basepair. For many well-studied transcription factors, both ChIP-seq and ChIP-chip experiments have been applied and their data are publicly available.Previous analyses have revealed substantial technology-specific binding signals despite strong correlation between the two sets of results. Therefore, it is of interest to see whether the two data sources can be combined to enhance the detection of TFBS.
Results: In this work, hierarchical hidden Markov model (HHMM) is proposed for combining data from ChIP-seq and ChIP-chip. In HHMM, inference results from individual HMMs in ChIP-seq and ChIP-chip experiments are summarized by a higher level HMM. Simulation studies show the advantage of HHMM when data from both technologies co-exist. Analysis of two well-studied transcription factors, NRSF and CTCF, also suggests that HHMM yields improved TFBS identification in comparison to analyses using individual data sources or a simple merger of the two.
Availability: Source code for the software ChIPmeta freely available for download at URL http://www.umich.edu/~hwchoi/HHMMsoftware.zip, implemented in C and supported on linux.
Contact: qin{at}umich.edu, ghoshd{at}psu.edu
Associate Editor: Prof. Alfonso Valencia
Received on January 14, 2009; revised on April 15, 2009; accepted on May 6, 2009
This article has been cited by other articles:
![]() |
Z. Liu, M. Liu, G. Niu, Y. Cheng, and J. Fei Genome-wide identification of target genes repressed by the zinc finger transcription factor REST/NRSF in the HEK 293 cell line Acta Biochim Biophys Sin, November 10, 2009; (2009) gmp095v1. [Abstract] [Full Text] [PDF] |
||||
co-corresponding author 