Bioinformatics Advance Access originally published online on February 8, 2008
Bioinformatics 2008 24(7):889-896; doi:10.1093/bioinformatics/btn034
Integrative analysis reveals the direct and indirect interactions between DNA copy number aberrations and gene expression changes


1Harvard-Partners Center for Genetics and Genomics, 2Informatics Program, Children's Hospital, 3Department of Cardiology, Children's Hospital, Boston, MA, USA and 4Department of Information and Communication, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: DNA copy number aberrations (CNAs) and gene expression (GE) changes provide valuable information for studying chromosomal instability and its consequences in cancer. While it is clear that the structural aberrations and the transcript levels are intertwined, their relationship is more complex and subtle than initially suspected. Most studies so far have focused on how a CNA affects the expression levels of those genes contained within that CNA.
Results: To better understand the impact of CNAs on expression, we investigated the correlation of each CNA to all other genes in the genome. The correlations are computed over multiple patients that have both expression and copy number measurements in brain, bladder and breast cancer data sets. We find that a CNA has a direct impact on the gene amplified or deleted, but it also has a broad, indirect impact elsewhere. To identify a set of CNAs that is coordinately associated with the expression changes of a set of genes, we used a biclustering algorithm on the correlation matrix. For each of the three cancer types examined, the aberrations in several loci are associated with cancer-type specific biological pathways that have been described in the literature: CNAs of chromosome (chr) 7p13 were significantly correlated with epidermal growth factor receptor signaling pathway in glioblastoma multiforme, chr 13q with NF-kappaB cascades in bladder cancer, and chr 11p with Reck pathway in breast cancer. In all three data sets, gene sets related to cell cycle/division such as M phase, DNA replication and cell division were also associated with CNAs. Our results suggest that CNAs are both directly and indirectly correlated with changes in expression and that it is beneficial to examine the indirect effects of CNAs.
Availability: The code is available upon request.
Contact: peter_park{at}harvard.edu
Supplementary Information: Supplementary data are available at Bioinformatics online.
Associate Editor: Chris Stoeckert
The authors wish to be known that, in their opinion, the first two authors should be regarded as joint First Authors.
Received on September 20, 2007; revised on December 29, 2008; accepted on January 22, 2008
This article has been cited by other articles:
![]() |
R. Shen, A. B. Olshen, and M. Ladanyi Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis Bioinformatics, November 15, 2009; 25(22): 2906 - 2912. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Choi and C. Kendziorski Statistical methods for gene set co-expression analysis Bioinformatics, November 1, 2009; 25(21): 2780 - 2786. [Abstract] [Full Text] [PDF] |
||||
