Bioinformatics Vol. 18 no. 8 2002
Pages 1021-1033
© 2002 Oxford University Press
Why are complementary DNA strands symmetric?
1 Department of Information and Computer Science,
Institute for Genomics and Bioinformatics and
2 Department of Biological Chemistry, College of Medicine,
University of California, Irvine, CA 92697-3425, USA
Received on November 24, 2001
; revised on March 7, 2002
; accepted on March 11, 2002
Motivation: Over sufficiently long windows, complementary strands of DNA tend to have the same base composition. A few reports have indicated that this first-order parity rule extends at higher orders to oligonucleotide composition, at least in some organisms or taxa. However, the scientific literature falls short of providing a comprehensive study of reverse-complement symmetry at multiple orders and across the kingdom of life. It also lacks a characterization of this symmetry and a convincing explanation or clarification of its origin.
Results: We develop methods to measure and characterize symmetry at multiple orders, and analyze a wide set of genomes, encompassing single- and double-stranded RNA and DNA viruses, bacteria, archae, mitochondria, and eukaryota. We quantify symmetry at orders 1 to 9 for contiguous sequences and pools of coding and non-coding upstream regions, compare the observed symmetry levels to those predicted by simple statistical models, and factor out the effect of lower-order distributions. We establish the universality and variability range of first-order strand symmetry, as well as of its higher-order extensions, and demonstrate the existence of genuine high-order symmetric constraints. We show that ubiquitous reverse-complement symmetry does not result from a single cause, such as point mutation or recombination, but rather emerges from the combined effects of a wide spectrum of mechanisms operating at multiple orders and length scales.
Contact: baisnee{at}ics.uci.edu hampson{at}ics.uci.edu pfbaldi{at}ics.uci.edu
Data: http://promoter.ics.uci.edu/RevCompSym/
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Zhang, W.-H. Li, A. R. Krainer, and M. Q. Zhang RNA landscape of evolution for optimal exon and intron discrimination PNAS, April 15, 2008; 105(15): 5797 - 5802. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Klockgether, D. Wurdemann, O. Reva, L. Wiehlmann, and B. Tummler Diversity of the Abundant pKLC102/PAGI-2 Family of Genomic Islands in Pseudomonas aeruginosa J. Bacteriol., March 15, 2007; 189(6): 2443 - 2459. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Chen and H. Zhao Negative correlation between compositional symmetries and local recombination rates Bioinformatics, November 1, 2005; 21(21): 3951 - 3958. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Touchon, A. Arneodo, Y. d'Aubenton-Carafa, and C. Thermes Transcription-coupled and splicing-coupled strand asymmetries in eukaryotic genomes Nucleic Acids Res., September 23, 2004; 32(17): 4969 - 4978. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Louie, J. Ott, and J. Majewski Nucleotide Frequency Variation Across Human Genes Genome Res., December 1, 2003; 13(12): 2594 - 2601. [Abstract] [Full Text] [PDF] |
||||




