Identification of a set of frequent decanucleotides in plants and in animals
Department of Evolutionary Biology, University of Ferrara 44100 Ferrara, Italy
2To whom all correspondence should be addressed
We studied the frequency distribution of 1048576 oligo-nucleotides 10 bp long in a sample of 1.961 Mbase of genes from plants, made of 635 sequences extracted from GenBank 71.0, with the aim of detecting transcription control signals. Among all decamers, 3255, or 0.3%, had a frequency 10 times higher than the mean and were subjected to further statistical analysis. For each of the 3255 decamers (parents), we counted the individual frequencies of the 30 decamers (progeny) differing from the parent by one base mutation, and calculated two variance/mean chi-squares for the progeny, with and without the parent decamer. By studying the distribution of the ratio between the two chi-squares we observed that out of 3255 decamers >10 times frequent than average, 432 had a chi-square ratio >1.9. In this residual set, which corresponds to <0.04 per cent of all possible decamers, only 15 known eukaryotic transcription control elements were found; on the other hand, it included 29 decanucleotides that matched with decanucleotides of a set of Drosophila, 24 with a set from mammals, 13 with a set from yeast and four with a set of virusesall sets identified with the statistical procedures here described. These decanucloetides are highly repetitive and seem to be present throughout all higher organisms, whereas they are uncommon in mammalian viruses.
Received on January 28, 1994; accepted on April 27, 1994
This article has been cited by other articles:
![]() |
D. Endoh, T. Mizutani, R. Kirisawa, Y. Maki, H. Saito, Y. Kon, S. Morikawa, and M. Hayashi Species-independent detection of RNA virus by representational difference analysis using non-ribosomal hexanucleotides for reverse transcription Nucleic Acids Res., April 7, 2005; 33(6): e65 - e65. [Abstract] [Full Text] [PDF] |
||||
