Prediction of probable genes by Fourier analysis of genomic sequences
1School of Physical Sciences Jawaharlal Nehru University, New-Delhi 110067, India
2School of Environmental Sciences Jawaharlal Nehru University, New-Delhi 110067, India
3School of Life Sciences Jawaharlal Nehru University, New-Delhi 110067, India
4 To whom correspondence should be addressed
Motivation: The major signal in coding regions of genomic sequences is a three-base periodicity. Our aim is to use Fourier techniques to analyse this periodicity, and thereby to develop a tool to recognize coding regions in genomic DNA.
Result: The three-base periodicity in the nucleotide arrangement is evidenced as a sharp peak at frequency f = 1/3 in the Fourier (or power) spectrum. From extensive spectral analysis of DNA sequences of total length over 5.5 million base pairs from a wide variety or organisms (including the human genome), and by separately examining coding and non-coding sequences, we find that the relative height of the peak at f = 1/3 in the Fourier spectrum is a good discriminator of coding potential. This feature is utilized by us to detect probable coding regions in DNA sequences, by examining the local signal-to-noise ratio of the peak within a sliding window. While the overall accuracy is comparable to that of other techniques currently in use, the measure that is presently proposed is independent of training sets or existing database information, and can thus find general application.
Availability: A computer program GeneScan which locates coding open reading frames and exonic regions in genomic sequences has been developed, and is available on request.
Received on September 15, 1996; accepted on December 31, 1996
This article has been cited by other articles:
![]() |
S. P. Pandey, E. Gaquerel, K. Gase, and I. T. Baldwin RNA-Directed RNA Polymerase3 from Nicotiana attenuata Is Required for Competitive Growth in Natural Environments Plant Physiology, July 1, 2008; 147(3): 1212 - 1224. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Saeys, T. Abeel, S. Degroeve, and Y. Van de Peer Translation initiation site prediction on a genomic scale: beauty in simplicity Bioinformatics, July 1, 2007; 23(13): i418 - i423. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Jehan, S. Vallinayagam, S. Tiwari, S. Pradhan, L. Singh, A. Suresh, H. M. Reddy, Y.R. Ahuja, and R. A. Jesudasan Novel noncoding RNA from human Y distal heterochromatic block (Yq12) generates testis-specific chimeric CDC2L2 Genome Res., April 1, 2007; 17(4): 433 - 440. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Saeys, P. Rouze, and Y. Van de Peer In search of the small ones: improved prediction of short exons in vertebrates, plants, fungi and protists Bioinformatics, February 15, 2007; 23(4): 414 - 420. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Balakirev, V. R. Chechetkin, V. V. Lobzin, and F. J. Ayala Entropy and GC Content in the {beta}-esterase Gene Cluster of the Drosophila melanogaster Subgroup Mol. Biol. Evol., October 1, 2005; 22(10): 2063 - 2072. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Worthey, S. Martinez-Calvillo, A. Schnaufer, G. Aggarwal, J. Cawthra, G. Fazelinia, C. Fong, G. Fu, M. Hassebrock, G. Hixson, et al. Leishmania major chromosome 3 contains two long convergent polycistronic gene clusters separated by a tRNA gene Nucleic Acids Res., July 15, 2003; 31(14): 4201 - 4210. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ciccodicola, M. D'Esposito, T. Esposito, F. Gianfrancesco, C. Migliaccio, M. G. Miano, M. R. Matarazzo, M. Vacca, A. Franze, M. Cuccurese, et al. Differentially regulated and evolved genes in the fully sequenced Xq/Yq pseudoautosomal region Hum. Mol. Genet., February 12, 2000; 9(3): 395 - 401. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. S. Nelson, L. Gan, C. Ferguson, P. Moss, R. Gelinas, L. Hood, and K. Wang Molecular cloning and characterization of prostase, an androgen-regulated serine protease with prostate-restricted expression PNAS, March 16, 1999; 96(6): 3114 - 3119. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Audic and J.-M. Claverie Self-identification of protein-coding regions in microbial genomes PNAS, August 18, 1998; 95(17): 10026 - 10031. [Abstract] [Full Text] [PDF] |
||||






