Bioinformatics, Vol 14, 685-690, Copyright © 1998 by Oxford University Press
M Yan, ZS Lin and CT Zhang
MOTIVATION: At the core of most protein gene-finding algorithms are the
coding measures used to make a decision on coding/non-coding. Of the
protein coding measures, the Fourier measure is one of the most important.
However, due to the limited length of the windows usually used, the
accuracy of the measure is not satisfactory. This paper is devoted to
improving the accuracy by lengthening the sequence to amplify the
periodicity of 3 in the coding regions. RESULTS: A new algorithm, the
lengthen-shuffle Fourier transform algorithm, is presented. For the
same window length, the percentage accuracy of the new algorithm is 6-7%
higher than that of the ordinary Fourier transform algorithm. The resulting
percentage accuracy (average of specificity and sensitivity) of the new
measure is 84.9% for the window length 162 bp. AVAILABILITY: The program is
available on request from C.-T. Zhang. Contact: ctzhang@tju.edu.cn
A new Fourier transform approach for protein coding measure based on the format of the Z curve
Department of Physics, Tianjin University, Tianjin 300072, China.
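
The period-3 signal that the abstract refers to can be illustrated with a minimal sketch of an ordinary (not lengthen-shuffle) Fourier measure on a single window. This is not the authors' program: the ±1 encoding follows the standard Z curve component definitions, while the function names `power_at` and `period3_score` and the normalization by 3n are assumptions made here for illustration only. The lengthening and shuffling steps are not described in the abstract and are not reproduced.

```python
import cmath

# Z-curve style steps (standard definitions, assumed here):
#   x: purine (A, G) = +1, pyrimidine (C, T) = -1
#   y: amino  (A, C) = +1, keto       (G, T) = -1
#   z: weak   (A, T) = +1, strong     (G, C) = -1
Z_STEPS = {
    "A": (1, 1, 1),
    "C": (-1, 1, -1),
    "G": (1, -1, -1),
    "T": (-1, -1, 1),
}


def power_at(signal, k):
    """Squared magnitude of the discrete Fourier coefficient at index k."""
    n = len(signal)
    coeff = sum(s * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, s in enumerate(signal))
    return abs(coeff) ** 2


def period3_score(window):
    """Hypothetical score: power at period 3, summed over the x, y, z
    components and divided by 3n, so that a random window scores near 1."""
    n = len(window)
    if n == 0 or n % 3 != 0:
        raise ValueError("window length must be a positive multiple of 3")
    steps = [Z_STEPS[base] for base in window.upper()]
    total = 0.0
    for component in range(3):
        signal = [step[component] for step in steps]
        total += power_at(signal, n // 3)  # index n/3 corresponds to period 3
    return total / (3 * n)


if __name__ == "__main__":
    # A perfectly periodic toy window of 162 bp (the window length
    # quoted in the abstract); real coding regions are far noisier.
    print(period3_score("ATG" * 54))
```

A window with strong codon periodicity yields a score well above 1, while a window without it stays near 1. The shorter the window, the noisier this estimate becomes, which is the limitation the abstract sets out to address.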
