Bioinformatics Advance Access originally published online on February 24, 2006
Bioinformatics 2006 22(9):1055-1063; doi:10.1093/bioinformatics/btl049
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A novel sensitive method for the detection of user-defined compositional bias in biological sequences
Gen*NY*sis Center for Excellence in Cancer Genomics, Department of Epidemiology and Biostatistics, University at Albany, State University of New York One Discovery Drive, Rensselaer, NY 12144, USA
*To whom correspondence should be addressed.
Motivation: Most biological sequences contain compositionally biased segments in which one or more residue types are significantly overrepresented. The function and evolution of these segments are poorly understood. Usually, all types of compositionally biased segments are masked and ignored during sequence analysis. However, it has been shown for a number of proteins that biased segments that contain amino acids with similar chemical properties are involved in a variety of molecular functions and human diseases. A detailed large-scale analysis of the functional implications and evolutionary conservation of different compositionally biased segments requires a sensitive method capable of detecting user-specified types of compositional bias.
Results: We present BIAS, a novel sensitive method for the detection of compositionally biased segments composed of a user-specified set of residue types. BIAS uses the discrete scan statistics that provides a highly accurate correction for multiple tests to compute analytical estimates of the significance of each compositionally biased segment. The method can take into account global compositional bias when computing analytical estimates of the significance of local clusters. BIAS is benchmarked against SEG, SAPS and CAST programs. We also use BIAS to show that groups of proteins with the same biological function are significantly associated with particular types of compositionally biased segments.
Availability: The software is available at http://lcg.rit.albany.edu/bias/
Contact: ikuznetsov{at}albany.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Received on June 26, 2005; revised on December 23, 2005; accepted on February 7, 2006
This article has been cited by other articles:
![]() |
I. B. Kuznetsov ProBias: a web-server for the identification of user-specified types of compositionally biased segments in protein sequences Bioinformatics, July 1, 2008; 24(13): 1534 - 1535. [Abstract] [Full Text] [PDF] |
||||
