Bioinformatics Vol. 17 no. 4 2001
Pages 338-342
© 2001 Oxford University Press
Original Paper |
Limits of homology detection by pairwise sequence comparison
1 Deutsches Krebsforschungszentrum, Theoretische Bioinformatik, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
Received on August 7, 2000
; revised on December 18, 2000
; accepted on December 21, 2000
Motivation: Noise in database searches resulting from random sequence similarities increases as the databases expand rapidly. The noise problems are not a technical shortcoming of the database search programs, but a logical consequence of the idea of homology searches. The effect can be observed in simulation experiments.
Results: We have investigated noise levels in pairwise alignment based database searches. The noise levels of 38 releases of the SwissProt database, display perfect logarithmic growth with the total length of the databases. Clustering of real biological sequences reduces noise levels, but the effect is marginal.
Contact: rainer{at}stat.duke.edu; m.vingron{at}dkfz-heidelberg.de
2 To whom correspondence should be addressed. Pressent address: Duke University, Institute of Statistics and Decision Sciences, Box 90251 Duke University, Durham, NC 27708-0251, USA.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Domazet-Loso and D. Tautz An Evolutionary Analysis of Orphan Genes in Drosophila Genome Res., October 1, 2003; 13(10): 2213 - 2219. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. P. Ponting, R. Mott, P. Bork, and R. R. Copley Novel Protein Domains and Repeats in Drosophila melanogaster: Insights into Structure, Function, and Evolution Genome Res., December 1, 2001; 11(12): 1996 - 2008. [Abstract] [Full Text] [PDF] |
||||
