A system for pattern matching applications on biosequences
Department of Computer Science, University of Arizona, Tucson, AZ 85721, USA
ANREP is a system for finding matches to patterns composed of (i) spacing constraints called spacers, and (ii) approximate matches to motifs that are, recursively, patterns composed of atomic symbols. A user specifies such patterns via a declarative, free-format and strongly typed language called A, which is presented here in a tutorial style through a series of progressively more complex examples. The sample patterns are for protein and DNA sequences, the application domain for which ANREP was specifically created. ANREP provides a unified framework for almost all previously proposed biosequence patterns and extends them by providing approximate matching, a feature heretofore unavailable except for the limited case of individual sequences. The performance of ANREP is discussed, and an appendix gives a concise specification of syntax and semantics. A portable C software package implementing ANREP is available via anonymous remote file transfer.
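To make the idea of a pattern built from motifs and spacers concrete, the following is a minimal sketch in Python. It is not ANREP's algorithm and does not use the syntax of the A language; the names `find_matches`, `hamming_ok`, `Motif`, and `Spacer` are hypothetical and introduced only for illustration. The abstract does not state the error model, so the sketch assumes that an approximate match allows substitutions only, up to a per-motif mismatch limit, and that each spacer is a (minimum, maximum) gap length between consecutive motifs.

```python
from typing import List, Tuple

# Hypothetical types for illustration; not taken from ANREP or the A language.
Motif = Tuple[str, int]   # (motif text, maximum number of mismatches allowed)
Spacer = Tuple[int, int]  # (minimum gap, maximum gap) between consecutive motifs


def hamming_ok(window: str, motif: str, max_mismatches: int) -> bool:
    """True if window has the motif's length and differs in at most max_mismatches positions."""
    if len(window) != len(motif):
        return False
    mismatches = sum(1 for a, b in zip(window, motif) if a != b)
    return mismatches <= max_mismatches


def find_matches(seq: str, motifs: List[Motif], spacers: List[Spacer]) -> List[Tuple[int, int]]:
    """Return sorted (start, end) spans, end exclusive, where all motifs match in order
    with gaps between consecutive motifs that satisfy the corresponding spacer."""
    if len(spacers) != len(motifs) - 1:
        raise ValueError("need exactly one spacer between each pair of motifs")
    results = set()

    def extend(index: int, pos: int, start: int) -> None:
        text, limit = motifs[index]
        end = pos + len(text)
        # Assumption: substitutions only (Hamming distance), no insertions or deletions.
        if not hamming_ok(seq[pos:end], text, limit):
            return
        if index == len(motifs) - 1:
            results.add((start, end))
            return
        low, high = spacers[index]
        for gap in range(low, high + 1):
            extend(index + 1, end + gap, start)

    for start in range(len(seq)):
        extend(0, start, start)
    return sorted(results)


if __name__ == "__main__":
    # Exact "ACG", then a gap of 2 to 4 symbols, then "GAC" with at most one mismatch.
    print(find_matches("ACGTTTGACA", [("ACG", 0), ("GAC", 1)], [(2, 4)]))
```

This brute-force search tries every start position and every permitted gap length, so it is meant only to show what a match to such a pattern looks like, not how an efficient matcher would be built.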
Received on April 8, 1992; accepted on September 15, 1992
This article has been cited by other articles:
S. Chakrabarti, A. P. Anand, N. Bhardwaj, G. Pugalenthi, and R. Sowdhamini. SCANMOT: searching for similar sequences using a simultaneous scan of multiple sequence motifs. Nucleic Acids Res., July 1, 2005; 33(suppl_2): W274-W276.
S. Schwartz, L. Elnitski, M. Li, M. Weirauch, C. Riemer, A. Smit, NISC Comparative Sequencing Program, E. D. Green, R. C. Hardison, and W. Miller. MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res., July 1, 2003; 31(13): 3518-3524.
S. Graf, D. Strothmann, S. Kurtz, and G. Steger. HyPaLib: a database of RNAs and RNA structural elements defined by hybrid patterns. Nucleic Acids Res., January 1, 2001; 29(1): 196-198.