Bioinformatics Vol. 18 no. 90001 2002
Pages S354-S363
© 2002 Oxford University Press
Finding composite regulatory patterns in DNA sequences
1 Department of Computer Science, Columbia University,
New York, 10027 NY
2 Department of Computer Science and Engineering,
University of California at San Diego, La Jolla 92093-0114, CA
Received on January 22, 2002
; revised on April 1, 2002
; accepted on April 1, 2002
Pattern discovery in unaligned DNA sequences is a fundamental problem in computational biology with important applications in finding regulatory signals. Current approaches to pattern discovery focus on monad patterns that correspond to relatively short contiguous strings. However, many of the actual regulatory signals are composite patterns that are groups of monad patterns that occur near each other. A difficulty in discovering composite patterns is that one or both of the component monad patterns in the group may be too weak. Since the traditional monad-based motif finding algorithms usually output one (or a few) high scoring patterns, they often fail to find composite regulatory signals consisting of weak monad parts. In this paper, we present a MITRA (MIsmatch TRee Algorithm) approach for discovering composite signals. We demonstrate that MITRA performs well for both monad and composite patterns by presenting experiments over biological and synthetic data.
Availability: MITRA is available at http://www.cs.columbia.edu/compbio/mitra/
Contact: eeskin{at}cs.columbia.edu
Keywords: regulatory motif finding; pattern finding; dyad motifs.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Linhart, Y. Halperin, and R. Shamir Transcription factor and microRNA motif discovery: The Amadeus platform and a compendium of metazoan target sets Genome Res., July 1, 2008; 18(7): 1180 - 1189. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Wijaya, K. Rajaraman, S.-M. Yiu, and W.-K. Sung Detection of generic spaced motifs using submotif pattern mining Bioinformatics, June 15, 2007; 23(12): 1476 - 1485. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Chakravarty, J. M. Carlson, R. S. Khetani, C. E. DeZiel, and R. H. Gross SPACER: identification of cis-regulatory elements with non-contiguous critical residues Bioinformatics, April 15, 2007; 23(8): 1029 - 1031. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Waleev, D. Shtokalo, T. Konovalova, N. Voss, E. Cheremushkin, P. Stegmaier, O. Kel-Margoulis, E. Wingender, and A. Kel Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W541 - W545. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kel, T. Konovalova, T. Waleev, E. Cheremushkin, O. Kel-Margoulis, and E. Wingender Composite Module Analyst: a fitness-based tool for identification of transcription factor binding site combinations Bioinformatics, May 15, 2006; 22(10): 1190 - 1197. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Zick, I. Onn, R. Bezalel, H. Margalit, and J. Shlomai Assigning functions to genes: identification of S-phase expressed genes in Leishmania major based on post-transcriptional control elements Nucleic Acids Res., July 28, 2005; 33(13): 4235 - 4242. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Li and W. H. Wong Sampling motifs on phylogenetic trees PNAS, July 5, 2005; 102(27): 9481 - 9486. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kel, S. Reymann, V. Matys, P. Nettesheim, E. Wingender, and J. Borlak A Novel Computational Approach for the Prediction of Networked Transcription Factors of Aryl Hydrocarbon-Receptor-Regulated Genes Mol. Pharmacol., December 1, 2004; 66(6): 1557 - 1572. [Abstract] [Full Text] [PDF] |
||||




