Bioinformatics Vol. 19 no. 12 2003
Pages 1461-1468
© 2003 Oxford University Press
Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA
Center for Environmental Biotechnology, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Mail Stop 70A-3317, Berkeley, CA 94720, USA
Received on October 4, 2002
; revised on January 27, 2003
; accepted on February 19, 2003
Motivation: Prokaryotic organisms have been identified utilizing the sequence variation of the 16S rRNA gene. Variations steer the design of DNA probes for the detection of taxonomic groups or specific organisms. The long-term goal of our project is to create probe arrays capable of identifying 16S rDNA sequences in unknown samples. This necessitated the authentication, categorization and alignment of the >75 000 publicly available 16S sequences. Preferably, the entire process should be computationally administrated so the aligned collection could periodically absorb 16S rDNA sequences from the public records. A complete multiple sequence alignment would provide a foundation for computational probe selection and facilitates microbial taxonomy and phylogeny.
Results: Here we report the alignment and similarity clustering of 62 662 16S rDNA sequences and an approach for designing effective probes for each cluster. A novel alignment compression algorithm, NAST (Nearest Alignment Space Termination), was designed to produce the uniform multiple sequence alignment referred to as the prokMSA. From the prokMSA, 9020 Operational Taxonomic Units (OTUs) were found based on transitive sequence similarities. An automated approach to probe design was straightforward using the prokMSA clustered into OTUs. As a test case, multiple probes were computationally picked for each of the 27 OTUs that were identified within the Staphylococcus Group. The probes were incorporated into a customized microarray and were able to correctly categorize Staphylococcus aureus and Bacillus anthracis into their correct OTUs. Although a successful probe picking strategy is outlined, the main focus of creating the prokMSA was to provide a comprehensive, categorized, updateable 16S rDNA collection useful as a foundation for any probe selection algorithm.
Availability: http://greengenes.llnl.gov/16S/
Contact: GLAndersen{at}lbl.gov
Supplementary Information: Complete prokMSA in aligned fasta format: http://greengenes.llnl.gov/16S_cgi/download/update_28Dec02/prokMSA_aligned.fasta.Z Sequences within each Operational Taxonomic Unit: http://greengenes.llnl.gov/16S_cgi/download/update_15Mar02/SeqDescByOTU.txt DNA probe sequences: http://greengenes.llnl.gov/16S_cgi/download/update_15Mar02/15Mar02_StaphGrpPrbs.xls
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. F. Petrosino, S. Highlander, R. A. Luna, R. A. Gibbs, and J. Versalovic Metagenomic Pyrosequencing and Microbial Identification Clin. Chem., May 1, 2009; 55(5): 856 - 866. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Isenbarger, M. Finney, C. Rios-Velazquez, J. Handelsman, and G. Ruvkun Miniprimer PCR, a New Lens for Viewing the Microbial World Appl. Envir. Microbiol., February 1, 2008; 74(3): 840 - 849. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Pruesse, C. Quast, K. Knittel, B. M. Fuchs, W. Ludwig, J. Peplies, and F. O. Glockner SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB Nucleic Acids Res., December 18, 2007; 35(21): 7188 - 7196. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Wang, G. M. Garrity, J. M. Tiedje, and J. R. Cole Naive Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy Appl. Envir. Microbiol., August 15, 2007; 73(16): 5261 - 5267. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. Brodie, T. Z. DeSantis, D. C. Joyner, S. M. Baek, J. T. Larsen, G. L. Andersen, T. C. Hazen, P. M. Richardson, D. J. Herman, T. K. Tokunaga, et al. Application of a High-Density Oligonucleotide Microarray Approach To Study Bacterial Population Dynamics during Uranium Reduction and Reoxidation Appl. Envir. Microbiol., September 1, 2006; 72(9): 6288 - 6298. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Sogin, H. G. Morrison, J. A. Huber, D. M. Welch, S. M. Huse, P. R. Neal, J. M. Arrieta, and G. J. Herndl Microbial diversity in the deep sea and the underexplored "rare biosphere" PNAS, August 8, 2006; 103(32): 12115 - 12120. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Z. DeSantis, P. Hugenholtz, N. Larsen, M. Rojas, E. L. Brodie, K. Keller, T. Huber, D. Dalevi, P. Hu, and G. L. Andersen Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB. Appl. Envir. Microbiol., July 1, 2006; 72(7): 5069 - 5072. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Z. DeSantis Jr, P. Hugenholtz, K. Keller, E. L. Brodie, N. Larsen, Y. M. Piceno, R. Phan, and G. L. Andersen NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W394 - W399. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Green, E. Inbar, F. C. Michel Jr., Y. Hadar, and D. Minz Succession of Bacterial Communities during Early Plant Development: Transition from Seed to Root and Effect of Compost Amendment. Appl. Envir. Microbiol., June 1, 2006; 72(6): 3975 - 3983. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pett-Ridge and M. K. Firestone Redox Fluctuation Structures Microbial Communities in a Wet Tropical Soil Appl. Envir. Microbiol., November 1, 2005; 71(11): 6998 - 7007. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Pancoska, Z. Moravek, and U. M. Moll Rational design of DNA sequences for nanotechnology, microarrays and molecular computers using Eulerian graphs Nucleic Acids Res., August 27, 2004; 32(15): 4630 - 4645. [Abstract] [Full Text] [PDF] |
||||



