Skip Navigation


Bioinformatics Advance Access originally published online on November 11, 2004
Bioinformatics 2005 21(7):1263-1264; doi:10.1093/bioinformatics/bti134
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/7/1263    most recent
bti134v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (13)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gadberry, M. D.
Right arrow Articles by Kellogg, E. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gadberry, M. D.
Right arrow Articles by Kellogg, E. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2004. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oupjournals.org

Primaclade—a flexible tool to find conserved PCR primers across multiple species

Michael D. Gadberry , Simon T. Malcomber *, Andrew N. Doust * and Elizabeth A. Kellogg {dagger}

Department of Biology, University of Missouri-St. Louis One University Boulevard, St. Louis, MO 63121, USA

*To whom correspondence should be addressed.


    Abstract
 TOP
 Abstract
 INTRODUCTION
 ALGORITHM AND IMPLEMENTATION
 TESTS OF THE PROGRAM
 REFERENCES
 

Summary: Primaclade is a web-based application that accepts a multiple species nucleotide alignment file as input and identifies a set of polymerase chain reaction (PCR) primers that will bind across the alignment. Primaclade iteratively runs the Primer3 application for each alignment sequence and collates the results. Primaclade creates an HTML results page that recaps the original alignment, provides a consensus sequence and lists primers for each alignment area, with primers color-coded to reflect the level of degeneracy in the primer.

Availability: Primaclade can be accessed freely at http://www.umsl.edu/~biology/Kellogg/primaclade.html

Contact: tkellogg{at}umsl.edu


    INTRODUCTION
 TOP
 Abstract
 INTRODUCTION
 ALGORITHM AND IMPLEMENTATION
 TESTS OF THE PROGRAM
 REFERENCES
 
Comparative studies of genes and genomes—including studies of molecular evolution, organism evolution and genetic mapping—rely on the polymerase chain reaction (PCR) to amplify orthologous genes among related organisms. Such studies require efficient methods to design primers from a nucleotide alignment. Currently, most available primer design software accepts only a single nucleotide sequence. Software that does design primers from a multiple species alignment, such as Primer Premier (Premier Biosoft International, Palo Alto, CA, USA) is limited by input format and is only available commercially. Thus we have developed a free, web-based, primer prediction application, Primaclade, to design minimally degenerate primers for comparative studies of multiple species.


    ALGORITHM AND IMPLEMENTATION
 TOP
 Abstract
 INTRODUCTION
 ALGORITHM AND IMPLEMENTATION
 TESTS OF THE PROGRAM
 REFERENCES
 
Primaclade employs a BioPerl-based executable file, which runs as a typical CGI script on an Apache-based web server (Fielding and Kaiser, 1997). Running the application requires a standard Perl 5.8.0 installation, a few Comprehensive Perl Archive Network (CPAN) Perl modules, the BioPerl 1.4.0 set of modules (Stajich et al., 2002) and version 0.9 of the Primer3 software (Rozen and Skaletsky, 2000; Code available at http://www-genome.wi.mit.edu/genome_software/other/primer3.html).

Primaclade accepts as input a multiple alignment file saved in Clustal (Thompson et al., 1997) NEXUS (Maddison et al., 1997), EMBOSS (Rice et al., 2000), PHYLIP (Felsenstein, 2004) or numerous other alignment formats. Users can specify the maximum number of degenerate base pairs per primer (up to five), the number of gapped sequence lines in the alignment file to ignore and a single region of the alignment to exclude. The last feature is most useful in excluding areas that are so conserved that they would be shared by many paralogous genes. Melting temperatures and percent GC content can also be input for each run, or default values can be used.

To determine primers for a set of sequences, the alignment file is read and a consensus computed using the ‘consensus_iupac’ method from BioPerl AlignIO.pm (pm = Perl Module). The alignment is then split into individual sequences. To find as many unique primers as possible, the script runs Primer3 11 times for each sequence of the alignment starting with a search for an 18-mer primer, and incrementing, each time by 1 bp, up to a 28-mer. The output file from each run of Primer3 is then parsed, and both upstream and downstream primers are saved into a unique array for each line of sequence data in the alignment. After any gaps are accounted for, the primer starting location and length are calculated, and the primer sequence is compared to the corresponding nucleotides in the alignment consensus sequence. If the consensus sequence contains the correct number or fewer degenerate nucleotides then the primer is saved; otherwise it is discarded. Primers that pass the test for degeneracy are screened to determine the number of gap sequences that occur at their positions within the alignment, and primers that meet the input criteria are saved into a final array. The array is sorted, any duplicates are removed, and a final result HTML document is generated.

A typical output page contains the original alignment file followed by a single line showing the consensus sequence (black and white version, Fig. 1), with highly conserved nucleotides in colored text and less-conserved bases in black. At the bottom of the page, the list of primers is printed under their correct position within the alignment display. The primer list is color-coded, with green for primers with no degenerate base pairs, orange for primers with one or two degenerate bases, and red for primers with three or more degenerates. The reverse complement for the 3' primers is provided, as are Tm and %GC. The output page can also be saved in plain text format.



View larger version (40K):
[in this window]
[in a new window]
 
Fig. 1 Primaclade output screen (converted to black and white) showing primers identified in the 5' region of the LEAFY HULL STERILE 1 (LHS1) gene from a selection of grass species. Upper block consists of aligned sequences, middle block the consensus. The bottom four lines list primer sequences, followed by the number of the start position, Tm, %GC content and length.

 

    TESTS OF THE PROGRAM
 TOP
 Abstract
 INTRODUCTION
 ALGORITHM AND IMPLEMENTATION
 TESTS OF THE PROGRAM
 REFERENCES
 
We have used Primaclade successfully to design primers in alignments of 200–2128 bp comprising 2–17 sequences. Primaclade generally performed best with alignments of up to about eight sequences and up to 29.0% sequence divergence (see Primaclade webpage). Including more sequences causes the program to run more slowly, but the precise effect depends on the quality of the alignment. Input of a good alignment is vital, as the software is not effective in finding primers in ambiguously aligned regions, or in alignments with poor consensus. For very divergent alignments we partition the alignment into several smaller files and run Primaclade independently on each file. In general, an iterative approach works well, starting with input of an entire alignment and using the default settings. If suitable primers are not found, we then increase the allowable number of degenerate sites, range of melting temperatures, range of GC percentages and the number of alignment gaps to skip. If this still is unsatisfactory, we sequentially remove the most divergent sequences and/or divide the file into two more homogeneous subfiles.

In summary, Primaclade provides a quick, easy, powerful and freely available solution for researchers who want to design PCR primers across multiple species. It can greatly simplify the design of PCR primers for any comparative molecular study.


    Acknowledgments
 
We thank Rosa Ortiz-Gentry and Jill Preston for test data sets, and Patrick Sweeney for help with the web interface, and two anonymous reviewers for comments on the manuscript. This project was supported by NSF grants MCB-0110809 and DBI-0110189 to EAK.


    Footnotes
 
{dagger}These authors contributed equally to this work. Back

Received on August 29, 2004; revised on November 1, 2004; accepted on November 2, 2004

    REFERENCES
 TOP
 Abstract
 INTRODUCTION
 ALGORITHM AND IMPLEMENTATION
 TESTS OF THE PROGRAM
 REFERENCES
 

    Fielding, R.T. and Kaiser, G. (1997) The Apache HTTP Server Project. IEEE Internet Comput., 1, 88–90.

    Felsenstein, J. PHYLIP (Phylogeny Inference Package) version 3.6, (2004) , Seattle Distributed by the author Department of Genome Sciences, University of Washington.

    Maddison, D.R., Swofford, D.L., Maddison, W.P. (1997) NEXUS: an extensible file format for systematic information. Syst. Biol., 46, , pp. 590–621[CrossRef][ISI][Medline].

    Rozen, S. and Skaletsky, H.J. (2000) Primer3 on the WWW for general users and for biologist programmers. In Krawetz, S. and Misener, S. (Eds.). Bioinformatics Methods and Protocols: Methods in Molecular Biology, , Totowa, NJ Humana Press, pp. 365–386.

    Rice, P., Longden, I., Bleasby, A. (2000) EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet., 16, 276–277[CrossRef][ISI][Medline].

    Stajich, J.E., Block, D., Boulez, K., Brenner, S.E., Chervitz, S.A., Dagdigian, C., Fuellen, G., Gilbert, J.G.R., Korf, I., Lapp, H., et al. (2002) The Bioperl toolkit: Perl modules for the life sciences. Genome Res., 12, 1611–1618[Abstract/Free Full Text].

    Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F., Higgins, D.G. (1997) The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res., 24, 4876–4882.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
M. Bekaert and E. C. Teeling
UniPrime: a workflow-based platform for improved universal primer design
Nucleic Acids Res., June 1, 2008; 36(10): e56 - e56.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Kim and C. Lee
QPRIMER: a quick web-based application for designing conserved PCR primers from multigenome alignments
Bioinformatics, September 1, 2007; 23(17): 2331 - 2333.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
R. D. C. Ortiz, E. A. Kellogg, and H. V. D. Werff
Molecular phylogeny of the moonseed family (Menispermaceae): implications for morphological diversification
Am. J. Botany, August 1, 2007; 94(8): 1425 - 1438.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H.-Y. Ou, X. He, E. M. Harrison, B. R. Kulasekara, A. B. Thani, A. Kadioglu, S. Lory, J. C. D. Hinton, M. R. Barer, Z. Deng, et al.
MobilomeFINDER: web-based tools for in silico and experimental discovery of bacterial genomic islands
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W97 - W104.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. E. Schranz, A. J. Windsor, B.-h. Song, A. Lawton-Rauh, and T. Mitchell-Olds
Comparative Genetic Mapping in Boechera stricta, a Close Relative of Arabidopsis
Plant Physiology, May 1, 2007; 144(1): 286 - 298.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
B. Berger, R. D. Pridmore, C. Barretto, F. Delmas-Julien, K. Schreiber, F. Arigoni, and H. Brussow
Similarity and Differences in the Lactobacillus acidophilus Group Identified by Polyphasic Analysis and Comparative Genomics
J. Bacteriol., February 15, 2007; 189(4): 1311 - 1321.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
E. A. Kellogg
Progress and challenges in studies of the evolution of development
J. Exp. Bot., October 1, 2006; 57(13): 3505 - 3516.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
J. E. Stajich and H. Lapp
Open source tools and toolkits for bioinformatics: significance, and where are we?
Brief Bioinform, September 1, 2006; 7(3): 287 - 296.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H.-Y. Ou, L.-L. Chen, J. Lonnen, R. R. Chaudhuri, A. B. Thani, R. Smith, N. J. Garton, J. Hinton, M. Pallen, M. R. Barer, et al.
A novel strategy for the identification of genomic islands by comparative analysis of the contents and contexts of tRNA sites in closely related bacteria
Nucleic Acids Res., January 9, 2006; 34(1): e3 - e3.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/7/1263    most recent
bti134v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (13)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gadberry, M. D.
Right arrow Articles by Kellogg, E. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gadberry, M. D.
Right arrow Articles by Kellogg, E. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?