Skip Navigation


Bioinformatics Advance Access originally published online on January 29, 2006
Bioinformatics 2006 22(7):902-903; doi:10.1093/bioinformatics/btl021
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
22/7/902    most recent
btl021v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (8)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Feijão, P. C.
Right arrow Articles by Lessinger, A. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Feijão, P. C.
Right arrow Articles by Lessinger, A. C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

AMiGA: the arthropodan mitochondrial genomes accessible database

Pedro Cipriano Feijão , Lissiene Silva Neiva , Ana Maria Lima de Azeredo-Espin and Ana Cláudia Lessinger *

Laboratório de Genética Animal, Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP) CP 6010, CEP 13035-875, Campinas, São Paulo, Brazil

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 STRUCTURE AND OPERATION
 3 FUTURE DEVELOPMENTS
 REFERENCES
 

Summary: The Arthropodan Mitochondrial Genomes Accessible database (AMiGA) is a relational database developed to help in managing access to the increasing amount of data arising from developments in arthropodan mitochondrial genomics (136 mitochondrial genomes as of September 2005). The strengths of AMiGA include (1) a more accessible and up-to-date database containing a more comprehensive set of mitochondrial genomes for this phylum, (2) the provision of flexible search options for retrieving detailed information such as bibliographical data, genomic graphics, FASTA sequences and taxonomical status, (3) the possibility of enhanced comparative analyses by multiple alignment of single or concatenated sets of genes, (4) more accurate and updated information resulting from a specific curation process called AMiGA Notes and (5) the possibility of including unpublished sequences in a password-restricted area for comparative analysis with the other sequences stored in the database.

Availability: http://amiga.cbmeg.unicamp.br

Contact: lessinger{at}amiga.cbmeg.unicamp.br

Supplementary information: Detailed information, including an illustrated tutorial, is available from the above URL.


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 STRUCTURE AND OPERATION
 3 FUTURE DEVELOPMENTS
 REFERENCES
 
Because of characteristics such as a reduced size, haploidy and a conserved gene content (37 genes in most cases: 13 protein-coding genes, 22 tRNAs and 2 rRNAs), the animal mitochondrial genome has been widely used as an important source of data for evolutionary studies (Boore, 1999). Primary sequence databases such as GenBank (Benson et al., 2005), maintained by the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov), the European Molecular Biology Laboratory (EMBL) (http://www.embl.org) and the DNA Databank of Japan (DDBJ) (http://www.ddbj.nig.ac.jp/), have drastically improved and stimulated evolutionary studies based on comparative analyses of DNA sequences. More exclusive databases, such as Gobase (O'Brien et al., 2003), the Organellar Genome Retrieval (OGRe) database (Jameson et al., 2003), the Joint Genome Institute (JGI/DOE) Organelle database (http://evogen.jgi.doe.gov/top_level/organelles.html), MamMiBase (Vasconcelos et al., 2005) and the NCBI Organelle Resources database (Wolfsberg et al., 2001), are specialized sources focused in organellar genomes that offer specific tools for the retrieval of detailed information related to complete mitochondrial genomes. Organellar databases usually offer more refined search engines than primary public repositories. These initiatives provide major contributions to comparative mitochondrial genomics. In this context, the Arthropodan Mitochondrial Genomes Accessible database or AMiGA aims to provide the primary tools needed to optimize the access to sequence information and comparative analyses in order to stimulate the study of arthropodan mitochondrial genomics.


    2 STRUCTURE AND OPERATION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 STRUCTURE AND OPERATION
 3 FUTURE DEVELOPMENTS
 REFERENCES
 
2.1 Infrastructure
AMiGA is implemented in a relational MySQL system in a GNU/Linux server (the organization of the data is shown in Table 1), and its web interface is based on an Apache 1.3 server with dynamic content generated by Perl CGI scripts, including the Bioperl module (Stajich et al., 2002). AMiGA contains sequences and related information from GenBank and NCBI-Taxonomy (http://www.ncbi.nih.gov/Taxonomy/) and runs automatic daily searches for new entries and updated information of current mitochondrial genomes. When a new mitochondrial genome is included in the NCBI RefSeq collection (Pruitt et al., 2005), AMiGA automatically substitutes its corresponding non-RefSeq file with this new one, which has been curated. Then, the new genome passes by an AMiGA curation process to uniformize nomenclature and, if necessary, AMiGA notes are included to provide additional or more accurate information about the genome. This updating system provides AMiGA with the most recent information available for complete arthropodan mitochondrial genomes.


View this table:
[in this window]
[in a new window]
 
Table 1 AMiGA data organization

 
2.2 Operation
Genome searches in AMiGA can be done using keywords for a taxon, a species' common name, the GenBank accession number, author or journal names, or year of submission, with logical operators such as AND, OR and NOT. The search can also be done using an alphabetically ordered list of species or by taxonomical level. This flexibility in searching is determined by the way the information is stored in the database (Table 1). The results are sorted by the NCBI submission date, by the most recently updated NCBI files or by alphabetical order. Following the selection of an entry from a diversified set of keywords and additional gene choices, the user has five options of results: detailed information, FASTA files, genomic graphics, multiple alignments and primer annealing sites.

(1) Detailed information. The results of this search are displayed in a tree structure with a folder for each genome and sub-folders with relevant information about bibliographic references, nucleotide and amino acid sequences, codon usage tables, and taxonomy and genome graphical views.

(2) FASTA files. The user can download the full genome sequence, individual genes or the main non-coding region, for a diversified set of organisms. AMiGA generates one FASTA file containing all of the selected sequences and one file per sequence, making it possible to analyze several genes at once.

(3) Graphics. Two types of graphics, linear and circular, are available through this menu. These graphics are the same as the detailed information menu, except that they are displayed for all of the selected species at once on the same page. For the linear graphics, the user can choose which features to show among protein genes, RNAs, overlaps, miscellaneous features and intergenic regions.

(4) Alignment. A major component of the AMiGA database is an online, multiple global alignment tool for nucleotide and amino acid sequences for one or multiple genes using the software ClustalW (Thompson et al., 1994). When selecting multiple gene analysis, an alignment for each gene is provided, in addition to the concatenated output retrieved from these individual alignments.

(5) Primer Finder. AMiGA uses software PrimerMatch (www.cbcb.umd.edu) to find primer annealing sites throughout the selected genomes. Degenerate primers can be entered using IUPAC ambiguity codes. The potential annealing sites are shown over the genome graphics.

2.3 Tools
AMiGA also provides a set of useful tools such as

  • a standalone BLAST server (Altschul et al., 1990) for comparing user's sequences against a set of daily updated arthropodan mitochondrial genome or coding gene databases,
  • a chromatogram processing tool with a graphic interface based on phred (Ewing, et al., 1998),
  • a page with all recent AMiGA updates and inclusions, also accessible via RSS feed.

2.4 Password protected area
The AMiGA structure allows any user to create a password-protected area in which unpublished annotated genomes, complete or partial, can be submitted for analysis and comparison with the other genomes available in the database. A JavaScript script included in registration page generates the password's MD5 digest encryption at the submission to increase security.


    3 FUTURE DEVELOPMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 STRUCTURE AND OPERATION
 3 FUTURE DEVELOPMENTS
 REFERENCES
 
A broader database for metazoan mitochondrial genomes, MetAMiGA, has been built based on the original AMiGA platform and is available for testing on the AMiGA main page. AMiGA should contribute to our knowledge of arthropodan mitochondrial genomics and stimulate further research on organismal and organellar diversity and evolution.


    Acknowledgments
 
We thank Renato Vicentini dos Santos for helpful advice. This work and L.S.N. and A.C.L were supported by CNPq (PROFIX, grant no. 540.602/2001-9). P.C.F and A.M.L.A.E. were supported by FAPESP (grants no. 03/01458-9 and 04/09654-4).

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Alfonso Valencia

Received on September 28, 2005; revised on January 20, 2006; accepted on January 23, 2006

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 STRUCTURE AND OPERATION
 3 FUTURE DEVELOPMENTS
 REFERENCES
 

    Altschul, S. F., et al. (1990) Basic local alignment search tool. J. Mol. Biol, . 215, 403–410[CrossRef][Web of Science][Medline].

    Benson, D. A., et al. (2005) GenBank. Nucleic Acids Res, . 33, D34–D38[Abstract/Free Full Text].

    Boore, J. L. (1999) Animal mitochondrial genomes. Nucleic Acids Res, . 27, 1767–1780[Abstract/Free Full Text].

    Ewing, B. (1998) Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res, . 8, 175–185[Abstract/Free Full Text].

    Jameson, D., et al. (2003) OGRe: a relational database for comparative analysis of mitochondrial genomes. Nucleic Acids Res, . 31, 202–206[Abstract/Free Full Text].

    O'Brien, E. A., et al. (2003) GOBASE–a database of mitochondrial and chloroplast information. Nucleic Acids Res, . 31, 176–178[Abstract/Free Full Text].

    Pruitt, K. D., et al. (2005) NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res, . 33, 501–504[CrossRef].

    Stajich, J. E., et al. (2002) The Bioperl toolkit: Perl modules for the life sciences. Genome Res, . 12, 1611–1618[Abstract/Free Full Text].

    Thompson, J. D., et al. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res, . 22, 4673–4680[Abstract/Free Full Text].

    Vasconcelos, A. T., et al. (2005) MamMiBase: a mitochondrial genome database for mammalian phylogenetic studies. Bioinformatics, 21, 2566–2567[Abstract/Free Full Text].

    Wolfsberg, T. G., et al. (2001) Organelle genome resource at NCBI. Trends Biochem. Sci, . 26, 199–203[CrossRef][Web of Science][Medline].


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
22/7/902    most recent
btl021v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (8)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Feijão, P. C.
Right arrow Articles by Lessinger, A. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Feijão, P. C.
Right arrow Articles by Lessinger, A. C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?