Skip Navigation



Bioinformatics Advance Access published online on November 30, 2004

Bioinformatics, doi:10.1093/bioinformatics/bti174
Bioinformatics © Oxford University Press 2004; all rights reserved
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
21/8/1349    most recent
bti174v2
bti174v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Rose, J. R.
Right arrow Articles by Keele, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rose, J. R.
Right arrow Articles by Keele, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Received August 2, 2004
Revised November 10, 2004
Accepted November 21, 2004

Article

Correlation of amino acid preference and mammalian viral genome type

John R. Rose 1*, William H. Turkett Jr.1, Iulia C. Oroian 1, William W. Laegreid 2, and John Keele 2

1 Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208
2 U.S. Meat Animal Research Center, U.S. Department of Agriculture/Agricultural Research Service, Clay Center, Nebraska 68933

* To whom correspondence should be addressed.
John R. Rose, E-mail: rose{at}cse.sc.edu


   Abstract

Motivation: In the event of an outbreak of a disease caused by an initially unknown pathogen, the ability to characterize anonymous sequences prior to isolation and culturing of the pathogen will be helpful. We show that it is possible to classify viral sequences by genome type (dsDNA, ssDNA, ssRNA positive strand, ssRNA negative strand, retroid) using amino acid distribution.

Results: In this paper we describe the results of analysis of amino acid preference in mammalian viruses. The study was carried out at the genome level as well as two shorter sequence levels: short (300 amino acids) and medium length (660 amino acids). The analysis indicates a correlation between the viral genome types dsDNA, ssDNA, ssRNA positive strand, ssRNA negative strand, and retroid and amino acid preference. We investigated three different models of amino acid preference. The simplest amino acid preference model, 1-AAP, is a normalized description of the frequency of amino acids in genomes of a viral genome type. A slightly more complex model is the ordered pair amino acid preference model (2-AAP), which characterizes genomes of different viral genome types by the frequency of ordered pairs of amino acids. The most complex and accurate model is the ordered triple amino acid preference model (3-AAP), which is based on ordered triples of amino acids. The results demonstrate that mammalian viral genome types differ in their amino acid preference.

Availability: The tools used to format and analyze data and supplementary material are available at http://www.cse.sc.edu/~rose/aminoPreference/index.html.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.