Bioinformatics Advance Access originally published online on July 26, 2005
Bioinformatics 2005 21(17):3535-3540; doi:10.1093/bioinformatics/bti569
A statistical model for HIV-1 sequence classification using the subtype analyser (STAR)
1Department of Immunology and Molecular Pathology, University College London
2Department of Infection, University College London
3Department of Biochemistry, University College London
*To whom correspondence should be addressed.
Motivation: HIV-1 antiretroviral drug resistance testing produces large numbers of HIV-1 protease and reverse transcriptase sequences. These provide an excellent resource for studying the incidence, spread and clinical significance of HIV-1 subtypes. We have produced a program, Subtype Analyser (STAR), that rapidly and accurately subtypes HIV-1. Here we have determined a robust and statistically validated model for subtype assignment.
Results: We have significantly extended our HIV-1 subtyping tool (STAR) so that each query sequence, when evaluated against subtype profile alignments, returns a discriminating score based on the ratio of subtype-positive to subtype-negative amino acid positions. These scores were transformed into a Z-score distribution and evaluated. Of the 141 sequences used to define the subtype alignments, 98% were correctly reclassified. Inclusion of additional recombination detection within STAR increased the detection of known recombinant sequences to 95%.
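The scoring idea summarised above can be illustrated with a minimal sketch. This is not the STAR implementation: the abstract does not define how positive and negative positions are counted, so the sketch assumes that a position is "positive" when the query residue is among those listed for the subtype and "negative" otherwise. The function names, the profile layout (a mapping from position to a set of residues), the add-one pseudocount and the reference score list are all hypothetical.

```python
from statistics import mean, stdev


def discriminating_score(query, profile):
    """Ratio of subtype-positive to subtype-negative positions (illustrative only).

    query   -- aligned amino acid sequence as a string
    profile -- hypothetical dict: 0-based position -> set of residues
               assumed to be characteristic of one subtype
    """
    positive = 0
    negative = 0
    for pos, residues in profile.items():
        # Skip positions missing from the query, gaps and unknown residues.
        if pos >= len(query) or query[pos] in "-X":
            continue
        if query[pos] in residues:
            positive += 1
        else:
            negative += 1
    # Add-one pseudocount (an assumption) avoids division by zero.
    return (positive + 1) / (negative + 1)


def z_score(score, reference_scores):
    """Standardise a score against a reference distribution of scores."""
    mu = mean(reference_scores)
    sigma = stdev(reference_scores)  # sample standard deviation
    return (score - mu) / sigma


# Toy data, invented purely for illustration.
toy_profile = {0: {"P"}, 1: {"Q"}, 2: {"I"}, 3: {"T"}}
toy_score = discriminating_score("PQVT", toy_profile)
toy_z = z_score(toy_score, [1.0, 2.0, 3.0])
```

In this sketch a query would be scored against each subtype profile in turn, and the subtype with the highest Z-score would be the candidate assignment. The thresholds and the recombination check used by STAR are not described in the abstract and are not modelled here.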
Availability: STAR is available as compiled (Linux Fedora 3) or source code from http://pgv19.virol.ucl.ac.uk/download/star_linux.tar
Contact: p.kellam{at}ucl.ac.uk
Supplementary Information: http://pgv19.virol.ucl.ac.uk/download/star_supplement
Received on March 23, 2005; revised on June 23, 2005; accepted on June 30, 2005