Skip Navigation


Bioinformatics Advance Access originally published online on January 24, 2006
Bioinformatics 2006 22(7):887-888; doi:10.1093/bioinformatics/btl019
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
22/7/887    most recent
btl019v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Li, W.
Right arrow Articles by Godzik, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Li, W.
Right arrow Articles by Godzik, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

VISSA: a program to visualize structural features from structure sequence alignment

Weizhong Li * and Adam Godzik

Burnham Institute for Medical Research La Jolla, CA 92037, USA

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 RESULTS
 REFERENCES
 

Motivation: Multiple sequence alignment is an important tool to understand and analyze functions of homologous proteins. However, the logic of residue conservation/variation is usually apparent only in three-dimensional (3D) space, not on a primary sequence level. Thus, in a traditional multiple alignment it is often difficult to directly visualize and analyze key residues because they are masked by other residues along the alignment. Here we present an integrated multiple alignment and 3D structure visualization program that can (1) map and highlight residues from a 1D alignment onto a 3D structure and vice versa and (2) display only the alignment of preselected, key residues. This program, called Visualize Structure Sequence Alignment, also has many other built-in tools that can help analyze multiple sequence alignments.

Availability: http://bioinformatics.burnham.org/liwz/vissa

Contact: liwz{at}burnham.org


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 RESULTS
 REFERENCES
 
Multiple sequence alignment is an important tool to study the evolution and function of homologous proteins. Popular algorithms for creating multiple sequence alignment include ClustalW (Thompson et al., 1994), Dalign (Morgenstern, 1999), T-Coffee (Notredame et al., 2000), POA (Lee et al., 2002), muscle (Edgar, 2004) and many others. In addition, many programs to edit and visualize the alignments, such as ClustalX (Thompson et al., 1997), Mview (Brown et al., 1998), Jalview (Clamp et al., 2004) and BoxShade, are also available. They use various coloring schemes to visualize variations of amino acids and identify conserved residues.

A common problem in visualizing and displaying a multiple sequence alignment is that in many cases key residues may be close in three-dimensional (3D) space but not on a primary sequence level, so it is difficult to focus on them while they are embedded in a long sequence alignment. Another related problem, given a protein and its 3D structure, is finding the conservation or mutation pattern of certain residues of interest at a specific 3D position, which requires seamless integration between the multiple sequence alignment and the 3D structures.

Many programs for visualizing protein 3D structures cannot provide multiple sequence alignments. And the multiple sequence alignment programs mentioned above usually cannot display 3D structures. In fact, matching multiple sequence alignments and 3D structures is often not straightforward because (1) the sequence of the same protein from an alignment and a PDB file are often different due to missing or mutated residues or artificial tags, and (2) residue numbers are also often different. When sequences are long or 3D structures are large, matching residues back and forth is a very tedious and boring task.

Several recent programs, including Pfaat (Johnson et al., 2003), ViTO (Catherinot and Labesse, 2004) and Friend (Abyzov et al., 2005), introduced functions to simultaneously visualize multiple alignments and 3D structures. Pfaat was designed for protein family analysis purpose; ViTO's goal was to help refinement of sequence-structure alignment for homology modeling; and Friend was an integrated interface for visualization of multiple structures and sequences.

Here we present a new program that can visualize both multiple sequence alignments and 3D structures. Our goal was to help researchers handle many practical issues involving simultaneous analysis of both sequence alignments and 3D structures, for example, checking what the conservation of residues around the active site is. Compared to other programs, we put more efforts so that users can easily map, highlight, and analyze any subset of residues of interest in 3D structures and an alignment. It also provided functions to present analysis results of such subset with publication-quality image, for example, it can show a condensed alignment of only a subset. The program can also handle various discrepancies of sequences between PDB files and alignments.


    2 IMPLEMENTATION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 RESULTS
 REFERENCES
 
VIsualize Structure Sequence Alignment (VISSA) has the following components: a multiple sequence alignment generator, an alignment viewer, a 3D structure viewer, and supporting tools. In VISSA, we use ClustalW and Rasmol as the alignment generator and structure viewer, respectively, due to their wide user base and great availability. The alignment viewer and all other supporting tools are implemented from scratch.

Many supporting tools are also integrated into the VISSA program. For instance, residues between PDB structures and alignments are matched by a pairwise alignment between the sequence read from the PDB file and the sequence used in the alignment. So, it can handle many common sequence variation cases in PDB such as insertions, deletions, tags, mutations, and residue modifications.

VISSA was developed and tested on a RedHat Linux system, and was written in Perl. The alignment viewer has a graphic user interface to display alignments and to perform various user commands; it was implemented with Perl/Tk.


    3 RESULTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 RESULTS
 REFERENCES
 
The Figure 1 shows screen shots of several windows of VISSA program. It has one alignment window and can have any number of Rasmol windows. A user can select any set of residues from the alignment window and let the program highlight them in the Rasmol window, or the user can perform reverse mapping.


Figure 1
View larger version (32K):
[in this window]
[in a new window]
 
Fig. 1 Screen shots of the VISSA program. (a) Main alignment display window. The top sequence has a 3D structure, and its residue numbers (PDB numbering) are printed vertically below the sequence. Secondary structure elements, yellow for sheets and red for helices, are plotted. Selected sequences are highlighted in yellow in the left panel. Selected residues in the alignment are highlighted in gray in the right panel. Amino acids are colored at the consensus position (other coloring options are also available). (b) A window used to define a structural site in a PDB file. (c) A window for creating publication-quality alignments. Alignments can be wrapped to any user-defined width. This window can produce a graphic image or an HTML document, which can be further edited by editors such as Microsoft Word. (d) A condensed alignment containing only residues on a bind pocket. (e) Rasmol window. The residues highlighted in green correspond to the pocket residues in (d). The user can define different schemes to highlight any set of residues of interest in the Rasmol window.

 
The graphic user interface allows users to change font size, modify the coloring scheme of amino acids, switch the display of residue numbers, switch the display of certain sequences or residues, select proteins or residues, delete proteins, run or rerun alignments, edit an alignment, save results, export an alignment, define a structural site, define protein rendering options in Rasmol, change program parameters such as consensus threshold, and so on. The program has a tool that can cluster proteins into groups. A unique feature is that it can perform clustering based only on selected residues rather than on a full-length sequence.

This program will be under regular maintenance. We are also open to adding new functions in the future depending on users' suggestions.

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Dmitrij Frishman

Received on December 9, 2005; revised on January 18, 2006; accepted on January 23, 2006

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 RESULTS
 REFERENCES
 

    Abyzov, A., et al. (2005) Friend, an integrated analytical front-end application for bioinformatics. Bioinformatics, 21, 3677–3678[Abstract/Free Full Text].

    Brown, N.P., et al. (1998) MView: a web-compatible database search or multiple alignment viewer. Bioinformatics, 14, 380–381[Abstract/Free Full Text].

    Catherinot, V. and Labesse, G. (2004) ViTO: tool for refinement of protein sequence-structure alignments. Bioinformatics, 20, 3694–3696[Abstract/Free Full Text].

    Clamp, M., et al. (2004) The Jalview Java alignment editor. Bioinformatics, 20, 426–427[Abstract/Free Full Text].

    Edgar, R.C. (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res, . 32, 1792–1797[Abstract/Free Full Text].

    Johnson, J.M., et al. (2003) Protein family annotation in a multiple alignment viewer. Bioinformatics, 19, 544–545[Abstract/Free Full Text].

    Lee, C., et al. (2002) Multiple sequence alignment using partial order graphs. Bioinformatics, 18, 452–464[Abstract/Free Full Text].

    Morgenstern, B. (1999) DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment. Bioinformatics, 15, 211–218[Abstract/Free Full Text].

    Notredame, C., et al. (2000) T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol, . 302, 205–217[CrossRef][Web of Science][Medline].

    Thompson, J.D., et al. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res, . 22, 4673–4680[Abstract/Free Full Text].

    Thompson, J.D., et al. (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res, . 25, 4876–4882[Abstract/Free Full Text].


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
P. Zhou and Z. Shang
2D molecular graphics: a flattened world of chemistry and biology
Brief Bioinform, May 1, 2009; 10(3): 247 - 258.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
22/7/887    most recent
btl019v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Li, W.
Right arrow Articles by Godzik, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Li, W.
Right arrow Articles by Godzik, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?