Bioinformatics Advance Access originally published online on March 29, 2008
Bioinformatics 2008 24(9):1206-1207; doi:10.1093/bioinformatics/btn108
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Visualization of unfavorable interactions in protein folds
1Center of Applied Molecular Engineering, University of Salzburg, Hellbrunnerstrasse 34, 5020 Salzburg, Austria and 2Institute of Physical and Theoretical Chemistry, Wroc
aw University of Technology, Wybrze
e Wyspia
skiego 27, 50-370 Wroc
aw, Poland
*To whom correspondence should be addressed.
| ABSTRACT |
|---|
|
|
|---|
Summary: Three dimensional structures of proteins contain errors which often originate from limitations of the experimental techniques employed. Such errors frequently result in unfavorable atomic interactions. Here we present a new web service, called Interaction Viewer, for the visualization and correction of such errors. We show how the Interaction Viewer is used in combination with the NQ-Flipper service to spot strained asparagine and glutamine rotamers and we emphasize the convenience of this service in correcting such errors.
Availability: The web service is integrated with the NQ-Flipper service and accessible at http://flipper.services.came.sbg.ac.at
Contact: sippl{at}came.sbg.ac.at
Protein structures contain a variety of errors. Such errors often originate from experimental limitations like poor and diffuse electron densities derived from X-ray analysis, or insufficient and ambiguous distance constraints obtained from nuclear magnetic resonance (NMR) experiments. A specific example is the asparagine (Asn) and glutamine (Gln) rotamer problem in X-ray analysis. The electron density of these residues is frequently compatible with two distinct rotamer states. In such cases experimental information is insufficient to determine the identities of oxygen and nitrogen atoms in the amide groups of these side-chains. However, in many cases these states are clearly distinguished by favorable or unfavorable interactions with the surrounding atoms and it has been repeatedly demonstrated that such errors can be identified quite reliably (Davis et al., 2007; Hooft et al., 1996; McDonald and Thornton, 1995; Weichenberger and Sippl, 2006a,b, 2007).
It is highly desirable that errors in protein structures are removed on a regular basis which requires that current protocols in experimental structure determination are supplemented by additional error correcting cycles. The primary output of available error recognition programs consists of scores which indicate unfavorable interactions, incorrect atom positions or other problems (Sippl, 1993; Weichenberger and Sippl, 2006b, Wiederstein and Sippl, 2007). The interpretation of such scores is usually straightforward but requires some understanding of the principles and inner workings of the software. Moreover, the skeptical user may not trust numeric scores so that it is necessary to visualize flagged errors and the associated atomic conformations in three dimensions. In general this is a rather cumbersome exercise, which is hampered by the overall complexity of protein structures. To facilitate error recognition, to alleviate acceptance of tagged errors, and to foster error correction it is therefore, necessary to provide automated tools for the visualization of offending interactions in atomic detail. In such closeup views erroneous atomic conformations are often revealed in a most drastic way so that the nature of errors becomes obvious.
We have implemented a new software application, called Interaction Viewer, for the immediate visualization of strained atomic conformations in protein structures which meets these requirements. We have now integrated the Interaction Viewer with the NQ-Flipper web service (Weichenberger and Sippl, 2006a) so that strained interactions of Asn and Gln residues with surrounding atoms are quickly visualized in three dimensions by selecting the respective radio button from the NQ-Flipper score list. The chief goal of this communication is to provide a primer for the use of the Interaction Viewer (Fig. 1), to emphasize once more the high frequency of incorrect Asn/Gln rotamers found in protein structures, and to stress the simplicity in correcting such errors. Moreover, we encourage the structural biology community to use this and related services to correct obvious errors in newly determined protein structures before they are submitted to the PDB repository (Berman et al., 2000).
|
The error rate in the assignment of Asn or Gln side-chain rotamers is rather impressive. On average the structures deposited in PDB have more than 20% incorrectly assigned Asn and Gln rotamers. This number has to be compared to random assignments where the expected error rate is 50%. These errors in rotamer assignment are responsible for more than half a million incorrect atom positions in the PDB repository. In particular, even high-quality structures determined to a resolution of 1.5Å or better generally contain rotamers that violate basic physico-chemical principles. Another unexpected result is that the error rate found in structures determined by NMR exceeds the error rate of X-ray structures (Weichenberger and Sippl, 2006b). This is surprising since the structures determined by NMR are based on the interpretation of interatomic distance constraints rather than electron densities.
Over many years the rate of incorrect rotamers found in the weekly releases of new structures in PDB has remained at this high level. Given that there are now several on-line tools available to detect and to correct such errors it is indeed surprising that these tools are virtually neglected by the X-ray and NMR communities. But we note one exception. In a recent survey of structures deposited by the various structural genomics initiatives we observed the very same error rate of 20% incorrect rotamers for all consortia except for the Joint Center for Structural Genomics whose comparatively small average error rate of 10% is outstanding.
The individual side-chain amides of Gln and Asn have the capacity to form four hydrogen bonds. A flip to the incorrect rotamer frequently results in most unfavorable electrostatic interactions. Although the importance of hydrogen bonds is undisputed, their role in protein folding is a controversial issue due to the many energetic and entropic factors that are thought to contribute to their stability (Baldwin, 2007). Hydrogen bonds and other interactions in proteins are conveniently studied by potentials of mean force (Sippl, 1990, 1995, 2006). They provide quantitative information on the variation in energy as a function of atom pair distances. The potentials are used by the NQ-Flipper application. In Figure 1 we provide an example of a recently released high-resolution structure containing more than 40% unfavorable rotamers. These rotamers are detected by the NQ-Flipper service. Their subsequent visualization reveals the nature of the unfavorable interactions in atomic detail. Such violations are offending in the sense that they violate the most basic physico-chemical principles. Incorrect rotamers frequently have inverted hydrogen bonds where several pairs of like charges are in close proximity. Such configurations are properly called anti-hydrogen bonds. Anti-hydrogen bonds are impossible to achieve in reality since the associated high energies lead to the immediate disintegration of the respective protein structure.
Conflict of Interest: none declared.
| FOOTNOTES |
|---|
Associate Editor: Anna Tramontano
Received on January 25, 2008; revised on March 10, 2008; accepted on March 23, 2008
| REFERENCES |
|---|
|
|
|---|
Baldwin RL. Energetics of protein folding. J. Mol. Biol (2007) 371:283–301.[CrossRef][Web of Science][Medline]
Berman HM, et al. The protein data bank. Nucleic Acids Res (2000) 28:235–242.
Cotelesage JJH, et al. How does an enzyme recognize CO2? Int. J. Biochem. Cell Biol (2007) 39:1204–1210.[CrossRef][Web of Science][Medline]
Davis IW, et al. Molprobity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res (2007) 35:W375–W383.
Hooft RW, et al. Errors in protein structures. Nature (1996) 381:272.[Medline]
McDonald IK, Thornton JM. The application of hydrogen bonding analysis in X-ray crystallography to help orientate asparagine, glutamine and histidine side chains. Protein Eng (1995) 8:217–224.
Sippl MJ. Calculation of conformational ensembles from potentials of mean force. J. Mol. Biol (1990) 213:859–883.[Web of Science][Medline]
Sippl MJ. Recognition of errors in three-dimensional structures of proteins. Proteins (1993) 17:355–362.[CrossRef][Web of Science][Medline]
Sippl MJ. Knowledge-based potentials for proteins. Curr. Opin. Struct. Biol (1995) 5:229–235.[CrossRef][Web of Science][Medline]
Sippl M. Protein structure prediction and molecular forces, Chapter 5. In: Protein Folding-Misfolding: Some Current Concepts of Protein Chemistry.—Zbilut JP, Scheibel T, eds. (2006) Hauppauge, NY: Nova publishers. 117–140.
Weichenberger CX, Sippl MJ. NQ-Flipper: validation and correction of asparagine/glutamine amide rotamers in protein crystal structures. Bioinformatics (2006a) 22:1397–1398.
Weichenberger CX, Sippl MJ. Self-consistent assignment of asparagine and glutamine amide rotamers in protein crystal structures. Structure (2006b) 14:967–972.[Medline]
Weichenberger CX, Sippl MJ. NQ-Flipper: recognition and correction of erroneous asparagine and glutamine side-chain rotamers in protein structures. Nucleic Acids Res (2007) 35:W403–W406.
Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res (2007) 35:W407–W410.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
