Bioinformatics Advance Access originally published online on January 5, 2006
Bioinformatics 2006 22(5):623-625; doi:10.1093/bioinformatics/btk023
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
OPM: Orientations of Proteins in Membranes database
1College of Literature, Science and the Arts, University of Michigan Ann Arbor, MI 48109-1065, USA
2College of Pharmacy, University of Michigan Ann Arbor, MI 48109-1065, USA
*To whom correspondence should be addressed.
| ABSTRACT |
|---|
|
|
|---|
Summary: The Orientations of Proteins in Membranes (OPM) database provides a collection of transmembrane, monotopic and peripheral proteins from the Protein Data Bank whose spatial arrangements in the lipid bilayer have been calculated theoretically and compared with experimental data. The database allows analysis, sorting and searching of membrane proteins based on their structural classification, species, destination membrane, numbers of transmembrane segments and subunits, numbers of secondary structures and the calculated hydrophobic thickness or tilt angle with respect to the bilayer normal. All coordinate files with the calculated membrane boundaries are available for downloading.
Availabililty: http://opm.phar.umich.edu
Contact: almz{at}umich.edu
| INTRODUCTION |
|---|
|
|
|---|
There are hundreds of integral and peripheral membrane proteins with known three-dimensional (3D) structure deposited in the Protein Data Bank (PDB, Berman et al., 2000), but their precise positioning in the lipid bilayer is missing. This positioning is essential for biological activity, intermolecular interactions, stability and folding of membrane protein complexes, and it has been studied by a variety of experimental methods, including chemical modification, fluorescence, spin-labeling, X-ray scattering, neutron diffraction, electron cryo-microscopy and NMR or infrared spectroscopy for several dozen cases, such as rhodopsin, lactose permease, mechanosensitive and potassium channels or C2 domains (Frilingos et al., 1998; Lee, 2003; Hubbell et al., 2003; Malmberg and Falke, 2005). However, since the amount of such experimental data is still very limited, this problem should be addressed computationally to keep up with the expanding flow of structures in the PDB.
Orientations of proteins in membranes may be theoretically calculated by minimizing a protein's transfer energy from water to a planar slab that serves as a crude approximation of the membrane hydrocarbon core (Rees et al., 1989). The transfer energy can be estimated in different ways, including whole-residue hydrophobicity scales (Garlic, Zucic and Juretic, 2004), the normalized accessible surface area of non-polar residues (TMDET, Tusnady et al., 2004) or atomic solvation parameters (IMPALA, Basyn et al., 2003). One of these methods, TMDET, has been applied recently to detect all transmembrane (TM), but not monotopic or peripheral, proteins in the PDB, which were deposited in the PDB_TM database (Tusnady et al., 2005). However, the calculated orientations of TM proteins in PDB_TM were not verified through experimental data.
In an attempt to develop a method that better agrees with experimental studies, we designed a more elaborate computational approach for optimizing the spatial arrangement of proteins in membranes. This method combines atomic solvation parameters for the waterdecadiene system, interfacial polarity profiles in membranes determined in EPR studies, ionization energies of charged residues and elimination of energetic contributions from any atoms situated in the polar pores or channels of TM proteins that do not interact with lipids (A. L. Lomize, I. D. Pogozheva, M. A. Lomize and H. I. Mosberg, manusript submitted). The developed theoretical approach discriminates between TM and water-soluble proteins and determines the positions of TM proteins with a precision of
1 Å for the hydrophobic thickness and
2° for the tilt angle relative to the membrane normal. Most importantly, our results are in good agreement with experimental studies of 24 TM proteins, though they are less consistent with results of other computational methods, such as TMDET (Tusnady et al., 2004) or IMPALA (Basyn et al., 2003) (see more details at OPM website).
OPM database
The OPM database has several important features. First, the calculated spatial arrangements of TM proteins in membranes were verified by a large sample of published experimental data. Second, protein orientations were calculated for quaternary complexes (biological units) rather than individual subunits or domains. Complexes were generated by the PQS server (Henrick and Thornton, 1998) and verified through the literature to exclude functionally irrelevant oligomers found in crystals, as in the PDB_TM database (Tuesday et al., 2004). Third, we included a small initial set of 33 integral monotopic and peripheral membrane proteins, which will be significantly expanded in the future. Fourth, all protein complexes were classified based on the structure of their main membrane-associated domains. Our classification has four hierarchal levels: type (TM or peripheral/monotopic protein and peptides), class (all-
, all-ß,
+ß,
/ß), superfamily (evolutionarily related proteins) and family (proteins with clear sequence homology). The superfamilies and families were taken from SCOP (Andreeva et al., 2004), with some corrections, and included proteins not present in the latest release of SCOP. TM superfamilies and families (except enzymes and structural proteins) were ordered by their biological function, as in TCDB (Busch and Saier, 2004), starting from complexes involved in photosynthesis, respiration and other primary active transport processes. Finally, we included the tilt angle of the protein relative to the membrane normal, the hydrophobic thickness, the transfer energy of the protein from water to the membrane, the topology, the type of destination membrane and other parameters, which are not provided together in any other resource.
Similar to PDB_TM, OPM provides an up-to-date list of TM proteins with their hydrophobic boundaries. Our current release includes 126 unique 3D structures that represent 506 PDB entries. Typically, a complex with the most complete quaternary structure or one determined with the highest resolution is selected as a representative model. Other structures, such as mutants or conformational states of the same protein, are included in OPM as related PDB entries. However, six types of structures are temporarily excluded: (1) complexes with many unassigned residues and sets of backbone coordinates; (2) low-resolution electron microscopy-based models; (3) incomplete or non-functional assemblies, such as peptide fragments, monomeric units of TM channels (protegrin, mellitin, alamethicin, zervamicin, etc.) or double helices of gramicidin A; (4) NMR models derived from orientational rather than distance constraints; (5) ionophores (valionomycin, monesin, etc.) and (6) theoretical models.
Data access and visualization
OPM allows either searching of proteins by their name or PDB ID or sorting of proteins in tables for many specific categories (type, class, superfamily, family, destination membrane or biological source). Sorting within tables may be executed on the proteins' name, PDB ID, number of TM helices or subunits, number of secondary structures, hydrophobic thickness (membrane penetration depth for peripheral proteins), tilt angle, transfer energy, and structural family, biological source or destination membrane.
An individual web page is generated for each membrane protein complex, as shown in Figure 1, with pictures prepared with QUANTA (Accelrys Inc.). 3D visualization is available through Jmol, MDL Chime and WebMol (Walther, 1997). Coordinate files of all proteins with calculated membrane boundary planes are available for downloading separately for each protein or as a whole dataset. Membrane hydrophobic core planes are marked by dummy atoms of nitrogen for the inner and oxygen atoms for the outer membrane sides according to topology definitions in MPtopo (Jayasinghe et al., 2001). OPM is made with PHP, MySQL and uses the Smarty template framework, which separates the program logic (PHP, MySQL) and presentation (XHTML, CSS, JavaScript), and enables caching. OPM is manually curated and will be updated regularly.
|
| Acknowledgments |
|---|
We thank Jim Zajkowski for technical expertise. This work was supported by NIH grant DA003910 and the Upjohn Research Award from the College of Pharmacy, University of Michigan.
Conflict of Interest: none declared.
| FOOTNOTES |
|---|
Associate Editor: Keith A Crandall
Received on November 28, 2005; revised on December 20, 2005; accepted on December 21, 2005
| REFERENCES |
|---|
|
|
|---|
Andreeva, A., et al. (2004) SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res, . 32, D226D229
Basyn, F., et al. (2003) Insertion of X-ray structures of proteins in membranes. J. Mol. Graph. Model, . 22, 1121[CrossRef][Medline].
Berman, H.M., et al. (2000) The Protein Data Bank. Nucleic Acids Res, . 28, 235242
Busch, W., et al. (2004) The IUBMB-endorsed transporter classification system. Mol. Biotechnol, . 27, 253262[CrossRef][ISI][Medline].
Frillingos, S., et al. (1998) Cys-scanning mutagenesis: a novel approach to structure function relationships in polytopic membrane proteins. FASEB J, . 12, 12811299
Henrick, K. and Thornton, J.M. (1998) PQS: a protein quaternary structure file server. Trends Biochem. Sci, . 23, 358361[CrossRef][ISI][Medline].
Hubbell, W.L., et al. (2003) Rhodopsin structure, dynamics, and activation: a perspective from crystallography, site-directed spin labeling, sulfhydryl reactivity, and disulfide cross-linking. Adv. Protein Chem, . 63, 243290[ISI][Medline].
Jayasinghe, S., et al. (2001) MPtopo: A database of membrane protein topology. Protein Sci, . 10, 455458
Lee, A.G. (2003) Lipidprotein interactions in biological membranes: a structural perspective. Biochim. Biophys. Acta, . 1612, 140[Medline].
Malmberg, N.J. and Falke, J.J. (2005) Use of EPR power saturation to analyze the membrane-docking geometries of peripheral proteins: applications to C2 domains. Ann. Rev. Biophys. Biomol. Struct, . 34, 7190[CrossRef][ISI][Medline].
Rees, D.C., et al. (1989) The bacterial photosynthetic reaction center as a model for membrane proteins. Annu. Rev. Biochem, . 58, 607633[ISI][Medline].
Tusnady, G.E., et al. (2004) Transmembrane proteins in the Protein Data Bank: identification and classification. Bioinformatics, 20, 29642972
Tusnady, G.E., et al. (2005) PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank. Nucleic Acids Res, . 33, D275D278
Walther, D. (1997) WebMola Java-based PDB viewer. Trends Biochem. Sci, . 22, 274275[CrossRef][ISI][Medline].
Zucic, D. and Juretic, D. (2004) Precise annotation of transmembrane segments with garlica free molecular visualization program. Croat. Chem. Acta, 77, 397401.
This article has been cited by other articles:
![]() |
H. Viklund and A. Elofsson OCTOPUS: improving topology prediction by two-track ANN-based preference scores and an extended topological grammar Bioinformatics, August 1, 2008; 24(15): 1662 - 1668. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Jo, M. Vargyas, J. Vasko-Szedlar, B. Roux, and W. Im PBEQ-Solver for online visualization of electrostatic potential of biomolecules Nucleic Acids Res., July 1, 2008; 36(suppl_2): W270 - W275. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Bernsel, H. Viklund, J. Falk, E. Lindahl, G. von Heijne, and A. Elofsson Prediction of membrane-protein topology from first principles PNAS, May 20, 2008; 105(20): 7177 - 7181. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Park and V. Helms Prediction of the translocon-mediated membrane insertion free energies of protein sequences Bioinformatics, May 15, 2008; 24(10): 1271 - 1277. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Marsh Energetics of Hydrophobic Matching in Lipid-Protein Interactions Biophys. J., May 15, 2008; 94(10): 3996 - 4013. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Randall, J. Cheng, M. Sweredoski, and P. Baldi TMBpro: secondary structure, {beta}-contact and tertiary structure prediction of transmembrane {beta}-barrel proteins Bioinformatics, February 15, 2008; 24(4): 513 - 520. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Papaloukas, E. Granseth, H. Viklund, and A. Elofsson Estimating the length of transmembrane helices using Z-coordinate predictions Protein Sci., February 1, 2008; 17(2): 271 - 278. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Dancea, K. Kami, and M. Overduin Lipid Interaction Networks of Peripheral Membrane Proteins Revealed by Data-Driven Micelle Docking Biophys. J., January 15, 2008; 94(2): 515 - 524. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. E. Tusnady, L. Kalmar, and I. Simon TOPDB: topology data bank of transmembrane proteins Nucleic Acids Res., January 11, 2008; 36(suppl_1): D234 - D239. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Marsh Lateral Pressure Profile, Spontaneous Curvature Frustration, and the Incorporation and Conformation of Proteins in Membranes Biophys. J., December 1, 2007; 93(11): 3884 - 3899. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Cherezov, D. M. Rosenbaum, M. A. Hanson, S. G. F. Rasmussen, F. S. Thian, T. S. Kobilka, H.-J. Choi, P. Kuhn, W. I. Weis, B. K. Kobilka, et al. High-Resolution Crystal Structure of an Engineered Human 2-Adrenergic G Protein Coupled Receptor Science, November 23, 2007; 318(5854): 1258 - 1265. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Bhardwaj, R. V. Stahelin, G. Zhao, W. Cho, and H. Lu MeTaDoR: a comprehensive resource for membrane targeting domains and their host proteins Bioinformatics, November 15, 2007; 23(22): 3110 - 3112. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Luan, M. Caffrey, and A. Aksimentiev Structure Refinement of the OpcA Adhesin Using Molecular Dynamics Biophys. J., November 1, 2007; 93(9): 3058 - 3069. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Marsh, M. Jost, C. Peggion, and C. Toniolo Lipid Chain-Length Dependence for Incorporation of Alamethicin in Membranes: Electron Paramagnetic Resonance Studies on TOAC-Spin Labeled Analogs Biophys. J., June 1, 2007; 92(11): 4002 - 4011. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Park and V. Helms On the derivation of propensity scales for predicting exposed transmembrane residues of helical membrane proteins Bioinformatics, March 15, 2007; 23(6): 701 - 708. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Marsh, M. Jost, C. Peggion, and C. Toniolo TOAC Spin Labels in the Backbone of Alamethicin: EPR Studies in Lipid Membranes Biophys. J., January 15, 2007; 92(2): 473 - 481. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Lomize, I. D. Pogozheva, M. A. Lomize, and H. I. Mosberg Positioning of proteins in membranes: A computational approach. Protein Sci., June 1, 2006; 15(6): 1318 - 1333. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Pali, D. Bashtovyy, and D. Marsh Stoichiometry of lipid interactions with transmembrane proteins--Deduced from the 3D structures. Protein Sci., May 1, 2006; 15(5): 1153 - 1161. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






