Bioinformatics Advance Access originally published online on August 9, 2006
Bioinformatics 2006 22(19):2340-2347; doi:10.1093/bioinformatics/btl395
Solvated docking: introducing water into the modelling of biomolecular complexes
Bijvoet Center for Biomolecular Research, Science Faculty, Utrecht University 3584CH, Utrecht, The Netherlands
*To whom correspondence should be addressed.
| ABSTRACT |
|---|
Motivation: Interfacial water, which plays an important role in mediating biomolecular interactions, has been neglected in the modelling of biomolecular complexes.
Methods: We present a solvated docking approach that explicitly accounts for the presence of water in protein–protein complexes. Our solvated docking protocol is based on the concept of the first encounter complex, in which a water layer is present between the molecules. It mimics the pathway from this initial complex towards the final assembly, in which most waters have been expelled from the interface. Docking is performed from solvated biomolecules, and waters are removed in a biased Monte Carlo procedure based on water-mediated contact propensities obtained from an analysis of high-resolution crystal structures.
Results: We demonstrate the feasibility of this approach for protein–protein complexes representing both wet and dry interfaces. Solvated docking leads to improvements both in quality and scoring. Water molecules are recovered that closely match the ones in the crystal structures.
Availability: Solvated docking will be made available in the future release of HADDOCK version 2.0 (http://www.nmr.chem.uu.nl/haddock).
Contact: a.m.j.j.bonvin{at}chem.uu.nl
Supplementary information: Supplementary Data are available at Bioinformatics Online.
| 1 INTRODUCTION |
|---|
The modelling of protein–protein complexes by means of docking (a computational approach which models the unknown structure of a complex from its constituents) has become increasingly popular, as witnessed by the CAPRI (Critical Assessment of PRedicted Interactions) experiment (Mendez et al., 2005). Docking approaches have benefited from knowledge obtained by detailed analyses of binding interfaces (Halperin et al., 2002; van Dijk et al., 2005a). As discussed in a recent review, water molecules are expected to influence the assembly of biomolecular complexes (Chandler, 2005), and, as such, to be important for protein–protein docking. An analysis based on Voronoi volume showed that only upon inclusion of interfacial solvent molecules are protein–protein interfaces as densely packed as protein interiors (Lo Conte et al., 1999). So far, however, water has generally been neglected in biomolecular docking. Its role and importance in single proteins have been discussed (Rashin et al., 1986; Wade et al., 1993; Wade and Goodford, 1993; Hubbard et al., 1994; Robert and Ho, 1995; Raschke, 2006) and several case studies have analysed its conservation in 3D structures of homologues (Sreenivasan and Axelsen, 1992; Zhang and Matthews, 1994; Robert and Ho, 1995; Tame et al., 1996; Carugo, 1999; Carugo and Bordo, 1999; Loris et al., 1999; Babor et al., 2002; Houborg et al., 2003; Mustata and Briggs, 2004). There has also been considerable interest in identifying and predicting the positions of water molecules in known structures: this can be performed quite successfully, for example, by GRID (Boobbyer et al., 1989; Wade et al., 1993; Wade and Goodford, 1993) or Fold-X (Schymkowitz et al., 2005). These kinds of approaches, however, are not well suited for docking purposes, since the structure of the complex is not known a priori. Ideally, water should be accounted for directly during the docking process, since its presence might affect the resulting models. So far this has only been done for protein–ligand (Rejto and Verkhivker, 1997; Rarey et al., 1999; Osterberg et al., 2002; Yang and Chen, 2004; Verdonk et al., 2005) and nucleic acid–ligand docking (Moitessier et al., 2006).
Only very recently has the role of water molecules at protein–protein interfaces been investigated. A hydrogen bonding potential for water-mediated contacts, in combination with a solvated rotamer library for describing side chain conformations, has been shown to predict rather successfully the positions of water molecules in complexes with known structures (Jiang et al., 2005). In another study (Rodier et al., 2005), various properties of interfacial water molecules, such as residue preference and their number per unit of interface area, were investigated.
We have experimented previously with the inclusion of water in the NMR structure calculation of a protein–non-specific DNA complex (Kalodimos et al., 2004): in that case, an extensive set of NOEs could be used, which forced the solvated biomolecules to come together and the unnecessary waters to leave the interface in a simulated annealing molecular dynamics approach. In general, in docking, this kind of experimental information is not available and, in the absence of a driving force, the water molecules will remain trapped at the interface. Alternative approaches are thus needed to remove the unnecessary water molecules from the interface. We have developed for this purpose a solvated docking protocol implemented in our data-driven docking approach HADDOCK (Dominguez et al., 2003) and demonstrate here for the first time that water can be explicitly included in protein–protein docking.
| 2 METHODS |
|---|
2.1 Database analysis
In order to obtain information on water in high-resolution crystal structures of complexes, the non-redundant dataset of Keskin et al. (2004) was analysed using CNS (Brunger et al., 1998) and a set of home-written Python scripts. Interface residues were defined as residues having at least one heavy-atom contact with a residue from the partner chain, within a 10 Å cut-off distance. Water-mediated contacts were defined between pairs of interface residues, provided a water molecule makes at least one heavy-atom contact within 5 Å with both residues. Water-mediated contacts were designated main chain when at least one contact was made via a backbone atom; otherwise they were designated side chain.
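The two distance criteria above can be illustrated with a minimal Python sketch. This is not the authors' script: the data layout (dictionaries mapping a residue label to a list of heavy-atom coordinates) and the function names are assumptions made for illustration only.

```python
import math
from itertools import product

INTERFACE_CUTOFF = 10.0  # Å, heavy-atom contact defining interface residues
WATER_CUTOFF = 5.0       # Å, water oxygen to residue heavy atom


def min_distance(atoms_a, atoms_b):
    """Smallest distance between two lists of (x, y, z) coordinates."""
    return min(math.dist(a, b) for a, b in product(atoms_a, atoms_b))


def interface_residues(chain, partner):
    """Residues of `chain` with a heavy-atom contact to any partner residue."""
    return [
        res for res, atoms in chain.items()
        if any(min_distance(atoms, other) <= INTERFACE_CUTOFF
               for other in partner.values())
    ]


def water_mediated_contacts(chain_a, chain_b, waters):
    """Return sorted (res_a, res_b) pairs bridged by at least one water oxygen.

    chain_a, chain_b: dict residue label -> list of heavy-atom coordinates
    waters: list of water-oxygen coordinates (hypothetical input layout)
    """
    iface_a = interface_residues(chain_a, chain_b)
    iface_b = interface_residues(chain_b, chain_a)
    contacts = set()
    for water in waters:
        near_a = [r for r in iface_a
                  if min_distance(chain_a[r], [water]) <= WATER_CUTOFF]
        near_b = [r for r in iface_b
                  if min_distance(chain_b[r], [water]) <= WATER_CUTOFF]
        contacts.update(product(near_a, near_b))
    return sorted(contacts)
```

The main chain versus side chain designation would additionally need atom names, which this sketch leaves out.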
To investigate whether the various types of water-mediated contacts adopt specific, well-defined conformations, we clustered them on the basis of positional RMSD values: the RMSD values were calculated after least-squares positional fitting on the coordinates of the water oxygen, its contacting heavy atoms within 5 Å on both chains and their respective first bonded partner (total of five atoms). Since several atoms of a given side chain can make contacts with the water oxygen atom within 5 Å, various combinations of atoms were tested for the calculation of the RMSD matrix and the one resulting in the best clustering (most populated first cluster) was selected for each amino acid–amino acid pair. Clustering was performed separately for main chain–water–main chain, side chain–water–side chain and main chain–water–side chain contacts. In the case of main chain contacts, N and O were defined as contacting atoms, with CA and C, respectively, as bonded neighbours.
RMSDs were calculated using g_rms (Lindahl et al., 2001) and Profit (www.bioinf.org.uk/software/profit). Clustering was performed using the greedy algorithm described by Daura et al. (1999), with a cut-off of 1.5 Å. This cut-off was based on an analysis of the distribution of all RMSD values (data not shown). Contacts involving two close waters that would fall into the same cluster were counted only once.
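The greedy clustering scheme of Daura et al. (1999) can be sketched as follows, assuming a precomputed symmetric RMSD matrix. The function name and the tie-breaking rule (lowest index wins) are illustrative choices, not taken from the paper.

```python
def greedy_cluster(rmsd, cutoff=1.5):
    """Greedy neighbour-count clustering on a symmetric RMSD matrix.

    At each pass, the unassigned structure with the most unassigned
    neighbours within `cutoff` becomes a cluster centre; it and its
    neighbours are removed from the pool, and the procedure repeats.
    Returns a list of (centre, members) tuples, largest cluster first.
    """
    remaining = set(range(len(rmsd)))
    clusters = []
    while remaining:
        neighbours = {
            i: [j for j in remaining if rmsd[i][j] <= cutoff]
            for i in remaining
        }
        # Ties are broken in favour of the lowest index (an arbitrary choice).
        centre = max(sorted(remaining), key=lambda i: len(neighbours[i]))
        members = sorted(neighbours[centre])
        clusters.append((centre, members))
        remaining -= set(members)
    return clusters
```

With a 1.5 Å cut-off, the size of the first cluster returned corresponds to the "most populated first cluster" criterion used to pick the best atom combination.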
2.2 Protein–protein docking using explicit water
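HADDOCK incorporates information about the interface in ambiguous interaction restraints (AIRs) that drive the docking. An AIR is defined as an ambiguous intermolecular distance ($d_{iAB}$) with a maximum value of typically 2 Å between any atom $m$ of an active residue $i$ of protein A ($m_{iA}$) and any atom $n$ of both active and passive residues $k$ ($N_{res}$ in total) of protein B ($n_{kB}$) (and inversely for protein B). The effective distance $d_{iAB}^{eff}$ for each restraint is calculated using the following equation:

$$ d_{iAB}^{eff} = \left( \sum_{m_{iA}=1}^{N_{atoms}} \; \sum_{k=1}^{N_{res}} \; \sum_{n_{kB}=1}^{N_{atoms}} \frac{1}{d_{m_{iA} n_{kB}}^{6}} \right)^{-1/6} $$

where $N_{atoms}$ denotes the number of atoms of the residue concerned.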
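As a worked illustration of the effective distance, the sketch below assumes the $r^{-6}$ sum-averaged form used for AIRs in HADDOCK (Dominguez et al., 2003). The function name and input layout are hypothetical.

```python
import math


def effective_distance(active_residue_atoms, partner_residues):
    """Sum-averaged effective distance, d_eff = (sum of d^-6)^(-1/6).

    active_residue_atoms: coordinates of all atoms of active residue i (protein A)
    partner_residues: list of residues of protein B (active and passive),
                      each given as a list of atom coordinates
    """
    total = 0.0
    for atom_a in active_residue_atoms:
        for residue in partner_residues:
            for atom_b in residue:
                total += math.dist(atom_a, atom_b) ** -6
    return total ** (-1.0 / 6.0)
```

Because of the $r^{-6}$ weighting, the effective distance is never larger than the shortest individual atom–atom distance, so a single close contact is enough to satisfy the restraint.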
We modified the rigid body docking stage to explicitly include water. We start by solvating the two chains using a box of TIP3P (Jorgensen et al., 1983) water. All waters closer than 4.0 Å to, or farther than 8.0 Å from, the protein are removed. A short molecular dynamics (MD) run is performed to optimize the water positions while keeping the proteins fixed (4000 MD steps consisting of four times 1000 steps at temperatures of 600, 500, 400 and 300 K, respectively). After that, all waters further away than 5.5 Å are removed. An ensemble of different solvation shells (typically 5) is generated by randomly rotating the protein before adding the solvation shell. We also experimented with the use of GRID (Boobbyer et al., 1989) to place the initial waters around the separate protein chains. The results of the subsequent docking did not depend much on the choice of the solvating method (data not shown). The solvated docking protocol itself is presented in the Results section.
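The shell-trimming steps can be sketched as a distance filter. This is an illustrative sketch only: it assumes the initial cut-off range means keeping waters between 4.0 and 8.0 Å from the nearest protein atom, the MD optimization itself is not shown, and all names are hypothetical.

```python
import math

# Annealing schedule for the water optimization: (temperature in K, MD steps)
ANNEALING_SCHEDULE = [(600, 1000), (500, 1000), (400, 1000), (300, 1000)]


def distance_to_protein(water, protein_atoms):
    """Distance from a water oxygen to the nearest protein atom."""
    return min(math.dist(water, atom) for atom in protein_atoms)


def trim_shell(waters, protein_atoms, lower=None, upper=None):
    """Keep waters whose distance to the protein lies within [lower, upper]."""
    kept = []
    for water in waters:
        d = distance_to_protein(water, protein_atoms)
        if (lower is None or d >= lower) and (upper is None or d <= upper):
            kept.append(water)
    return kept


def build_shell(box_waters, protein_atoms, optimise=lambda waters: waters):
    """Initial 4.0-8.0 Å trim, water optimization, then a final 5.5 Å trim.

    `optimise` is a placeholder for the MD run with fixed proteins.
    """
    shell = trim_shell(box_waters, protein_atoms, lower=4.0, upper=8.0)
    shell = optimise(shell)
    return trim_shell(shell, protein_atoms, upper=5.5)
```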
The standard semi-flexible refinement of HADDOCK consists of two rigid body simulated annealing stages followed by two simulated annealing stages with flexibility introduced first on side chains and then on backbone. For solvated docking we only used the latter two semi-flexible simulated annealing stages.
Non-bonded energies (sum of van der Waals and electrostatic terms) are calculated with an 8.5 Å distance cut-off using the OPLS non-bonded parameters (Jorgensen and Tirado-Rives, 1988) from the parallhdg5.3.pro parameter file (Linge et al., 2003b); the dielectric constant is set to 10.0 to damp the electrostatic contribution in vacuum. The overall score is calculated as a weighted sum of different terms, using the default HADDOCK 2.0 values for the weights (rigid body stage: EvdW 0.01, Eelec 1.0, EAIR 0.01, BSA 0.01, Edesolv 1.0; semi-flexible refinement: EvdW 1.0, Eelec 1.0, EAIR 0.1, BSA 0.01, Edesolv 1.0). Here vdW is van der Waals energy; elec, electrostatic energy; AIR, ambiguous interaction restraints; BSA, buried surface area; and desolv, desolvation energy. The desolvation energy is calculated using the atomic desolvation parameters of Fernandez-Recio et al. (2004). The various weights were obtained by a grid search to optimize scoring over the complexes tested so far, including CAPRI targets. These were optimized separately for the various stages of HADDOCK to reflect the various levels of complexity and refinement (from rigid body docking in vacuum to flexible refinement in explicit solvent).
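The weighted-sum score can be written out as a short sketch. The weights are copied as printed above; the dictionary keys and function name are illustrative. The sign with which the BSA term enters is an assumption here, since it cannot be checked against the text as extracted.

```python
# Stage-specific weights as listed in the text (keys are illustrative names).
WEIGHTS = {
    "rigid_body": {"vdw": 0.01, "elec": 1.0, "air": 0.01, "bsa": 0.01, "desolv": 1.0},
    "semi_flexible": {"vdw": 1.0, "elec": 1.0, "air": 0.1, "bsa": 0.01, "desolv": 1.0},
}


def weighted_score(terms, stage):
    """Weighted sum of the energy terms and buried surface area for one model.

    terms: dict with keys 'vdw', 'elec', 'air', 'bsa', 'desolv'
    stage: 'rigid_body' or 'semi_flexible'
    """
    weights = WEIGHTS[stage]
    return sum(weights[name] * terms[name] for name in weights)
```

For solvated docking, the van der Waals and electrostatic terms would also include the water–water and water–protein contributions.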
2.3 Test systems
We tested our protocol on 10 protein–protein complexes (Table 2). Note that only a limited number of complexes are suitable as test cases: the resolution should be high enough (≤2 Å) to give reliable positions for interfacial water molecules, and the free structures of the components of the complex should be available. We used all structures from the docking benchmark (Mintseris et al., 2005) satisfying those criteria and a few other complexes which we have tested before. For two of these, E2A–HPr (Wang et al., 2000) and cohesin–dockerin (Carvalho et al., 2003), we used experimental data available from the literature: NMR chemical shift perturbation data for E2A–HPr (Dominguez et al., 2003), and mutagenesis and conservation data as used previously for docking cohesin–dockerin, which was one of the targets in round 4 of CAPRI (van Dijk et al., 2005b). For the others, AIRs were defined based on the interface residues identified in the crystal structure; for those complexes, to simulate a more realistic case, 50% of the restraints were randomly removed for each docking trial. When free structures of the complex components were available (seven cases, Table 2), we performed unbound docking followed by semi-flexible refinement as well as bound docking. For cohesin–dockerin, bound–unbound docking was performed in addition to bound docking, and for the other two cases only bound docking was performed.
| 3 RESULTS |
|---|
Our solvated docking protocol is based on the physical concept that, in the first encounter complex, a water layer will be present between the two protein chains. To proceed from the encounter complex to the final structure, most of the interfacial waters have to be removed. Our protocol mimics this process by starting the docking from solvated molecules. Water is subsequently removed in a biased Monte Carlo procedure based on water-mediated contact propensities. The latter are obtained from an analysis of a database of high-resolution crystal structures of protein–protein complexes. In the following we will first describe the results of this analysis and then present our solvated docking protocol, demonstrating its feasibility for a number of protein–protein complexes.
3.1 Analysis of water-mediated contacts
In order to extract statistics of water-mediated contacts, we analysed the high-resolution structures (≤2.0 Å) in the non-redundant dataset of protein–protein interfaces of Keskin et al. (2004). The corresponding PDB IDs are provided in Supplementary Table 5. Some general statistics of our dataset are listed in Table 1.
In Figure 1, the fraction of water-mediated side chain and main chain contacts for all 20 × 20 amino acid combinations is shown. It is clear from this figure that preferences do exist for specific water-mediated contacts, information that should be useful in the modelling of protein–protein complexes by docking (see below). To assess the statistical significance of the fractions of water-mediated contacts, we compared the values obtained from the non-redundant filtered set with those obtained using the complete redundant set of structural homologues. Since these have a lower resolution, the derived fractions are lower than those from the filtered set (data not shown); there is, however, a clear correlation between the two datasets (R = 0.6). Nevertheless, the propensities reported here should be refined in the future by making use of the (rather slowly) increasing number of protein complexes deposited into the PDB.
To find out whether interfacial water molecules adopt specific, well-defined conformations, we clustered the water-mediated contacts based on pairwise RMSDs (for details see the Methods section and Supplementary Material). The rationale behind this analysis is that, if water molecules do adopt well-defined specific positions in an interface, one might be able to derive for each type of water-mediated contact a few preferred conformations (an analogy in protein structures would be the rotameric states of side chains). Such information might be useful in the modelling of water-mediated contacts. The clustering statistics are reported in Supplementary Table 7. Using a 1.5 Å clustering cut-off, almost 90% (118 out of 133) of the side chain contacts that could be clustered (133 out of 210) fall into one or two clusters (note that contacts for which fewer than two water-mediated instances were found could not be clustered at all). Figure 2 shows examples of clusters found for the most populated water-mediated contacts in the resolution-filtered Keskin dataset; in addition, the main backbone–backbone contact (O–H2O–O) and the best-clustering backbone–side chain contact (Ser side chain–N) are shown.
3.2 Solvated docking
Our solvated docking approach is based on the concept of the first encounter complex in which the proteins are separated by a hydration layer. Before docking, we solvate the protein chains with one hydration layer as described in the Methods section. Then, the conventional HADDOCK rigid body docking protocol is followed; for this, each protein and its associated solvation shell is considered as one rigid body. This results in an encounter complex with a water layer between the two protein chains. All non-interfacial water molecules are removed from this complex and the remaining waters, together with the protein chains, are treated as separate rigid bodies in a subsequent energy minimization stage (1000 EM steps were found to be sufficient for convergence). Water molecules are then removed in a biased Monte Carlo procedure: randomly chosen water molecules are probed for their closest amino acid residues on both chains; their probability to be kept is set equal to the observed fraction of water-mediated contacts for this specific amino acid combination as derived from the resolution-filtered Keskin set (see above). This procedure is repeated until only 25% of the initial interfacial water molecules remain. Subsequently, water molecules with an unfavourable interaction energy (sum of van der Waals and electrostatic water–protein energies >0.0 kcal/mol) are removed. Finally, the remaining waters and the protein chains are again subjected to a rigid body energy minimization (for an overview see Supplementary Figure 6). Note that we checked that the use of water-mediated propensities to bias water removal does lead to improvement compared to a simple random removal of waters.
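The biased Monte Carlo removal step can be sketched as below. This is a simplified illustration, not the HADDOCK implementation: each water is assumed to carry a precomputed pair of closest residue types and a precomputed water–protein interaction energy, and the trial limit is an added safeguard against propensities of 1.0. All names are hypothetical.

```python
import random


def biased_water_removal(waters, propensity, keep_fraction=0.25,
                         energy_cutoff=0.0, rng=None, max_trials=100_000):
    """Remove interfacial waters with a propensity-biased Monte Carlo scheme.

    waters: list of dicts with keys
        'pair'   - (residue type on chain A, residue type on chain B)
        'energy' - water-protein vdW + electrostatic energy in kcal/mol
    propensity: dict pair -> observed fraction of water-mediated contacts
    """
    rng = rng or random.Random()
    remaining = list(waters)
    target = int(keep_fraction * len(waters))
    trials = 0
    while len(remaining) > target and trials < max_trials:
        trials += 1
        idx = rng.randrange(len(remaining))          # pick a random water
        p_keep = propensity.get(remaining[idx]["pair"], 0.0)
        if rng.random() >= p_keep:                   # kept with probability p_keep
            del remaining[idx]
    # Energy criterion: drop waters with unfavourable interaction energy.
    return [w for w in remaining if w["energy"] <= energy_cutoff]
```

Because of the final energy filter, the number of waters returned can be lower than the 25% target.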
The number of retained waters at the end of our protocol is usually lower than 25% because of the energy criterion, typically between 10 and 20%. This fraction is roughly in accordance with a recent study (Rodier et al., 2005) where it was found that, on average, 90% of the interface waters are removed upon assembly. In fact, we observe a substantial variation in the final number of water molecules in the docked structures for the complexes that we used to test our protocol (see below, Table 4).
The solvated docking protocol as described above corresponds to the rigid body docking stage in HADDOCK. The resulting structures are then further refined using semi-flexible simulated annealing. Since water is introduced during rigid body docking we focus the discussion of our results on this stage, but we will also show some initial results for the semi-flexible refinement.
We tested our solvated docking approach on 10 complexes representing both wet and dry interfaces (Table 2). An accurate docking protocol accounting for the presence of water should not only be able to correctly position water molecules at the interface, thereby improving the docking results in the case of wet interfaces, but should also avoid retaining waters in dry interfaces in order not to deteriorate the docking results. Assessed by the number of fully buried water molecules, the α-amylase–αAI and barnase–barstar complexes are representative of wet interfaces, the PKC interacting protein complex represents a completely dry interface and most of the other complexes are in between. Only the E2A–HPr complex is an NMR structure for which no information on water positions is available.
The docking was performed using either the bound (B) structures from the complex or the unbound (U) structures; in the latter case rigid body docking was followed by flexible refinement. Experimental data (E) or interface residues (I) in the complex were used to define the AIRs, 50% of which were randomly discarded for each docking trial in the latter case (see the Methods section). Further details on these complexes and the information used to drive the docking can be found in the Methods section.
For each complex, two runs were performed: one reference run without water and one following our new solvated docking approach (see the Methods section). This was done for bound docking (using the bound structures of the components of the complex) and, if unbound structures were available, repeated for unbound docking.
The bound docking results are presented in Supplementary Table 8. Table 3 gives an overview of the unbound docking results, assessed by interface-RMSD (i-RMSD) to the target structure. The i-RMSD is defined as the backbone RMSD from the reference structure of the complex for those residues making contacts across the interface within a 10 Å cut-off [i-RMSDs below 2 and 4 Å are considered as medium quality and acceptable predictions, respectively, according to the CAPRI criteria (Mendez et al., 2005)]. As can be seen from Table 3, the inclusion of water in docking generally improves the scoring of the solutions. This is clear from the i-RMSD of the top ranking solution: for the solvated docking, this is in five cases a medium quality solution and in one case an acceptable solution, whereas for the unsolvated docking, this is in only two cases a medium quality solution and in one case an acceptable solution. In addition, the rank of the best-ranked medium quality solution is in most cases lower (i.e. better) for the solvated docking. Finally, the lowest RMSD found in all top 200 ranked structures is on average lower for the solvated docking. Note that scoring in our solvated docking protocol includes the water–water and water–protein non-bonded energy contributions, which clearly improves the performance (data not shown).
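The three measures used above to compare runs (quality of the top-ranked model, rank of the first medium quality model, lowest i-RMSD in the top 200) can be sketched as follows. The i-RMSD thresholds are those quoted in the text; the function names are hypothetical, and the sketch ignores the other CAPRI criteria (such as the fraction of native contacts) and the separate "high" quality class.

```python
def classify_by_irmsd(irmsd):
    """Quality label from i-RMSD alone (Å): <2 medium, <4 acceptable."""
    if irmsd < 2.0:
        return "medium"
    if irmsd < 4.0:
        return "acceptable"
    return "incorrect"


def summarise_ranking(irmsds_by_rank, top=200):
    """Summarise a scored docking run.

    irmsds_by_rank: i-RMSD values ordered by score, best-scoring model first.
    """
    models = irmsds_by_rank[:top]
    labels = [classify_by_irmsd(value) for value in models]
    best_medium_rank = labels.index("medium") + 1 if "medium" in labels else None
    return {
        "top_ranked": labels[0],
        "best_medium_rank": best_medium_rank,
        "lowest_irmsd": min(models),
    }
```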
After flexible refinement (Table 3) the same conclusions are valid, although the differences between solvated and unsolvated docking are smaller. For example, the unsolvated docking has four medium and one acceptable solutions and the solvated docking has five medium quality solutions. For the wet interfaces, a large fraction of the waters in our docking solutions have positions very close to those in the crystal (Fig. 3 and Supplementary Figures 7–9). These correspond to both fully buried waters and waters present at the rim of the interface. The results from the bound barnase–barstar docking are especially impressive, with 80% of the water molecules within 2 Å of crystal water positions. The distributions of distances between predicted and native waters in Figure 3 compare favourably with the results from Jiang et al. (2005); in that study, no docking was performed, but water positions at the interface were predicted from the crystal structures of a set of complexes. We also found that the quality of the water predictions does not change much after the semi-flexible refinement (Supplementary Figure 9). Note, however, that those are only preliminary results and the flexible refinement protocol needs further optimization.
We analysed the recovery of totally buried crystal water molecules over all acceptable (i-RMSD <4 Å) solutions out of the top 200 ranked models (Table 4 and Supplementary Table 9). On average, each docking solution contains between 6 and 12 water molecules (both buried and rim). Buried water molecules are generally more consistently recovered (i.e. found in a larger fraction of the solutions) than those at the rim of the interface (Fig. 4 and Supplementary Figures 9–11). On average, 94% of the buried crystal waters are recovered and each one is observed in 17% of the acceptable solutions. We find that those crystal waters that are not recovered are making most of their contacts with only one of the two components of the complex.
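The two recovery statistics quoted above (fraction of crystal waters recovered at least once, and the fraction of solutions in which each one is observed) can be computed along the following lines. This sketch assumes that all docking solutions have already been superimposed on the crystal structure; the distance cut-off used to call a water "recovered" is a parameter here because the exact value used for this analysis is an assumption.

```python
import math


def water_recovery(crystal_waters, solutions, cutoff=2.0):
    """Recovery statistics for crystal waters over a set of docking solutions.

    crystal_waters: list of crystal water-oxygen coordinates
    solutions: list of solutions, each a list of predicted water coordinates
    Returns (fraction of crystal waters found in at least one solution,
             list with, per crystal water, the fraction of solutions containing it).
    """
    frequencies = []
    for crystal in crystal_waters:
        hits = sum(
            1 for predicted_waters in solutions
            if any(math.dist(crystal, w) <= cutoff for w in predicted_waters)
        )
        frequencies.append(hits / len(solutions))
    recovered = sum(1 for f in frequencies if f > 0) / len(crystal_waters)
    return recovered, frequencies
```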
We also analysed the fraction of native water-mediated contacts recovered after flexible refinement: this is on average 30% for all acceptable structures, 46% for the highest-ranked acceptable structure and even 66% in the most favourable case. These are quite high fractions considering that on average, per structure, only 32% of the crystal waters are recovered within 4 Å. Those numbers are on average 25% smaller for rigid body docking solutions. As was already observed previously (van Dijk et al., 2005b), flexible refinement significantly improves the fraction of native contacts across the interface. In CAPRI, high/medium/acceptable-quality solutions require a fraction of native contacts of at least 50/30/10%, respectively.
Crystal waters are recovered not only in wet interfaces (e.g. α-amylase–αAI and barnase–barstar) but also, for example, in the case of 1gcq, where all four fully buried interface waters are found in several of the docking solutions [this complex shows the highest average fraction of structures in which crystal waters are observed (34%)]. For the dry PKC interacting protein, the water molecules in the resulting docked structures are placed mostly at the rim of the interface. The same applies to E2A–HPr. For the latter, however, we cannot compare their positions to experimental ones since the reference complex was solved by NMR. Although it somewhat decreased the number of acceptable solutions for that particular complex, explicit inclusion of water led to an improvement in the ranking and in the number of medium quality solutions, both before and after flexible refinement. Taken together, these results demonstrate the general applicability of our method.
Explicit inclusion of water molecules in our solvated docking protocol results in a factor 3 to 4 increase in computational time requirements for the rigid body docking stage. The most time-consuming part of HADDOCK is, however, the semi-flexible refinement stage, in which the presence of some additional water molecules does not make much difference. Explicit inclusion of water in docking thus only results in about a factor 2 increase in the overall run time, which is reasonable considering the improvements in both success rate and accuracy, and the fact that as a result water positions are predicted.
| 4 CONCLUSIONS AND PERSPECTIVE |
|---|
For the first time, water has been introduced explicitly in protein–protein docking. We followed for this purpose a strategy mimicking the concept of the solvated initial encounter complex. By performing the docking from solvated protein chains in combination with a Monte Carlo water removal procedure based on water contact propensities, we successfully recovered interfacial crystal water molecules and improved our docking results both in bound and unbound docking cases. Further improvements could be achieved by making use of the geometrical information obtained from the cluster analysis of water-mediated contacts.
The very promising results obtained here and the rather reasonable additional computational burden make us confident that solvated docking is a viable approach to model biomolecular complexes. We actually started applying solvated docking in the last two rounds of CAPRI (targets 25 and 26; see http://capri.ebi.ac.uk) but will have to wait for the release of the targets in order to assess its performance. Solvated docking should also benefit the field of protein–DNA modelling since it is well known that protein–DNA complexes have rather wet interfaces. We therefore intend to extend our approach to the modelling of such complexes, which, as we demonstrated recently, can be modelled successfully using HADDOCK (van Dijk et al., 2006).
| Acknowledgments |
|---|
This work was supported by a Jonge Chemici grant (grant no. 700.50.512) from The Netherlands Organization for Scientific Research (N.W.O.) to A.B. and by the European Community, FP6 STREP project ExtendNMR (contract no. LSHG-CT-2005-018988).
Conflict of Interest: none declared.
| FOOTNOTES |
|---|
Associate Editor: Alex Bateman
Received on May 10, 2005; revised on June 26, 2006; accepted on July 17, 2006
| REFERENCES |
|---|
Babor, M., et al. (2002) Conserved positions for ribose recognition: importance of water bridging interactions among ATP, ADP and FAD–protein complexes. J. Mol. Biol., 323, 523–532.
Bode, W., et al. (1989) The refined 2.0 Å X-ray crystal-structure of the complex formed between bovine beta-trypsin and Cmti-I, a trypsin-inhibitor from Squash seeds (Cucurbita-Maxima) – topological similarity of the Squash seed inhibitors with the carboxypeptidase A inhibitor from potatoes. FEBS Lett., 242, 285–292.
Bompard-Gilles, C., et al. (1996) Substrate mimicry in the active center of a mammalian alpha-amylase: structural analysis of an enzyme–inhibitor complex. Structure, 4, 1441–1452.
Boobbyer, D.N.A., et al. (1989) New hydrogen-bond potentials for use in determining energetically favorable binding-sites on molecules of known structure. J. Med. Chem., 32, 1083–1094.
Brunger, A.T., et al. (1998) Crystallography and NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D, 54, 905–921.
Buckle, A.M., et al. (1994) Protein–protein recognition – crystal structural-analysis of a Barnase–Barstar complex at 2.0-Angstrom resolution. Biochemistry, 33, 8878–8889.
Carugo, O. (1999) Correlation between occupancy and B factor of water molecules in protein crystal structures. Protein Eng., 12, 1021–1024.
Carugo, O. and Bordo, D. (1999) How many water molecules can be detected by protein crystallography? Acta Crystallogr. D, 55, 479–483.
Carvalho, A.L., et al. (2003) Cellulosome assembly revealed by the crystal structure of the cohesin-dockerin complex. Proc. Natl Acad. Sci. USA, 100, 13809–13814.
Chandler, D. (2005) Interfaces and the driving force of hydrophobic assembly. Nature, 437, 640–647.
Daura, X., et al. (1999) Peptide folding: when simulation meets experiment. Angew. Chem. Int. Ed., 38, 236–240.
Dominguez, C., et al. (2003) HADDOCK: a protein–protein docking approach based on biochemical or biophysical information. J. Am. Chem. Soc., 125, 1731–1737.
Fernandez-Recio, J., et al. (2004) Identification of protein-protein interaction sites from docking energy landscapes. J. Mol. Biol., 335, 843–865.
Halperin, I., et al. (2002) Principles of docking: An overview of search algorithms and a guide to scoring functions. Proteins, 47, 409–43.
Houborg, K., et al. (2003) Impact of the physical and chemical environment on the molecular structure of Coprinus cinereus peroxidase. Acta Crystallogr. D, 59, 989–996.
Hubbard, S.J. and Thornton, J.M. (1993) NACCESS. Department of Biochemistry and Molecular Biology, University College London.
Hubbard, S.J., et al. (1994) Intramolecular cavities in globular-proteins. Protein Eng., 7, 613–626.
Jiang, L., et al. (2005) A solvated rotamer approach to modeling water-mediated hydrogen bonds at protein–protein interfaces. Proteins, 58, 893–904.
Jorgensen, W.L. and Tirado-Rives, J. (1988) The OPLS potential functions for proteins. Energy minimizations for crystals of cyclic peptides and crambin. J. Am. Chem. Soc., 110, 1657–1666.
Jorgensen, W.L., et al. (1983) Comparison of simple potential functions for simulating liquid water. J. Chem. Phys., 79, 926–935.
Kalodimos, C.G., et al. (2004) Structure and flexibility adaptation in nonspecific and specific protein–DNA complexes. Science, 305, 386–389.
Keskin, O., et al. (2004) A new, structurally nonredundant, diverse data set of protein–protein interfaces and its implications. Protein Sci., 13, 1043–1055.
Ko, T.P., et al. (1999) The crystal structure of the DNase domain of colicin E7 in complex with its inhibitor Im7 protein. Structure, 7, 91–102.
Lima, C.D., et al. (1997) Structure-based analysis of catalysis and substrate definition in the HIT protein family. Science, 278, 286–290.
Lindahl, E., et al. (2001) GROMACS 3.0: a package for molecular simulation and trajectory analysis. J. Mol. Model., 7, 306–317.
Linge, J.P., et al. (2003a) ARIA: automated NOE assignment and NMR structure calculation. Bioinformatics, 19, 315–316.
Linge, J.P., et al. (2003b) Refinement of protein structures in explicit solvent. Proteins, 50, 496–506.
Lo Conte, L., et al. (1999) The atomic structure of protein-protein recognition sites. J. Mol. Biol., 285, 2177–2198.
Loris, R., et al. (1999) Conserved water molecules in a large family of microbial ribonucleases. Proteins, 36, 117–134.
Mendez, R., et al. (2005) Assessment of CAPRI predictions in rounds 35 shows progress in docking procedures. Proteins, 60, 150169[CrossRef][ISI][Medline].
Mintseris, J., et al. (2005) Protein–protein docking benchmark 2.0: an update. Proteins, 60, 214–216.
Moitessier, N., et al. (2006) Docking of aminoglycosides to hydrated and flexible RNA. J. Med. Chem., 49, 1023–1033.
Mustata, G. and Briggs, J.M. (2004) Cluster analysis of water molecules in alanine racemase and their putative structural role. Protein Eng., 17, 223–234.
Nishida, M., et al. (2001) Novel recognition mode between Vav and Grb2 SH3 domains. EMBO J., 20, 2995–3007.
Osterberg, F., et al. (2002) Automated docking to multiple target structures: incorporation of protein mobility and structural water heterogeneity in AutoDock. Proteins, 46, 34–40.
Rarey, M., et al. (1999) The particle concept: placing discrete water molecules during protein-ligand docking predictions. Proteins, 34, 17–28.
Raschke, T.M. (2006) Water structure and interactions with protein surfaces. Curr. Opin. Struct. Biol., 16, 152–159.
Rashin, A.A., et al. (1986) Internal cavities and buried waters in globular proteins. Biochemistry, 25, 3619–3625.
Rejto, P.A. and Verkhivker, G.M. (1997) Mean field analysis of FKBP12 complexes with FK506 and rapamycin: implications for a role of crystallographic water molecules in molecular recognition and specificity. Proteins, 28, 313–324.
Robert, C.H. and Ho, P.S. (1995) Significance of bound water to local chain conformations in protein crystals. Proc. Natl Acad. Sci. USA, 92, 7600–7604.
Rodier, F., et al. (2005) Hydration of protein–protein interfaces. Proteins, 60, 36–45.
Schymkowitz, J.W.H., et al. (2005) Prediction of water and metal binding sites and their affinities by using the Fold-X force field. Proc. Natl Acad. Sci. USA, 102, 10147–10152.
Song, H.K. and Suh, S.W. (1998) Kunitz-type soybean trypsin inhibitor revisited: refined structure of its complex with porcine trypsin reveals an insight into the interaction between a homologous inhibitor from Erythrina caffra and tissue-type plasminogen activator. J. Mol. Biol., 275, 347–363.
Sreenivasan, U. and Axelsen, P.H. (1992) Buried water in homologous serine proteases. Biochemistry, 31, 12785–12791.
Takeuchi, Y., et al. (1991) Refined crystal-structure of the complex of subtilisin BPN' and Streptomyces subtilisin inhibitor at 1.8 Å resolution. J. Mol. Biol., 221, 309–325.
Tame, J.R.H., et al. (1996) The role of water in sequence-independent ligand binding by an oligopeptide transporter protein. Nat. Struct. Biol., 3, 998–1001.
van Dijk, A.D.J., et al. (2005a) Data-driven docking for the study of biomolecular complexes. FEBS J., 272, 293–312.
van Dijk, A.D.J., et al. (2005b) Data-driven docking: HADDOCK's adventures in CAPRI. Proteins, 60, 232–238.
van Dijk, A.D.J., et al. (2006) Information-driven protein-DNA docking using HADDOCK: it is a matter of flexibility. Nucleic Acids Res., 34, 3317–3325.
Verdonk, M.L., et al. (2005) Modeling water molecules in protein-ligand docking using GOLD. J. Med. Chem., 48, 6504–6515.
Wade, R.C. and Goodford, P.J. (1993) Further development of hydrogen-bond functions for use in determining energetically favorable binding-sites on molecules of known structure. 2. Ligand probe groups with the ability to form more than 2 hydrogen-bonds. J. Med. Chem., 36, 148–156.
Wade, R.C., et al. (1993) Further development of hydrogen-bond functions for use in determining energetically favorable binding-sites on molecules of known structure. 1. Ligand probe groups with the ability to form 2 hydrogen-bonds. J. Med. Chem., 36, 140–147.
Wang, G., et al. (2000) Solution structure of the phosphoryl transfer complex between the signal transducing proteins HPr and IIA(glucose) of the Escherichia coli phosphoenolpyruvate:sugar phosphotransferase system. EMBO J., 19, 5635–5649.
Yang, J.M. and Chen, C.C. (2004) GEMDOCK: a generic evolutionary method for molecular docking. Proteins, 55, 288–304.
Zhang, X.J. and Matthews, B.W. (1994) Conservation of solvent-binding sites in 10 crystal forms of T4-Lysozyme. Protein Sci., 3, 1031–1039.