Skip Navigation


Bioinformatics Advance Access originally published online on May 17, 2007
Bioinformatics 2007 23(14):1840-1842; doi:10.1093/bioinformatics/btm259
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/14/1840    most recent
btm259v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gront, D.
Right arrow Articles by Kolinski, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gront, D.
Right arrow Articles by Kolinski, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

T-Pile—a package for thermodynamic calculations for biomolecules

Dominik Gront * and Andrzej Kolinski

Warsaw University, Faculty of Chemistry, Pasteura 1 02-093 Warsaw, Poland

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 T-PILE PACKAGE
 3 EXAMPLE APPLICATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 

Summary: Molecular dynamics and Monte Carlo, usually conducted in canonical ensemble, deliver a plethora of biomolecular conformations. Proper analysis of the simulation data is a crucial part of biophysical and bioinformatics studies. Sequence alignment problem can be also formulated in terms of Boltzmann distribution. Therefore tools for efficient analysis of canonical ensemble data become extremely valuable. T-Pile package, presented here provides a user-friendly implementation of most important algorithms such as multihistogram analysis and reweighting technique. The package can be used in studies of virtually any system governed by Boltzmann distribution.

Availability: T-Pile can be downloaded from: http://biocomp.chem.uw.edu.pl/services/tpile. These pages provide a comprehensive tutorial and documentation with illustrative examples of applications.

Contact: dgront{at}chem.uw.edu.pl

Supplementary information: Supplementary data are available at Bioinformatics online.


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 T-PILE PACKAGE
 3 EXAMPLE APPLICATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Proper thermodynamic analysis of simulation results is crucial for understanding properties of biomolecules. Usually simulations are done in canonical or multicanonical ensembles. Canonical ensemble is more intuitive and easier to simulate. The drawback is that the partition function cannot be computed directly from a trajectory of states observed at a single temperature. Simulations at various temperatures can be combined by means of multihistogram algorithm (Ferrenberg and Swendsen, 1989; Gront et al., 2001) [or WHAM Weighted Histogram Analysis Method (Kumar et al., 1992)] introduced a few years ago. Despite the popularity of the method, there is no a publicly available program that implements it. The purpose of this work is to provide a flexible implementation that can be used in many applications.


    2 T-PILE PACKAGE
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 T-PILE PACKAGE
 3 EXAMPLE APPLICATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
The package has been implemented in Java language what makes it almost platform independent. Special care has been taken to avoid unnecessary overhead caused by the high-level programming language. We tested our implementation with the early-stage version of T-Pile written in C. The Java version presented in this Application note is practically as fast as the one written in C language. The Java implementation features high portability and stability of the package. The package contains three programs:

ACorr Computes various autocorrelation and cross-correlation functions, necessary for understanding a system's behavior. ACorr provides also the error analysis utilized by MultiHist.
MultiHist Computes the density of states from the energy observations.
StatPhys Calculates various properties of a molecule as canonical averages. It can read the density of states from a user-specified file or use the output from MultiHist. The other possibility is to use a user-defined partition function as an input.


    3 EXAMPLE APPLICATIONS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 T-PILE PACKAGE
 3 EXAMPLE APPLICATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
3.1 Modeling a canonical system
The GB1P peptide (C-terminal beta-hairpin from immunoglobulin binding domain of protein G, PDB code 2gb1 [PDB] ) has been extensively studied experimentally (Munoz et al., 1997) and theoretically (Kolinski et al., 1999). Here we used CABS lattice reduced model for studying the folding transition of GB1P. The details of CABS representation and its force field are given elsewhere (Kolinski, 2004). Conformational space was sampled by means of Parallel Tempering, also known as Replica Exchange Monte Carlo. In preliminary runs, we estimated the folding reduced temperature and optimized the set of temperatures for parallel tempering. To estimate statistical errors we performed 12 independent production runs; each for seven replicas, with 1 300 000 observations per replica. T-Pile can process one billion data entries in several minutes on a standard PC.

Figure 1 shows thermodynamic description of the GB1P peptide in the CABS representation: energy as a function of temperature (top-left), heat capacity (top-right). and a 2D P (E|T) conditional distribution (bottom). Highly cooperative, two-state folding is clearly visible.


Figure 1
View larger version (45K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. Mean energy and heat capacity for GB1P peptide (top-left and top-right, respectively) and a 2D histogram showing conditional probability of energy P(E|T) plotted as a function of reduced temperature. Colour version of this figure is available as Supplementary material online.

 
3.2 Statistical thermodynamics of suboptimal sequence alignments
In a pioneering work, Miyazawa (Miyazawa, 1995) proposed a modification of Needleman–Wunsch algorithm for global sequence alignment in which a single optimal solution is substituted by an ensemble of suboptimal alignments. A free parameter that is introduced into the alignment algorithm plays a role of temperature. Suboptimal alignments obey Boltzmann distribution with the negative score corresponding to ‘energy’. Probability of an alignment with score S is defined as follows:


Formula 1

(1)
where {Omega}(S) denotes the total number of alignments of score S and Formula is the partition function. It is also possible to estimate a residue–residue alignment probability. For this purpose the partition function Formula for all possible alignments between two sequences, partition function Formula for all possible partial alignments that end at aligned residues i and j, and Formula —a partition function for all partial alignments that starts with aligned residues i and j need to be computed. Then the probability PT (i, j) that residues i and j are aligned at temperature T is given by a formula:


Formula 2

(2)

This algorithm has been implemented in the PRAline tool from BioShell package (Gront and Kolinski, 2006). At T = 0, the result of Miyazawa algorithm corresponds to the optimal alignment from Needleman–Wunsch algorithm. With growing temperature, alignments undergo thermal fluctuation and become fuzzy (see Fig. 2). Higher stability of an alignment reflects higher local evolutionary conservation. It is a non-trivial task to find an optimal temperature for the evaluation of the alignment stability. Here, we propose to use the transition temperature as a reference point. The transition temperature corresponds to the peak of heat capacity. To compute Cv for alignments we use StatPhys program from T-Pile package. Figure 2 (top) presents the ‘heat capacity’ for 1b56 protein aligned (dashed line) and with a randomly shuffled sequence of itself. Probability maps (bottom panels) for these alignments (with 1bj7, 1p6pA and random) provide a quantitative measure of the alignments’ stability that could be extremely useful in comparative modeling.


Figure 2
View larger version (25K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 2. ‘Heat capacity’ for sequence alignments (upper panel) and probability maps (bottom row) showing how likely residue i from one sequence is aligned with residue j from the other sequence. The three figures are given in the order of decreasing homology from the left to the right: 1b57 – 1p6pA, 1b57 – 1bj7 and 1b57 – 1b57 randomly shuffled. Each probability map was calculated at transition temperature proper for a given pair of sequences: T = 3.18, T = 3.18 and T= 2.08, respectively. Colour version of this figure is available as Supplementary material online.

 


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 T-PILE PACKAGE
 3 EXAMPLE APPLICATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
The computational part of this work was done using the computer cluster at the Computing Center of the Department of Chemistry, University of Warsaw.

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Anna Tramontano

Received on April 13, 2007; revised on April 13, 2007; accepted on May 18, 2007

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 T-PILE PACKAGE
 3 EXAMPLE APPLICATIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 

    Ferrenberg AM, Swendsen RH. Optimized Monte Carlo data analysis. Phys. Rev. Lett. (1989) 63:1195.[CrossRef][Web of Science][Medline]

    Gront D, Kolinski A. BioShell – a package of tools for structural biology computations. Bioinformatics (2006) 22:621–622.[Abstract/Free Full Text]

    Gront D, et al. A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics. J. Chem. Phys. (2001) 115:1569–1574.[CrossRef]

    Kolinski A. Protein modeling and structure prediction with a reduced representation. Acta Biochim. Pol. (2004) 51:349–371.[Web of Science][Medline]

    Kolinski A, et al. Dynamics and thermodynamics of beta-hairpin assembly: insights from various simulation techniques. Biophys. J. (1999) 77:2942–2952.[Web of Science][Medline]

    Kumar S, et al. The weighted histogram analysis method for free-energy calculations on biomolecules. i. the method. J. Compu. Chem. (1992) 13:1011–1021.[CrossRef]

    Miyazawa S. A reliable sequence alignment method based on probabilities of residue correspondences. Protein Eng (1995) 8:999–1009.[Abstract/Free Full Text]

    Munoz V, et al. Folding dynamics and mechanism of [beta]-hairpin formation. Nature (1997) 390:196–199.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
D. Gront and A. Kolinski
Utility library for structural bioinformatics
Bioinformatics, February 15, 2008; 24(4): 584 - 585.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/14/1840    most recent
btm259v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gront, D.
Right arrow Articles by Kolinski, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gront, D.
Right arrow Articles by Kolinski, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?