Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Arvestad, L.
Right arrow Articles by Sennblad, B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Arvestad, L.
Right arrow Articles by Sennblad, B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 19 Suppl. 1 2003
Pages i7-i15
© 2003 Oxford University Press

Bayesian gene/species tree reconciliation and orthology analysis using MCMC

Lars Arvestad 2,*, Ann-Charlotte Berglund 1, Jens Lagergren 1 and Bengt Sennblad 2

1 SBC and Department of Numerical Analysis and Computing Science, KTH, SE-100 44, Stockholm
2 SBC and Center for Genomics and Bioinformatics, Karolinska Institutet, SE-171 77, Stockholm

Received on January 6, 2003 ; accepted on February 20, 2003

Motivation: Comparative genomics in general and orthology analysis in particular are becoming increasingly important parts of gene function prediction. Previously, orthology analysis and reconciliation has been performed only with respect to the parsimony model. This discards many plausible solutions and sometimes precludes finding the correct one. In many other areas in bioinformatics probabilistic models have proven to be both more realistic and powerful than parsimony models. For instance, they allow for assessing solution reliability and consideration of alternative solutions in a uniform way. There is also an added benefit in making model assumptions explicit and therefore making model comparisons possible. For orthology analysis, uncertainty has recently been addressed using parsimonious reconciliation combined with bootstrap techniques. However, until now no probabilistic methods have been available.

Results: We introduce a probabilistic gene evolution model based on a birth-death process in which a gene tree evolves ‘inside’ a species tree. Based on this model, we develop a tool with the capacity to perform practical orthology analysis, based on Fitch’s original definition, and more generally for reconciling pairs of gene and species trees. Our gene evolution model is biologically sound (Nei et al., 1997) and intuitively attractive. We develop a Bayesian analysis based on MCMC which facilitates approximation of an a posteriori distribution for reconciliations. That is, we can find the most probable reconciliations and estimate the probability of any reconciliation, given the observed gene tree. This also gives a way to estimate the probability that a pair of genes are orthologs. The main algorithmic contribution presented here consists of an algorithm for computing the likelihood of a given reconciliation. To the best of our knowledge, this is the first successful introduction of this type of probabilistic methods, which flourish in phylogeny analysis, into reconciliation and orthology analysis. The MCMC algorithm has been implemented and, although not yet being in its final form, tests show that it performs very well on synthetic as well as biological data. Using standard correspondences, our results carry over to allele trees as well as biogeography.

Contact: lottab,jensl{at}nada.kth.se, bengt.sennblad,lars.arvestad{at}sbc.su.se

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
H. Li, A. Coghlan, J. Ruan, L. J. Coin, J.-K. Heriche, L. Osmotherly, R. Li, T. Liu, Z. Zhang, L. Bolund, et al.
TreeFam: a curated database of phylogenetic trees of animal gene families
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D572 - D580.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
A. MUSHEGIAN
Protein content of minimal and ancestral ribosome
RNA, September 1, 2005; 11(9): 1400 - 1406.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
A. C. Driskell, C. Ane, J. G. Burleigh, M. M. McMahon, B. C. O'Meara, and M. J. Sanderson
Prospects for Building the Tree of Life from Large Sequence Databases
Science, November 12, 2004; 306(5699): 1172 - 1174.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. Bourgon, M. Delorenzi, T. Sargeant, A. N. Hodder, B. S. Crabb, and T. P. Speed
The Serine Repeat Antigen (SERA) Gene Family Phylogeny in Plasmodium: The Impact of GC Content and Reconciliation of Gene and Species Trees
Mol. Biol. Evol., November 1, 2004; 21(11): 2161 - 2171.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
C. R. Linder and L. H. Rieseberg
Reconstructing patterns of reticulate evolution in plants
Am. J. Botany, September 1, 2004; 91(10): 1700 - 1708.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.