Skip Navigation



Bioinformatics Advance Access published online on April 1, 2004

Bioinformatics, doi:10.1093/bioinformatics/bth211
Bioinformatics © Oxford University Press 2004; all rights reserved
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
20/13/2113    most recent
bth211v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Makarenkov, V.
Right arrow Articles by Lapointe, F.-J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Makarenkov, V.
Right arrow Articles by Lapointe, F.-J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Received April 4, 2003
Revised February 20, 2004
Accepted February 20, 2004

Article

A weighted least-squares approach for inferring phylogenies from incomplete distance matrices

Vladimir Makarenkov 1* François-Joseph Lapointe 2

1 Département d'Informatique, Université du Québec à Montréal, C.P. 8888, Succ. Centre-Ville, Montréal (Québec), Canada, H3C 3P8; Institute of Control Sciences, 65 Profsoyuznaya, Moscow 117806, Russia
2 Département de Sciences Biologiques, Université de Montréal, C.P. 6128, succ. Centre-ville, Montréal, Québec H3C 3J7, Canada

* To whom correspondence should be addressed. E-mail: makarenkov.vladimir{at}uqam.ca.


   Abstract

Motivation: The problem of phylogenetic inference from data sets including incomplete or uncertain entries is among the most relevant issues in systematic biology. In this paper, we propose a new method for reconstructing phylogenetic trees from partial distance matrices. The new method combines the usage of the four-point condition and the ultrametric inequality with a weighted least-squares approximation to solve the problem of missing entries. It can be applied to infer phylogenies from evolutionary data including some missing or uncertain information, for instance, when observed nucleotide or protein sequences contain gaps or missing entries.

Results: In a number of simulations involving incomplete data sets, the proposed method outperformed the well-known Ultrametric and Additive procedures. Generally, the new method also outperformed all the other competing approaches including Triangle and Fitch which is the most popular least-squares method for reconstructing phylogenies. We illustrate the usefulness of the introduced method by analyzing two well-known phylogenies derived from complete mammalian mtDNA sequences (see Cao et al., 1998, Reyes et al., 2000, and Li et al., 2001). Some interesting theoretical results concerning the NP-hardness of the ordinary and weighted least-squares fitting of a phylogenetic tree to a partial distance matrix are also established.

Availability: The T-Rex package (Makarenkov, 2001a) including this method is freely available for download at: http://www.fas.umontreal.ca/biol/casgrain/en/labo/t-rex.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
F. Cheng, S. Hartmann, M. Gupta, J. G. Ibrahim, and T. J. Vision
A hierarchical model for incomplete alignments in phylogenetic inference
Bioinformatics, March 1, 2009; 25(5): 592 - 598.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
O. Gauthier and F.-J. Lapointe
Seeing the Trees for the Network: Consensus, Information Content, and Superphylogenies
Syst Biol, April 1, 2007; 56(2): 345 - 355.
[Full Text] [PDF]


Home page
Syst BiolHome page
B. Holland, G. Conner, K. Huber, and V. Moulton
Imputing Supertrees and Supernetworks from Quartets
Syst Biol, February 1, 2007; 56(1): 57 - 67.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
A. Criscuolo, V. Berry, E. J. P. Douzery, and O. Gascuel
SDM: A Fast Distance-Based Approach for (Super)Tree Building in Phylogenomics
Syst Biol, October 1, 2006; 55(5): 740 - 755.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.