Bioinformatics Advance Access originally published online on February 22, 2005
Bioinformatics 2005 21(10):2447-2455; doi:10.1093/bioinformatics/bti342
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mapping genomegenome epistasis: a high-dimensional model
Department of Statistics, University of Florida Gainesville, FL 32611, USA
*To whom correspondence should be addressed.
Motivation: The proper development of any organ or tissue requires the coordinated expression of its underlying genes that can be located on different genomes present in an organism. For instance, each step in the development of seed for a higher plant is the consequence of gene interactions from the maternal, embryo and endosperm genomes.
Results: We present a multivariate statistical model for mapping quantitative trait loci (QTL) by incorporating two important aspects of seed development in plantsQTL interactions derived from different genomes, the maternal, embryo and endosperm, and genetic correlations among phenotypic traits expressed in different genome-specific tissues. This model, which has a high dimensionality, is constructed within the maximum-likelihood context based on a finite mixture model. The implementation of the expectationmaximization algorithm allows for the efficient estimation of QTL positions, their action and interaction effects and pleiotropic effects. The application of this high-dimensional model to a real rice dataset has validated its usefulness.
Conclusions: Our model was derived for self-pollinated plants, but it can be extended to cross-pollinated plants and to animals. With the burgeoning of genetic and genomic data, this high-dimensional model will have many implications for agricultural and evolutionary genetic research.
Availability: A package of software will be provided from the corresponding author upon request.
Contact: rwu{at}stat.ufl.edu