Bioinformatics Vol. 18 no. 90002 2002
Pages S241-S248
© 2002 Oxford University Press
Application of metabolomics to plant genotype discrimination using statistics and machine learning
1 Department of Computer Science,
University of Wales, Aberystwyth, Penglais, Aberystwyth, SY23 3DB, UK
2 Max Planck Institute of Molecular Plant Physiology,
14424 Potsdam, Germany
Received on April 8, 2002
; accepted on June 15, 2002
Motivation: Metabolomics is a post genomic technology which seeks to provide a comprehensive profile of all the metabolites present in a biological sample. This complements the mRNA profiles provided by microarrays, and the protein profiles provided by proteomics. To test the power of metabolome analysis we selected the problem of discrimating between related genotypes of Arabidopsis. Specifically, the problem tackled was to discrimate between two background genotypes (Col0 and C24) and, more significantly, the offspring produced by the crossbreeding of these two lines, the progeny (whose genotypes would differ only in their maternally inherited mitichondia and chloroplasts).
Overview: A gas chromotographymass spectrometry (GCMS) profiling protocol was used to identify 433 metabolites in the samples. The metabolomic profiles were compared using descriptive statistics which indicated that key primary metabolites vary more than other metabolites. We then applied neural networks to discriminate between the genotypes. This showed clearly that the two background lines can be discrimated between each other and their progeny, and indicated that the two progeny lines can also be discriminated. We applied Euclidean hierarchical and Principal Component Analysis (PCA) to help understand the basis of genotype discrimination. PCA indicated that malic acid and citrate are the two most important metabolites for discriminating between the background lines, and glucose and fructose are two most important metabolites for discriminating between the crosses. These results are consistant with genotype differences in mitochondia and chloroplasts.
Keywords: Metabolome, Arabidopsis, Clustering
Contact: jat{at}aber.ac.uk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Vasquez-Robinet, S. P. Mane, A. V. Ulanov, J. I. Watkinson, V. K. Stromberg, D. De Koeyer, R. Schafleitner, D. B. Willmot, M. Bonierbale, H. J. Bohnert, et al. Physiological and molecular adaptations to drought in Andean potato genotypes J. Exp. Bot., May 1, 2008; 59(8): 2109 - 2123. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ulanov and J. M. Widholm Effect of the expression of cyanamide hydratase on metabolites in cyanamide-treated soybean plants kept in the light or dark J. Exp. Bot., December 1, 2007; 58(15-16): 4319 - 4332. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Missal, M. A. Cross, and D. Drasdo Gene network inference from incomplete expression data: transcriptional control of hematopoietic commitment Bioinformatics, March 15, 2006; 22(6): 731 - 738. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. S. Catchpole, M. Beckmann, D. P. Enot, M. Mondhe, B. Zywicki, J. Taylor, N. Hardy, A. Smith, R. D. King, D. B. Kell, et al. Hierarchical metabolomics demonstrates substantial compositional similarity between genetically modified and conventional potato crops PNAS, October 4, 2005; 102(40): 14458 - 14462. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Rodin and E. Boerwinkle Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels) Bioinformatics, August 1, 2005; 21(15): 3273 - 3278. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Sachs, O. Perez, D. Pe'er, D. A. Lauffenburger, and G. P. Nolan Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data Science, April 22, 2005; 308(5721): 523 - 529. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Broeckling, D. V. Huhman, M. A. Farag, J. T. Smith, G. D. May, P. Mendes, R. A. Dixon, and L. W. Sumner Metabolic profiling of Medicago truncatula cell cultures reveals the effects of biotic and abiotic elicitors on metabolism J. Exp. Bot., January 1, 2005; 56(410): 323 - 336. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Friedman Inferring Cellular Networks Using Probabilistic Graphical Models Science, February 6, 2004; 303(5659): 799 - 805. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. German, M.-A. Roberts, and S. M. Watkins Personal Metabolomics as a Next Generation Nutritional Assessment J. Nutr., December 1, 2003; 133(12): 4260 - 4266. [Abstract] [Full Text] [PDF] |
||||




