Bioinformatics Advance Access originally published online on June 2, 2005
Bioinformatics 2005 21(16):3385-3393; doi:10.1093/bioinformatics/bti526
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Biomarker discovery in microarray gene expression data with Gaussian processes
1Gatsby Computational Neuroscience Unit, University College London UK
2School of Biosciences, University of Birmingham UK
3Keck Graduate Institute of Applied Life Sciences Claremont, CA 91711, USA
*To whom correspondence should be addressed.
Motivation: In clinical practice, pathological phenotypes are often labelled with ordinal scales rather than binary, e.g. the Gleason grading system for tumour cell differentiation. However, in the literature of microarray analysis, these ordinal labels have been rarely treated in a principled way. This paper describes a gene selection algorithm based on Gaussian processes to discover consistent gene expression patterns associated with ordinal clinical phenotypes. The technique of automatic relevance determination is applied to represent the significance level of the genes in a Bayesian inference framework.
Results: The usefulness of the proposed algorithm for ordinal labels is demonstrated by the gene expression signature associated with the Gleason score for prostate cancer data. Our results demonstrate how multi-gene markers that may be initially developed with a diagnostic or prognostic application in mind are also useful as an investigative tool to reveal associations between specific molecular and cellular events and features of tumour physiology. Our algorithm can also be applied to microarray data with binary labels with results comparable to other methods in the literature.
Availability:The source code was written in ANSI C, which is accessible at www.gatsby.ucl.ac.uk/~chuwei/code/gpgenes.tar
Contact: wild{at}kgi.edu
Received on April 21, 2005; revised on May 19, 2005; accepted on May 31, 2005
This article has been cited by other articles:
![]() |
P. Sykacek, R. Clarkson, C. Print, R. Furlong, and G. Micklem Bayesian modelling of shared gene function Bioinformatics, August 1, 2007; 23(15): 1936 - 1944. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. C. Cawley and N. L. C. Talbot Gene selection in cancer classification using sparse logistic regression with Bayesian regularization Bioinformatics, October 1, 2006; 22(19): 2348 - 2355. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Gowri-Shankar and M. Rattray On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic Inference Mol. Biol. Evol., February 1, 2006; 23(2): 352 - 364. [Abstract] [Full Text] [PDF] |
||||

