Bioinformatics Advance Access originally published online on January 19, 2007
Bioinformatics 2007 23(9):1132-1140; doi:10.1093/bioinformatics/btm001
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A computational system to select candidate genes for complex human traits
1Curriculum in Genetics and Molecular Biologly, 2Bioinformatics and Computational Biology Training Program, Departments of 3Genetics and 4Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Identification of the genetic variation underlying complex traits is challenging. The wealth of information publicly available about the biology of complex traits and the function of individual genes permits the development of informatics-assisted methods for the selection of candidate genes for these traits.
Results: We have developed a computational system named CAESAR that ranks all annotated human genes as candidates for a complex trait by using ontologies to semantically map natural language descriptions of the trait with a variety of gene-centric information sources. In a test of its effectiveness, CAESAR successfully selected 7 out of 18 (39%) complex human trait susceptibility genes within the top 2% of ranked candidates genome-wide, a subset that represents roughly 1% of genes in the human genome and provides sufficient enrichment for an association study of several hundred human genes. This approach can be applied to any well-documented mono- or multi-factorial trait in any organism for which an annotated gene set exists.
Availability: CAESAR scripts and test data can be downloaded from http://visionlab.bio.unc.edu/caesar/
Contact: kgaulton{at}email.unc.edu
Associate Editor: John Quackenbush
Received on October 30, 2006; revised on January 2, 2007; accepted on January 8, 2007
This article has been cited by other articles:
![]() |
S. F. Saccone, N. L. Saccone, G. E. Swan, P. A. F. Madden, A. M. Goate, J. P. Rice, and L. J. Bierut Systematic biological prioritization after a genome-wide association study: an application to nicotine dependence Bioinformatics, August 15, 2008; 24(16): 1805 - 1811. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yu, S. Van Vooren, L.-C. Tranchevent, B. De Moor, and Y. Moreau Comparison of vocabularies, representations and ranking algorithms for gene prioritization by text mining Bioinformatics, August 15, 2008; 24(16): i119 - i125. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-C. Tranchevent, R. Barriot, S. Yu, S. Van Vooren, P. Van Loo, B. Coessens, B. De Moor, S. Aerts, and Y. Moreau ENDEAVOUR update: a web resource for gene prioritization in multiple species Nucleic Acids Res., July 1, 2008; 36(suppl_2): W377 - W384. [Abstract] [Full Text] [PDF] |
||||

