Skip Navigation



Bioinformatics Advance Access published online on November 15, 2007

Bioinformatics, doi:10.1093/bioinformatics/btm522
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
23/24/3280    most recent
btm522v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Nunkesser, R.
Right arrow Articles by Wegener, I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nunkesser, R.
Right arrow Articles by Wegener, I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Detecting High-Order Interactions of Single Nucleotide Polymorphisms Using Genetic Programming

Robin Nunkesser 1,2,*, Thorsten Bernholt 1,2, Holger Schwender 1,3, Katja Ickstadt 1,3 and Ingo Wegener 1,2

1Collaborative Research Center 475, University of Dortmund, Dortmund, Germany, 2 Department of Computer Science, University of Dortmund, Dortmund, Germany, 3 Department of Statistics, University of Dortmund, Dortmund, Germany

*To whom correspondence should be addressed. Robin Nunkesser, E-mail: robin.nunkesser{at}uni-dortmund.de


   Abstract

Motivation: Not individual single nucleotide polymorphisms (SNPs), but high-order interactions of SNPs are assumed to be responsible for complex diseases such as cancer. Therefore, one of the major goals of genetic association studies concerned with such genotype data is the identification of these high-order interactions. This search is additionally impeded by the fact that these interactions often are only explanatory for a relatively small subgroup of patients. Most of the feature selection methods proposed in the literature, unfortunately, fail at this task, since they can either only identify individual variables or interactions of a low order, or try to find rules that are explanatory for a high percentage of the observations. In this paper, we present a procedure based on genetic programming and multivalued logic that enables the identification of high-order interactions of categorical variables such as SNPs. This method called GPAS cannot only be used for feature selection, but can also be employed for discrimination.

Results: In an application to the genotype data from the GENICA study, an association study concerned with sporadic breast cancer, GPAS is able to identify high-order interactions of SNPs leading to a considerably increased breast cancer risk for different subsets of patients that are not found by other feature selection methods. As an application to a subset of the HapMap data shows, GPAS is not restricted to association studies comprising several ten SNPs, but can also be employed to analyze whole-genome data.

Availability: Software can be downloaded from http://ls2-www.cs.unidortmund.de/~nunkesser/#Software.

Contact: robin.nunkesser{at}uni-dortmund.de

Associate Editor: Prof. Martin Bishop


Received on July 12, 2007; revised on October 11, 2007; accepted on October 14, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
G. B. Fogel
Computational intelligence approaches for pattern discovery in biological systems
Brief Bioinform, July 1, 2008; 9(4): 307 - 316.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.