Skip Navigation



Bioinformatics Advance Access published online on June 22, 2007

Bioinformatics, doi:10.1093/bioinformatics/btm317
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
23/16/2113    most recent
btm317v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by McKinney, B. A.
Right arrow Articles by Moore, J. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by McKinney, B. A.
Right arrow Articles by Moore, J. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Evaporative cooling feature selection for genotypic data involving interactions

B. A. McKinney a,*, D. M. Reif b, B. C. White c, J. E. Crowe, Jr. d and J. H. Moore c

aDepartment of Genetics, University of Alabama School of Medicine, Birmingham, AL 35294
bNational Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711
cComputational Genetics Laboratory, Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756
dProgram in Vaccine Sciences, Departments of Microbiology and Immunology, and Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232

*to whom correspondence should be addressed. Dr. Brett McKinney, E-mail: bmckinney{at}genetics.uab.edu


   Abstract

Motivation: The development of genome-wide capabilities for genotyping has led to the practical problem of identifying the minimum subset of genetic variants relevant to the classification of a phenotype. This challenge is especially difficult in the presence of attribute interactions, noise, and small sample size.

Methods: Analogous to the physical mechanism of evaporation, we introduce an evaporative cooling (EC) feature selection algorithm that seeks to obtain a subset of attributes with the optimum information temperature (i.e., the least noise). EC uses an attribute quality measure analogous to thermodynamic free energy that combines Relief-F and mutual information to evaporate (i.e., remove) noise features, leaving behind a subset of attributes that contain DNA sequence variations associated with a given phenotype.

Methods: EC is able to identify functional sequence variations that involve interactions (epistasis) between other sequence variations that influence their association with the phenotype. This ability is demonstrated on simulated genotypic data with attribute interactions and on real genotypic data from individuals who experienced adverse events following smallpox vaccination. The EC formalism allows us to combine information entropy, energy and temperature into a single information free energy attribute quality measure that balances interaction and main effects.

Availability: Open source software, written in Java, is freely available upon request.

Associate Editor: Prof. Keith Crandall


Received on May 2, 2007; revised on May 2, 2007; accepted on June 6, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
D. J. Miller, Y. Zhang, G. Yu, Y. Liu, L. Chen, C. D. Langefeld, D. Herrington, and Y. Wang
An algorithm for learning maximum entropy probability models of disease risk that efficiently searches and sparingly encodes multilocus genomic interactions
Bioinformatics, October 1, 2009; 25(19): 2478 - 2485.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.