Skip Navigation


Bioinformatics Advance Access originally published online on January 28, 2009
Bioinformatics 2009 25(6):714-721; doi:10.1093/bioinformatics/btp041
This Article
Right arrow Full Text
Right arrow Full Text (Print PDF)
Right arrow Supplementary Data
Right arrow All Versions of this Article:
25/6/714    most recent
btp041v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Wu, T. T.
Right arrow Articles by Lange, K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wu, T. T.
Right arrow Articles by Lange, K.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Genome-wide association analysis by lasso penalized logistic regression

Tong Tong Wu 1, Yi Fang Chen 2, Trevor Hastie 2,3, Eric Sobel 4 and Kenneth Lange 4,5,*

1Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD 20742, 2Department of Statistics, 3Department of Biostatistics, Stanford University, Stanford, CA 94305, 4Department of Human Genetics and 5Department of Biomathematics, University of California, Los Angeles, CA 90095

*To whom correspondence should be addressed.


   Abstract

Motivation: In ordinary regression, imposition of a lasso penalty makes continuous model selection straightforward. Lasso penalized regression is particularly advantageous when the number of predictors far exceeds the number of observations.

Method: The present article evaluates the performance of lasso penalized logistic regression in case–control disease gene mapping with a large number of SNPs (single nucleotide polymorphisms) predictors. The strength of the lasso penalty can be tuned to select a predetermined number of the most relevant SNPs and other predictors. For a given value of the tuning constant, the penalized likelihood is quickly maximized by cyclic coordinate ascent. Once the most potent marginal predictors are identified, their two-way and higher order interactions can also be examined by lasso penalized logistic regression.

Results: This strategy is tested on both simulated and real data. Our findings on coeliac disease replicate the previous SNP results and shed light on possible interactions among the SNPs.

Availability: The software discussed is available in Mendel 9.0 at the UCLA Human Genetics web site.

Contact: klange{at}ucla.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Associate Editor: Alex Bateman


Received on September 29, 2008; revised on December 11, 2008; accepted on January 18, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.