Skip Navigation


Bioinformatics Advance Access originally published online on March 24, 2007
Bioinformatics 2007 23(12):1444-1450; doi:10.1093/bioinformatics/btm119
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
23/12/1444    most recent
btm119v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Google Scholar
Right arrow Articles by Ye, Z.-Q.
Right arrow Articles by Wei, L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ye, Z.-Q.
Right arrow Articles by Wei, L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Finding new structural and sequence attributes to predict possible disease association of single amino acid polymorphism (SAP)

Zhi-Qiang Ye 1, Shu-Qi Zhao 1, Ge Gao 1, Xiao-Qiao Liu 1, Robert E. Langlois 2, Hui Lu 2 and Liping Wei 1,*

1Center for Bioinformatics, National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing 100871, P. R. China and 2Bioinformatics Program, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, USA

*To whom correspondence should be addressed.


   Abstract

Motivation: The rapid accumulation of single amino acid polymorphisms (SAPs), also known as non-synonymous single nucleotide polymorphisms (nsSNPs), brings the opportunities and needs to understand and predict their disease association. Currently published attributes are limited, the detailed mechanisms governing the disease association of a SAP remain unclear and thus, further investigation of new attributes and improvement of the prediction are desired.

Results: A SAP dataset was compiled from the Swiss-Prot variant pages. We extracted and demonstrated the effectiveness of several new biologically informative attributes including the structural neighbor profiles that describe the SAP's microenvironment, nearby functional sites that measure the structure-based and sequence-based distances between the SAP site and its nearby functional sites, aggregation properties that measure the likelihood of protein aggregation and disordered regions that consider whether the SAP is located in structurally disordered regions. The new attributes provided insights into the mechanisms of the disease association of SAPs. We built a support vector machines (SVMs) classifier employing a carefully selected set of new and previously published attributes. Through a strict protein-level 5-fold cross-validation, we attained an overall accuracy of 82.61%, and an MCC of 0.60. Moreover, a web server was developed to provide a user-friendly interface for biologists.

Availability: The web server is available at http://sapred.cbi.pku.edu.cn/

Contact: sapred{at}mail.cbi.pku.edu.cn

Supplementary information: Supplementary data are available at http://sapred.cbi.pku.edu.cn/supp.do

Associate Editor: Alfonso Valencia


Received on December 21, 2006; revised on February 25, 2007; accepted on March 16, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
J. Song, H. Tan, K. Takemoto, and T. Akutsu
HSEpred: predict half-sphere exposure from protein sequences
Bioinformatics, July 1, 2008; 24(13): 1489 - 1497.
[Abstract] [Full Text] [PDF]


Home page
Hum ReprodHome page
T. D. Gallardo, G. B. John, K. Bradshaw, C. Welt, R. Reijo-Pera, P. H. Vogt, P. Touraine, S. Bione, D. Toniolo, L. M. Nelson, et al.
Sequence variation at the human FOXO3 locus: a study of premature ovarian failure and primary amenorrhea
Hum. Reprod., January 1, 2008; 23(1): 216 - 221.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.