
Bioinformatics 2007 23(2):e163-e169; doi:10.1093/bioinformatics/btl290

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Machine Learning in Computational Biology

Rediscovering secondary structures as network motifs—an unsupervised learning approach

Barak Raveh 1,2,*,†, Ofer Rahat 2,†, Ronen Basri 1 and Gideon Schreiber 2

1 Department of Computer Science & Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel
2 Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel

*To whom correspondence should be addressed.


   Abstract

Motivation: Secondary structures are key descriptors of a protein fold and its topology. In recent years, they have facilitated intensive computational tasks such as finding structural homologues, fold prediction and protein design. Their popularity stems from an appealing regularity in patterns of geometry and chemistry. However, the definition of secondary structures is subjective. An unsupervised de novo discovery of these structures would shed light on their nature and improve the way we use them in algorithms of structural bioinformatics.

Methods: We developed a new method for unsupervised partitioning of undirected graphs, based on patterns of small recurring network motifs. Our input was the network of all H-bonds and covalent interactions of protein backbones. This method can also be used for other biological and non-biological networks.
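
The general idea of partitioning an undirected graph by local motif patterns can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes, purely for illustration, a toy graph with one node per residue, covalent edges between consecutive residues, helix-like H-bonds between residues i and i+4, short simple cycles as the "motifs", and grouping of nodes with identical cycle counts in place of a real clustering step. All function names are hypothetical.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map from (u, v) pairs."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def motif_profile(adj, start, max_len=5):
    """Count simple cycles of length 3..max_len that pass through `start`.

    Returns a tuple (n_3, n_4, ..., n_max_len). Short cycles stand in
    for 'small network motifs' here; this is an illustrative assumption.
    """
    counts = defaultdict(int)

    def dfs(node, depth, visited):
        for nxt in adj[node]:
            if nxt == start and depth >= 3:
                counts[depth] += 1          # closed a cycle with `depth` nodes
            elif nxt not in visited and depth < max_len:
                visited.add(nxt)
                dfs(nxt, depth + 1, visited)
                visited.remove(nxt)

    dfs(start, 1, {start})
    # Every cycle is traversed once in each direction, so halve the counts.
    return tuple(counts[k] // 2 for k in range(3, max_len + 1))


def partition_by_motif_profile(adj, max_len=5):
    """Group nodes that share the same motif profile (a crude partition)."""
    groups = defaultdict(list)
    for node in sorted(adj):
        groups[motif_profile(adj, node, max_len)].append(node)
    return dict(groups)


def toy_backbone_edges():
    """Hypothetical 18-residue chain: helix-like residues 0-11, loop 12-17."""
    covalent = [(i, i + 1) for i in range(17)]
    h_bonds = [(i, i + 4) for i in range(8)]   # i -> i+4, helix-like pattern
    return covalent + h_bonds


if __name__ == "__main__":
    adj = build_graph(toy_backbone_edges())
    for profile, nodes in partition_by_motif_profile(adj).items():
        print(profile, nodes)
```

In this toy input the loop residues take part in no short cycle, so they fall into a single group with an all-zero profile, while the helix-like residues are separated from them by their nonzero cycle counts. A realistic implementation would work on the full backbone interaction network and would need a more robust motif definition and clustering step.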

Results: In a fully unsupervised manner, and without assuming any explicit prior knowledge, we were able to rediscover the existence of conventional α-helices, parallel β-sheets, anti-parallel sheets and loops, as well as various non-conventional hybrid structures. The relation between connectivity and crystallographic temperature factors establishes the existence of novel secondary structures.

Contact: barak.raveh@weizmann.ac.il; gideon.schreiber@weizmann.ac.il

†The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.







