Skip Navigation


Bioinformatics Advance Access originally published online on February 12, 2004
This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow All Versions of this Article:
20/9/1335    most recent
bth086v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (27)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Nagarajan, N.
Right arrow Articles by Yona, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nagarajan, N.
Right arrow Articles by Yona, G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics 20(9) © Oxford University Press 2004; all rights reserved.

Automatic prediction of protein domains from sequence information using a hybrid learning system

Niranjan Nagarajan and Golan Yona *

Department of Computer Science, Cornell University, Upson Hall, Ithaca, NY 14853, USA

Received on August 6, 2003; revised on December 7, 2003; accepted on December 12, 2003
Advance Access Publication February 12, 2004

Motivation: We describe a novel method for detecting the domain structure of a protein from sequence information alone. The method is based on analyzing multiple sequence alignments that are derived from a database search. Multiple measures are defined to quantify the domain information content of each position along the sequence and are combined into a single predictor using a neural network. The output is further smoothed and post-processed using a probabilistic model to predict the most likely transition positions between domains.

Results: The method was assessed using the domain definitions in SCOP and CATH for proteins of known structure and was compared with several other existing methods. Our method performs well both in terms of accuracy and sensitivity. It improves significantly over the best methods available, even some of the semi-manual ones, while being fully automatic. Our method can also be used to suggest and verify domain partitions based on structural data. A few examples of predicted domain definitions and alternative partitions, as suggested by our method, are also discussed.

Availability: An online domain-prediction server is available at http://biozon.org/tools/domains/

Contact: niranjan{at}cs.cornell.edu; golan{at}cs.cornell.edu

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
J. Cheng
DOMAC: an accurate, hybrid protein domain prediction server
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W354 - W356.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
H. Zhou, B. Xue, and Y. Zhou
DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile
Protein Sci., May 1, 2007; 16(5): 947 - 955.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. M. Phuong, C. B. Do, R. C. Edgar, and S. Batzoglou
Multiple alignment of protein sequences with repeats and rearrangements
Nucleic Acids Res., November 6, 2006; 34(20): 5932 - 5942.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Chen, W. Wang, S. Ling, C. Jia, and F. Wang
KemaDom: a web server for domain prediction using kernel machine with local context.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W158 - W163.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
V. Shepelev and A. Fedorov
Advances in the Exon-Intron Database (EID)
Brief Bioinform, June 1, 2006; 7(2): 178 - 185.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
T. Hondoh, A. Kato, S. Yokoyama, and Y. Kuroda
Computer-aided NMR assay for detecting natively folded structural domains
Protein Sci., April 1, 2006; 15(4): 871 - 883.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. E. Gewehr and R. Zimmer
SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles
Bioinformatics, January 15, 2006; 22(2): 181 - 187.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Birkland and G. Yona
BIOZON: a hub of heterogeneous biological data
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D235 - D242.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. K. Saini and D. Fischer
Meta-DP: domain prediction meta-server
Bioinformatics, June 15, 2005; 21(12): 2917 - 2920.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.