Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (25)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kim, J.
Right arrow Articles by Carlson, J. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kim, J.
Right arrow Articles by Carlson, J. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 16 no. 9 2000
Pages 767-775
© 2000 Oxford University Press


Original Paper

Identification of novel multi-transmembrane proteins from genomic databases using quasi-periodic structural properties

Junhyong Kim 1,2,3,****, Etsuko N. Moriyama 1, Coral G. Warr 3, Peter J. Clyne 3 and John R. Carlson 3

1 Department of Ecology and Evolutionary Biology,
2 Department of Statistics,
3 Department of Molecular, Cellular and Developmental Biology, Yale University, USA

Received on August 25, 1999 ; revised on April 14, 2000 ; accepted on May 10, 2000

Motivation: Identification of novel G protein-coupled receptors and other multi-transmembrane proteins from genomic databases using structural features.

Results: Here we describe a new algorithm for identifying multi-transmembrane proteins from genomic databases with a specific application to identifying G protein-coupled receptors (GPCRs) that we call quasi-periodic feature classifier (QFC). The QFC algorithm uses concise statistical variables as the ‘feature space’ to characterize the quasi-periodic physico-chemical properties of multi-transmembrane proteins. For the case of identifying GPCRs, the variables are then used in a non-parametric linear discriminant function to separate GPCRs from non-GPCRs. The algorithm runs in time linearly proportional to the number of sequences, and performance on a test dataset shows 96% positive identification of known GPCRs. The QFC algorithm also works well with short random segments of proteins and it positively identified GPCRs at a level greater than 90% even with segments as short as 100 amino acids. The primary advantage of the algorithm is that it does not directly use primary sequence patterns which may be subject to sampling bias. The utility of the new algorithm has been demonstrated by the isolation from the Drosophila genome project database of a novel class of seven-transmembrane proteins which were shown to be the elusive olfactory receptor genes of Drosophila.

Availability: C++/Perl available from http://jkim.eeb.yale.edu/index.html

Contact: Junhyong Kim, Dept. of Ecology and Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520-8106; junhyong.kim{at}yale.edu

Supplementary information: Test dataset and training dataset are available from http://jkim.eeb.yale.edu/index.html

**** To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
S. Guo and J. Kim
Molecular Evolution of Drosophila Odorant Receptor Genes
Mol. Biol. Evol., May 1, 2007; 24(5): 1198 - 1207.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
M. Wistrand, L. Kall, and E. L.L. Sonnhammer
A general model of G protein-coupled receptor sequences and its application to detect remote homologs
Protein Sci., March 1, 2006; 15(3): 509 - 521.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. M. Robertson, C. G. Warr, and J. R. Carlson
Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster
PNAS, November 25, 2003; 100(suppl_2): 14537 - 14542.
[Abstract] [Full Text]


Home page
ScienceHome page
C. A. Hill, A. N. Fox, R. J. Pitts, L. B. Kent, P. L. Tan, M. A. Chrystal, A. Cravchik, F. H. Collins, H. M. Robertson, and L. J. Zwiebel
G Protein-Coupled Receptors in Anopheles gambiae
Science, October 4, 2002; 298(5591): 176 - 178.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Sci.Home page
J. Kim and J. R. Carlson
Gene discovery by e-genetics: Drosophila odor and taste receptors
J. Cell Sci., March 15, 2002; 115(6): 1107 - 1112.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. R. Carlson
Functional expression of a Drosophila odor receptor
PNAS, July 31, 2001; 98(16): 8936 - 8937.
[Full Text] [PDF]


Home page
Chem SensesHome page
C. Warr, P. Clyne, M. de Bruyne, J. Kim, and J. R. Carlson
Olfaction in Drosophila: Coding, Genetics and e-Genetics
Chem Senses, February 1, 2001; 26(2): 201 - 206.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.