Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (9)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Tsai, J.
Right arrow Articles by Gerstein, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tsai, J.
Right arrow Articles by Gerstein, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 17 no. 10 2001
Pages 949-956
© 2001 Oxford University Press

Determining the minimum number of types necessary to represent the sizes of protein atoms

Jerry Tsai 1, Neil Voss 2 and Mark Gerstein 2,*

1 Department of Biochemistry & Biophysics, 103 Biochemistry/Biophysics Building, Texas A&M University, 2128 TAMU, College Station, TX 77843-2128, USA
2 Department of Molecular Biophysics and Biochemistry, Yale University, Bass Center, 266 Whitney Avenue, PO Box 208114, New Haven, CT 06520-8114, USA

Received on April 20, 2001 ; revised on July 9, 2001 ; accepted on July 9, 2001

Motivation: Traditionally, for packing calculations people have collected atoms together into a number of distinct ‘types’. These, in fact, often represent a heavy atom and its associated hydrogens (i.e. a united atom). Also, atom typing is usually done according to basic chemistry, giving rise to 20–30 protein atom types, such as carbonyl carbons, methyl groups, and hydroxyl groups. No one has yet investigated how similar in packing these chemically derived types are. Here we address this question in detail, using Voronoi volume calculations on a set of high-resolution crystal structures.

Results: We perform a rigorous clustering analysis with cross-validation on tens of thousands of atom volumes and attempt to compile them into types based purely on packing. From our analysis, we are able to determine a ‘minimal’ set of 18 atom types that most efficiently represent the spectrum of packing in proteins. Furthermore, we are able to uncover a number of inconsistencies in traditional chemical typing schemes, where differently typed atoms have almost the same effective size. In particular, we find that tetrahedral carbons with two hydrogens are almost identical in size to many aromatic carbons with a single hydrogen.

Availability: Programs available from http://geometry.molmovdb.org.

Contact: JerryTsai{at}TAMU.edu; neil.voss{at}yale.edu; Mark.Gerstein{at}yale.edu

Supplementary Information: Available at http://geometry.molmovdb.org.

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.