Bioinformatics Advance Access originally published online on August 1, 2008
Bioinformatics 2008 24(19):2236-2244; doi:10.1093/bioinformatics/btn405
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Genome-scale classification of metabolic reactions and assignment of EC numbers with self-organizing maps
1CQFB, REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, 2829-516 Caparica, 2CCMM, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016 Lisboa, Portugal and 3College of Chemistry and Chemical Engineering, Henan University, Kaifeng, 475001 China
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: The automatic perception of chemical similarities between metabolic reactions is required for a variety of applications ranging from the computer-aided validation of classification systems, to genome-scale reconstruction (or comparison) of metabolic pathways, to the classification of enzymatic mechanisms. Comparison of metabolic reactions has been mostly based on Enzyme Commission (EC) numbers, which are extremely useful and widespread, but not always straightforward to apply, and often problematic when an enzyme catalyzes several reactions, when the same reaction is catalyzed by different enzymes, when official full EC numbers are unavailable or when reactions are not catalyzed by enzymes. Different methods should be available to compare metabolic reactions. Simultaneously, methods are required for the automatic assignment of EC numbers to reactions still not officially classified.
Results: We have proposed the MOLMAP reaction descriptors to numerically encode the structural transformations resulting from a chemical reaction. Here, such descriptors are applied to the mapping of a genome-scale database of almost 4000 metabolic reactions by Kohonen self-organizing maps (SOMs), and its screening for inconsistencies in EC numbers. This approach allowed for the SOMs to assign EC numbers at the class, subclass and sub-subclass levels for reactions of independent test sets with accuracies up to 92, 80 and 70%, respectively. Different levels of similarity between training and test sets were explored. The approach also led to the identification of a number of similar reactions bearing differences at the EC class level.
Availability: The programs to generate MOLMAP descriptors from atomic properties included in SDF files are available upon request for evaluation.
Contact: jas{at}fct.unl.pt
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Martin Bishop
Received on April 3, 2008; revised on July 25, 2008; accepted on July 29, 2008
This article has been cited by other articles:
![]() |
Y. Yamanishi, M. Hattori, M. Kotera, S. Goto, and M. Kanehisa E-zyme: predicting potential EC numbers from the chemical transformation pattern of substrate-product pairs Bioinformatics, June 15, 2009; 25(12): i179 - i186. [Abstract] [Full Text] [PDF] |
||||
