Bioinformatics Advance Access originally published online on July 19, 2008
Bioinformatics 2008 24(18):2079-2085; doi:10.1093/bioinformatics/btn378
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Data-driven extraction of relative reasoning rules to limit combinatorial explosion in biodegradation pathway prediction
1Eawag, Swiss Federal Institute of Aquatic Science and Technology, CH-8600 Dübendorf, 2Institute of Biogeochemistry and Pollutant Dynamics (IBP), ETH Zurich, CH-8092 Zürich, Switzerland, 3Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, St. Paul, MN 55108, 4Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN 55455, USA and 5Institute for Informatics/I12, Technical University of Munich, Boltzmannstr. 3, D-85748 Garching bei München, Germany
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: The University of Minnesota Pathway Prediction System (UM-PPS) is a rule-based expert system to predict plausible biodegradation pathways for organic compounds. However, iterative application of these rules to generate biodegradation pathways leads to combinatorial explosion. We use data from known biotransformation pathways to rationally determine biotransformation priorities (relative reasoning rules) to limit this explosion.
Results: A total of 112 relative reasoning rules were identified and implemented. In one prediction step, i.e. as per one generation predicted, the use of relative reasoning decreases the predicted biotransformations by over 25% for 50 compounds used to generate the rules and by about 15% for an external validation set of 47 xenobiotics, including pesticides, biocides and pharmaceuticals. The percentage of correctly predicted, experimentally known products remains at 75% when relative reasoning is used. The set of relative reasoning rules identified, therefore, effectively reduces the number of predicted transformation products without compromising the quality of the predictions.
Availability: The UM-PPS server is freely available on the web to all users at the time of submission of this manuscript and will be available following publication at http://umbbd.msi.umn.edu/predict/.
Contact: kathrin.fenner{at}eawag.ch
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Thomas Lengauer
Received on October 16, 2007; revised on June 19, 2008; accepted on July 17, 2008
This article has been cited by other articles:
![]() |
J. Gao, L. B. M. Ellis, and L. P. Wackett The University of Minnesota Biocatalysis/Biodegradation Database: improving public access Nucleic Acids Res., September 18, 2009; (2009) gkp771v1. [Abstract] [Full Text] [PDF] |
||||
