Bioinformatics, Vol 15, 723-728, Copyright © 1999 by Oxford University Press
CF Allex, JW Shavlik and FR Blattner
MOTIVATION: Given inputs extracted from an aligned column of DNA bases and
the underlying Perkin Elmer Applied Biosystems (ABI) fluorescent traces,
our goal is to train a neural network to determine correctly the consensus
base for the column. Choosing an appropriate network input representation
is critical to success in this task. We empirically compare five
representations; one uses only base calls and the others include trace
information. RESULTS: We attained the most accurate results from networks
that incorporate trace information into their input representations. Based
on estimates derived from using 10- fold cross-validation, the best network
topology produces consensus accuracies ranging from 99.26% to >99.98%
for coverages from two to six aligned sequences. With a coverage of six, it
makes only three errors in 20 000 consensus calls. In contrast, the network
that only uses base calls in its input representation has over double that
error rate: eight errors in 20 000 consensus calls. CONTACT:
allex@cs.wisc.edu
ARTICLES
Neural network input representations that produce accurate consensus sequences from DNA fragment assemblies
Computer Sciences Department, University of Wisconsin - Madison, 1210 West Dayton Street, Madison, WI 53706, DNASTAR Inc., 1228 South Park Street, Madison, WI 53715, USA.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Amos Wilson, V. M. Pratt, A. Phansalkar, K. Muralidharan, W. E. Highsmith Jr, J. C. Beck, S. Bridgeman, E. M. Courtney, L. Epp, A. Ferreira-Gonzalez, et al. Consensus Characterization of 16 FMR1 Reference Materials: A Consortium Study J. Mol. Diagn., January 1, 2008; 10(1): 2 - 12. [Abstract] [Full Text] [PDF] |
||||
