Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Smith, T. M.
Right arrow Articles by Hood, L.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Smith, T. M.
Right arrow Articles by Hood, L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© Oxford University Press

Hopper: software for automating data tracking and flow in DNA sequencing

Todd M. Smith , Chris Abajian and Leroy Hood 1

Department of Molecular Biotechnology, Box 357730, University of Washington Seattle, WA 98195, USA

1To whom reprint requests should be addressed

MOTIVATION: Genome-scale DNA sequencing is a multistep process in which large numbers of small template clones are propagated, purified, sequenced and analyzed on acrylamide gels. A significant challenge to these projects is the scale at which the data handling must be done. Hence, large-scale sequencing facilities will benefit from tracking template DNA information (purification methods, reaction and electrophoresis conditions) in a systematic fashion. A lack of software tools that support automated sample entry, and automatic data storage, retrieval and analysis are a major hindrance to recording and using laboratory workflow information to monitor the overall quality of data production.

RESULTS: The UNIX file system has been used to prototype automation of the flow of data from the ABI sequencer to a data repository. Data are automatically processed by a central Perl program, Hopper, which runs a series of programs that analyze data quality (read length estimate, fraction of indeterminate bases, and number of contaminating and repetitive sequences), assemble shotgun sequence data, and generates simple reports describing the results.

AVAILABILITY: This software is freely available over the Internet on the WWW (http://www.genome.washington.edu/docs/hopper).

CONTACT: E-mail: T.M.Smith, soundbat{at}u.washington.edu


Received on August 26, 1996; accepted on November 20, 1996

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
D. Meldrum
Automation for Genomics, Part One: Preparation for Sequencing
Genome Res., August 1, 2000; 10(8): 1081 - 1092.
[Abstract] [Full Text]


Home page
Genome ResHome page
M. C. Wendl, S. Dear, D. Hodgson, and L. Hillier
Automated Sequence Preprocessing in a Large-Scale Sequencing Environment
Genome Res., September 1, 1998; 8(9): 975 - 984.
[Abstract] [Full Text]


Home page
Genome ResHome page
N. N. Dedhia and W. R. McCombie
Kaleidaseq: A Web-Based Tool to Monitor Data Flow in a High Throughput Sequencing Facility
Genome Res., March 1, 1998; 8(3): 313 - 318.
[Abstract] [Full Text]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.