Skip Navigation



Bioinformatics Advance Access published online on January 9, 2009

Bioinformatics, doi:10.1093/bioinformatics/btp010
This Article
Right arrow Advance Access manuscript (PDF)
Right arrow Supplementary Data
Right arrow All Versions of this Article:
25/4/458    most recent
btp010v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Warren, R. L
Right arrow Articles by Holt, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Warren, R. L
Right arrow Articles by Holt, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2009). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Profiling model T cell metagenomes with short reads

René L Warren 1,*, Brad H. Nelson 2 and Robert A. Holt 1

1BC Cancer Agency, Michael Smith Genome Sciences Centre, 675 West 10th Avenue, Vancouver, B.C., Canada, V5Z 1L3
2BC Cancer Agency, Deeley Research Centre, 2410 Lee Ave, Victoria, B.C., Canada, V8R 6V5

*To whom correspondence should be addressed. Mr. Rene Warren, E-mail: rwarren{at}bcgsc.ca


   Abstract

Motivation: T cell receptor (TCR) diversity in peripheral blood has not yet been fully profiled with sequence level resolution. Each T cell clonotype expresses a unique receptor, generated by somatic recombination of TCR genes and the enormous potential for T cell diversity makes repertoire analysis challenging. We developed a sequencing approach and assembly software (immuno-SSAKE or iSSAKE) for profiling T cell metagenomes using short reads from the massively parallel sequencing platforms.

Results: Models of sequence diversity for the TCR beta-chain CDR3 region were built using empirical data and used to simulate, at random, distinct TCR clonotypes at 1-20 parts per million. Using simulated TCRβ (sTCRβ) sequences, we randomly created 20 million 36nt reads having 1%-2% random error, 20 million 42nt or 50nt reads having 1% random error and 20 million 36nt reads with 1% error modeled on real short read data. Reads aligning to the end of known TCR variable (V) genes and having consecutive unmatched bases in the adjacent CDR3 were used to seed iSSAKE de novo assemblies of CDR3. With assembled 36nt reads we detect over 51% and 63% of rare (1 ppm) clonotypes using a random or modeled error distribution, respectively. We detect over 99% of more abundant clonotypes (6 ppm or higher) using either error distribution. Longer reads improve sensitivity, with assembled 42nt and 50nt reads identifying 82.0% and 94.7% of rare 1 ppm clonotypes, respectively. Our approach illustrates the feasibility of complete profiling of the TCR repertoire using new massively parallel short read sequencing technology.

Availability: ftp://ftp.bcgsc.ca/supplementary/iSSAKE.

Contact: rwarren{at}bcgsc.ca

Supplementary Information: Supplementary methods and data available at Bioinformatics online and at ftp://ftp.bcgsc.ca/supplementary/iSSAKE

Associate Editor: Prof. Alfonso Valencia


Received on September 26, 2008; revised on November 28, 2008; accepted on January 1, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
J. D. Freeman, R. L. Warren, J. R. Webb, B. H. Nelson, and R. A. Holt
Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencing
Genome Res., October 1, 2009; 19(10): 1817 - 1824.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.