Skip Navigation


Bioinformatics Advance Access originally published online on March 30, 2006
Bioinformatics 2006 22(12):1437-1439; doi:10.1093/bioinformatics/btl116
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
22/12/1437    most recent
btl116v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Google Scholar
Right arrow Articles by Zhang, Z.
Right arrow Articles by Gerstein, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zhang, Z.
Right arrow Articles by Gerstein, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@oxfordjournals.org

PseudoPipe: an automated pseudogene identification pipeline

Zhaolei Zhang 1, Nicholas Carriero 2, Deyou Zheng 3, John Karro 4, Paul M. Harrison 5 and Mark Gerstein 3,*

1 Banting and Best Department of Medical Research, Donnelly CCBR, University of Toronto 160 College Street, Toronto, ON M5S 3E1, Canada
2 Department of Computer Science, Yale University New Haven, CT 06520, USA
3 Department of Molecular Biophysics and Biochemistry New Haven, CT 06520, USA
4 Department of Biology 506 Wartik Pennsylvania State University, University Park PA 16802, USA
5 Department of Biology, McGill University Stewart Biology Building, 1205 Dr Penfield Avenue, Montreal, QC, H3A 1B1, Canada

*To whom correspondence should be addressed.

Motivation: Mammalian genomes contain many ‘genomic fossils’ i.e. pseudogenes. These are disabled copies of functional genes that have been retained in the genome by gene duplication or retrotransposition events. Pseudogenes are important resources in understanding the evolutionary history of genes and genomes.

Results: We have developed a homology-based computational pipeline (‘PseudoPipe’) that can search a mammalian genome and identify pseudogene sequences in a comprehensive and consistent manner. The key steps in the pipeline involve using BLAST to rapidly cross-reference potential "parent" proteins against the intergenic regions of the genome and then processing the resulting "raw hits" -- i.e. eliminating redundant ones, clustering together neighbors, and associating and aligning clusters with a unique parent. Finally, pseudogenes are classified based on a combination of criteria including homology, intron-exon structure, and existence of stop codons and frameshifts.

Availability: The PseudoPipe program is implemented in Python and can be downloaded at http://pseudogene.org/

Contact: Mark.Gerstein{at}yale.edu or zhaolei.zhang{at}utoronto.ca


Received on December 28, 2005; revised on March 1, 2006; accepted on March 22, 2006

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Gen Biol EvolHome page
M. Rho, M. Zhou, X. Gao, S. Kim, H. Tang, and M. Lynch
Independent Mammalian Genome Contractions Following the KT Boundary
Gen Biol Evol, June 22, 2009; 2009(0): 2 - 12.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F.-C. Chen, Y.-Z. Chen, and T.-J. Chuang
CNVVdb: a database of copy number variations across vertebrate genomes
Bioinformatics, June 1, 2009; 25(11): 1419 - 1421.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Y. K. Lam, E. Khurana, G. Fang, P. Cayting, N. Carriero, K.-H. Cheung, and M. B. Gerstein
Pseudofam: the pseudogene families database
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D738 - D743.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
Y.-T. Huang, F.-C. Chen, C.-J. Chen, H.-L. Chen, and T.-J. Chuang
Identification and analysis of ancestral hominoid transcriptome inferred from cross-species transcript and processed pseudogene comparisons
Genome Res., July 1, 2008; 18(7): 1163 - 1170.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. Zheng, A. Frankish, R. Baertsch, P. Kapranov, A. Reymond, S. W. Choo, Y. Lu, F. Denoeud, S. E. Antonarakis, M. Snyder, et al.
Pseudogenes in the ENCODE regions: Consensus annotation, analysis of transcription, and evolution
Genome Res., June 1, 2007; 17(6): 839 - 851.
[Abstract] [Full Text] [PDF]


Home page
Genes Dev.Home page
K. V. Prasanth and D. L. Spector
Eukaryotic regulatory RNAs: an answer to the 'genome complexity' conundrum
Genes & Dev., January 1, 2007; 21(1): 11 - 42.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
F.-C. Chen, C.-J. Chen, W.-H. Li, and T.-J. Chuang
Human-specific insertions and deletions inferred from mammalian genome sequences
Genome Res., January 1, 2007; 17(1): 16 - 22.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.