CoGenT++: an extensive and extensible data environment for computational genomics
1Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK
2Laboratory for Microbiology, Belgian Nuclear Research Center (SCK/CEN), Boeretang 200, Mol B-2400, Belgium
3Institute of Agrobiotechnology, National Center for Research and Technology, PO Box 361, Thermi, Thessaloniki GR-57001, Greece
4Laboratoire de Physique, Ecole Normale Supérieure, 46 Allée d'Italie, Lyon CEDEX 07 F-69364, France
5Transcription Networks Group, National Center for Biotechnology (CNB-CSIC), Cantoblanco, Madrid E-28049, Spain
6Sanger Institute, Wellcome Trust Campus, Cambridge CB10 1SA, UK
7Hospital for Sick Children, 555 University Avenue, Toronto, ON M5G 1X, Canada
8DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
*To whom correspondence should be addressed.
Motivation: CoGenT++ is a data environment for computational research in comparative and functional genomics, designed to address issues of consistency, reproducibility, scalability and accessibility.
Description: CoGenT++ facilitates the re-distribution of all fully sequenced and published genomes, storing information about species, gene names and protein sequences. We describe our scalable implementation of ProXSim, a continually updated all-against-all similarity database, which stores pairwise relationships between all genome sequences. Based on these similarities, derived databases are generated for gene fusions (AllFuse), putative orthologs (OFAM), protein families (TRIBES), phylogenetic profiles (ProfUse) and phylogenetic trees. Extensions based on the CoGenT++ environment include disease gene prediction, pattern discovery, automated domain detection, genome annotation and ancestral reconstruction.
Conclusion: CoGenT++ provides a comprehensive environment for computational genomics, intended primarily for large-scale analyses but also accessible for manual browsing.
Availability: The database and component downloads are accessible at http://cgg.ebi.ac.uk/cogentpp.html.
Contact: ouzounis{at}ebi.ac.uk
Received on April 6, 2005; revised on July 5, 2005; accepted on July 6, 2005


