Bioinformatics Vol. 17 no. 1 2001
Pages 44-57
© 2001 Oxford University Press
Original Paper |
Functional and structural genomics using PEDANT
1 GSF-Forschungszentrum
für Umwelt und Gesundheit, Munich
Information Center for Protein Sequences (MIPS) am
Max-Planck-Institut für Biochemie, Am
Klopferspitz 18, 82152 Martinsried, Germany
2 Biomax Informatics AG, Lochhamer
Straße 11, 82152 Martinsried, Germany
Received on April 28, 2000
; revised on June 23, 2000
; accepted on June 23, 2000
Motivation: Enormous demand for fast and accurate analysis of biological sequences is fuelled by the pace of genome analysis efforts. There is also an acute need in reliable up-to-date genomic databases integrating both functional and structural information. Here we describe the current status of the PEDANT software system for high-throughput analysis of large biological sequence sets and the genome analysis server associated with it.
Results: The principal features of PEDANT are: (i) completely automatic processing of data using a wide range of bioinformatics methods, (ii) manual refinement of annotation, (iii) automatic and manual assignment of gene products to a number of functional and structural categories, (iv) extensive hyperlinked protein reports, and (v) advanced DNA and protein viewers. The system is easily extensible and allows to include custom methods, databases, and categories with minimal or no programming effort. PEDANT is actively used as a collaborative environment to support several on-going genome sequencing projects. The main purpose of the PEDANT genome database is to quickly disseminate well-organized information on completely sequenced and unfinished genomes. It currently includes 80 genomic sequences and in many cases serves as the only source of exhaustive information on a given genome. The database also acts as a vehicle for a number of research projects in bioinformatics. Using SQL queries, it is possible to correlate a large variety of pre-computed properties of gene products encoded in complete genomes with each other and compare them with data sets of special scientific interest. In particular, the availability of structural predictions for over 300 000 genomic proteins makes PEDANT the most extensive structural genomics resource available on the web.
Availability: The PEDANT genome analysis server is available at http://pedant.mips.biochem.mpg.de.
Contact: Genome sequencing centres interested in inclusion of their sequences in the PEDANT database should contact Dmitrij Frishman (frishman{at}mips.biochem.mpg.de).
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Kasuga and N. L. Glass Dissecting Colony Development of Neurospora crassa Using mRNA Profiling and Comparative Genomics Approaches Eukaryot. Cell, September 1, 2008; 7(9): 1549 - 1564. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. Denham, P. N. Ward, and J. A. Leigh Lipoprotein Signal Peptides Are Processed by Lsp and Eep of Streptococcus uberis J. Bacteriol., July 1, 2008; 190(13): 4641 - 4647. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Tian, T. Kasuga, M. S. Sachs, and N. L. Glass Transcriptional Profiling of Cross Pathway Control in Neurospora crassa and Comparative Analysis of the Gcn4 and CPC1 Regulons Eukaryot. Cell, June 1, 2007; 6(6): 1018 - 1029. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Riley, T. Schmidt, I. I. Artamonova, C. Wagner, A. Volz, K. Heumann, H.-W. Mewes, and D. Frishman PEDANT genome database: 10 years online Nucleic Acids Res., January 12, 2007; 35(suppl_1): D354 - D357. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al. AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Clare, A. Karwath, H. Ougham, and R. D. King Functional bioinformatics for Arabidopsis thaliana Bioinformatics, May 1, 2006; 22(9): 1130 - 1136. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. V. V. Deevi and A. C. R. Martin An extensible automated protein annotation tool: standardizing input and output using validated XML Bioinformatics, February 1, 2006; 22(3): 291 - 296. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Sonke, S. Ernste, R. F. Tandler, B. Kaptein, W. P. H. Peeters, F. B. J. van Assema, M. G. Wubbolts, and H. E. Schoemaker L-Selective Amidase with Extremely Broad Substrate Specificity from Ochrobactrum anthropi NCIMB 40321 Appl. Envir. Microbiol., December 1, 2005; 71(12): 7961 - 7973. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Kasuga, J. P. Townsend, C. Tian, L. B. Gilbert, G. Mannhaupt, J. W. Taylor, and N. L. Glass Long-oligomer microarray profiling in Neurospora crassa reveals the transcriptional program underlying biochemical and physiological events of conidial germination Nucleic Acids Res., November 14, 2005; 33(20): 6469 - 6485. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Lu, D. Szafron, R. Greiner, D. S. Wishart, A. Fyshe, B. Pearcy, B. Poulin, R. Eisner, D. Ngo, and N. Lamb PA-GOSUB: a searchable database of model organism protein sequences with their predicted Gene Ontology molecular function and subcellular localization Nucleic Acids Res., January 1, 2005; 33(suppl_1): D147 - D153. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lai, N. Dey, C.-S. Kim, A. K. Bharti, S. Rudd, K. F.X. Mayer, B. A. Larkins, P. Becraft, and J. Messing Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome Genome Res., October 1, 2004; 14(10a): 1932 - 1937. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Szafron, P. Lu, R. Greiner, D. S. Wishart, B. Poulin, R. Eisner, Z. Lu, J. Anvik, C. Macdonell, A. Fyshe, et al. Proteome Analyst: custom predictions with explanations in a web-based tool for high-throughput proteome annotations Nucleic Acids Res., July 1, 2004; 32(suppl_2): W365 - W371. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Z. Berardini, S. Mundodi, L. Reiser, E. Huala, M. Garcia-Hernandez, P. Zhang, L. A. Mueller, J. Yoon, A. Doyle, G. Lander, et al. Functional Annotation of the Arabidopsis Genome Using Controlled Vocabularies Plant Physiology, June 1, 2004; 135(2): 745 - 755. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. del Val, A. Mehrle, M. Falkenhahn, M. Seiler, K.-H. Glatting, A. Poustka, S. Suhai, and S. Wiemann High-throughput protein analysis integrating bioinformatics and experimental assays Nucleic Acids Res., February 3, 2004; 32(2): 742 - 748. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. O. Glockner, M. Kube, M. Bauer, H. Teeling, T. Lombardot, W. Ludwig, D. Gade, A. Beck, K. Borzym, K. Heitmann, et al. Complete genome sequence of the marine planctomycete Pirellula sp. strain 1 PNAS, July 8, 2003; 100(14): 8298 - 8303. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Caetano-Anolles and D. Caetano-Anolles An Evolutionarily Structured Universe of Protein Architecture Genome Res., July 1, 2003; 13(7): 1563 - 1571. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Casadio, P. Fariselli, G. Finocchiaro, and P. L. Martelli Fishing new proteins in the twilight zone of genomes: The test case of outer membrane proteins in Escherichia coli K12, Escherichia coli O157:H7, and other Gram-negative bacteria Protein Sci., June 1, 2003; 12(6): 1158 - 1168. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Meyer, A. Goesmann, A. C. McHardy, D. Bartels, T. Bekel, J. Clausen, J. Kalinowski, B. Linke, O. Rupp, R. Giegerich, et al. GenDB--an open source genome annotation system for prokaryote genomes Nucleic Acids Res., April 15, 2003; 31(8): 2187 - 2195. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Mannhaupt, C. Montrone, D. Haase, H. W. Mewes, V. Aign, J. D. Hoheisel, B. Fartmann, G. Nyakatura, F. Kempken, J. Maier, et al. What's in the genome of a filamentous fungus? Analysis of the Neurospora genome sequence Nucleic Acids Res., April 1, 2003; 31(7): 1944 - 1954. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hou, G. E. Sims, C. Zhang, and S.-H. Kim A global representation of the protein fold space PNAS, March 4, 2003; 100(5): 2386 - 2390. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rudd, H.-W. Mewes, and K. F.X. Mayer Sputnik: a database platform for comparative plant genomics Nucleic Acids Res., January 1, 2003; 31(1): 128 - 132. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. M. Karlowski, H. Schoof, V. Janakiraman, V. Stuempflen, and K. F. X. Mayer MOsDB: an integrated information resource for rice genomics Nucleic Acids Res., January 1, 2003; 31(1): 190 - 192. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Frishman, M. Mokrejs, D. Kosykh, G. Kastenmuller, G. Kolesov, I. Zubrzycki, C. Gruber, B. Geier, A. Kaps, K. Albermann, et al. The PEDANT genome database Nucleic Acids Res., January 1, 2003; 31(1): 207 - 211. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Muller, R. M. MacCallum, and M. J.E. Sternberg Structural Characterization of the Human Proteome Genome Res., November 1, 2002; 12(11): 1625 - 1641. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lin, J. Qian, D. Greenbaum, P. Bertone, R. Das, N. Echols, A. Senes, B. Stenger, and M. Gerstein GeneCensus: genome comparisons in terms of metabolic pathway activity and protein family sharing Nucleic Acids Res., October 15, 2002; 30(20): 4574 - 4582. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J.K. Koo and J. B. Ohlrogge The Predicted Candidates of Arabidopsis Plastid Inner Envelope Membrane Proteins and Their Expression Profiles Plant Physiology, October 1, 2002; 130(2): 823 - 836. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. C. Sutcliffe and D. J. Harrington Pattern searches for the identification of putative lipoprotein genes in Gram-positive bacterial genomes Microbiology, July 1, 2002; 148(7): 2065 - 2077. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Frishman Knowledge-based selection of targets for structural genomics Protein Eng. Des. Sel., March 1, 2002; 15(3): 169 - 183. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Schoof, P. Zaccaria, H. Gundlach, K. Lemcke, S. Rudd, G. Kolesov, R. Arnold, H. W. Mewes, and K. F. X. Mayer MIPS Arabidopsisthaliana Database (MAtDB): an integrated biological knowledge resource based on the first complete plant genome Nucleic Acids Res., January 1, 2002; 30(1): 91 - 93. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Kawabata, S. Fukuchi, K. Homma, M. Ota, J. Araki, T. Ito, N. Ichiyoshi, and K. Nishikawa GTOP: a database of protein structures predicted from genome sequences Nucleic Acids Res., January 1, 2002; 30(1): 294 - 298. [Abstract] [Full Text] [PDF] |
||||










