Bioinformatics Advance Access originally published online on May 19, 2005
Bioinformatics 2005 21(14):3198-3199; doi:10.1093/bioinformatics/bti506
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Dasty and UniProt DAS: a perfect pair for protein feature visualization
1EMBL OutstationEuropean Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK
2Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SA, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
|
|
|---|
Summary: In this study, we present two freely available and complementary Distributed Annotation System (DAS) resources: a DAS reference server that provides up-to-date sequence and annotation from UniProt, with additional feature links and database cross-references from InterPro and a DAS client implemented using Java and Macromedia Flash that is optimized for the display of protein features.
Availability: The Dasty DAS client is freely available at the following URLs: Dasty DAS Client (http://www.ebi.ac.uk/das-srv/uniprot/dasty/), Dasty Example Query (http://www.ebi.ac.uk/das-srv/uniprot/dasty/index.jsp?ID=Q24488), UniProt DAS Service (http://www.ebi.ac.uk/das-srv/uniprot/das), Dasty Documentation (http://www.ebi.ac.uk/dasty/) and UniProt DAS Documentation (http://www.ebi.ac.uk/uniprot-das/). It is intended that the Dasty client package and source code will be available for download and local installation in the coming weeks. The UniProt DAS reference server and the associated documentation can be freely accessed via the EBI site at the given link.
Contact: For further details of the Dasty client, please contact N.V. (nvinod{at}ebi.ac.uk). For further details of the UniProt DAS server, please contact P.J. (pjones{at}ebi.ac.uk).
| 1 INTRODUCTION |
|---|
|
|
|---|
The Distributed Annotation System (DAS) (Dowell et al., 2001) specifies a lightweight protocol to allow the positional feature data for biological molecules to be requested using HTTP requests (i.e. a URL), with the response being returned as XML. The DAS specification was originally designed to allow the feature information to be served in relation to a genomic sequence, however, the DAS protocol can be leveraged to serve the protein sequence and feature information. DAS servers are separated into two kinds: reference servers provide sequence data and where appropriate scaffolding information and annotation servers that provide feature information only. An annotation server is required to provide coordinates for each feature that it serves and indicate a suitable DAS reference server from which the corresponding sequence can be obtained. A DAS client is an application that is able to connect to at least one reference server and one annotation server and merge the information from these servers in a unified display. The true power of this mechanism is that a DAS client is able to connect to any number of annotation servers, located anywhere in the world, serving annotation data from different laboratories (hence distributed).
| 2 DASTY: A DAS CLIENT OPTIMIZED FOR PROTEIN VISUALIZATION |
|---|
|
|
|---|
Dasty has been developed with protein DAS services in mind. It offers an attractive and powerful graphical user interface (Fig. 1). A user can request any protein identification from the available DAS servers. The resulting view is furnished with tools that allow the user to zoom by sequence coordinates and DAS tracks; sort the order of DAS tracks by feature type or score; view details of each features type, description, start and end coordinates, and score and follow any hyperlinks to additional information or external data resources. Depending on the type of the feature, Dasty allows features to be colour coded. At present such settings can be set by editing a configuration file, however, a user interface to this configuration is under development. Dasty has been developed using Java and Macromedia FlashTM. Dasty can be used in any modern Internet browser with the freely available Flash plugin installed.
|
The ability to view non-positional features available from UniProt DAS, such as protein synonyms and literature references, is currently being built into the client.
| 3 UniProt DAS PROTEIN REFERENCE SERVER |
|---|
|
|
|---|
UniProt DAS has been made available from the EBI since July 2004.
The server is able to access sequence and feature data from the UniProt data warehouse via the Aristotle API (Apweiler et al., 2004; Kretschmann et al., 2004), hence, it acts as both a reference and annotation server in one. Feature data available from UniProt DAS include all human annotations and feature predictions from Swiss-Prot and TrEMBL. In both cases, this information is anchored to the latest sequence data. UniProt DAS is synchronized with the fortnightly release of UniProt, providing up-to-date sequence and annotation.
In addition, the UniProt data warehouse provides secondary database cross-references from the InterPro (Mulder et al., 2005). UniProt DAS is able to serve these cross references as hyperlinks to the secondary database entry. The DAS response includes a brief description of the InterPro entry and where applicable, a note about the status of the annotation.
UniProt DAS provides several query options, including query by Swiss-Prot, TrEMBL, UniProt, IPI and UniParc accession numbers.
IPI is the International Protein Index (Kersey et al., 2004), a minimally redundant database of protein sequences that cross-references to Swiss-Prot, TrEMBL, H-Inv DB, RefSeq, TAIR and Ensembl. UniProt DAS has recently been augmented with the ability to retrieve sequence and cross-references from the IPI for all entries, including cross-references to the available primary sequence databases, the corresponding NCBI taxonomy ID and a link to any related Ensembl gene entries. UniProt DAS has been developed using the Java programming language and is based upon the Dazzle general purpose DAS server (Table 1).
|
| 4 CONCLUSIONS |
|---|
|
|
|---|
Dasty and UniProt DAS together provide a freely available and powerful resource for distributed protein sequence and annotation services. UniProt DAS opens all UniProt, IPI and related InterPro protein sequence and feature data to query via the DAS protocol.
| Acknowledgments |
|---|
This work has been supported by the European FP6 Programme, BioSapiens grant, contract number LHSG-CT-2003503265 and by the BBSRC ISPIDER grant.
Received on April 4, 2005; accepted on May 17, 2005
| REFERENCES |
|---|
|
|
|---|
Apweiler, R., et al. (2004) UniProt: the Universal Protein knowledgebase. Nucleic Acids Res., 32, D115D119
Dowell, R.D., et al. (2001) The Distributed Annotation System. BMC Bioinformatics, 2, 7[CrossRef][Medline].
Kersey, P.J., et al. (2004) The International Protein Index: an integrated database for proteomics experiments. Proteomics, 4, 19851988[CrossRef][Web of Science][Medline].
Kretschmann, E., Rakow, A., Hackmann, A., Apweiler, R. (2004) The Aristotle Semantic Network Technology. Proceedings of the Eighth World Multiconference on Systemics, Cybernetics and Informatics (SCI 2004) 13, , pp. 6570.
Mulder, N.J., et al. (2005) InterPro, progress and status in 2005. Nucleic Acids Res, 33, D201D205 (Database issue)
This article has been cited by other articles:
![]() |
N. Jimenez-Lozano, J. Segura, J. R. Macias, J. Vega, and J. M. Carazo aGEM: an integrative system for analyzing spatial-temporal gene-expression information Bioinformatics, October 1, 2009; 25(19): 2566 - 2572. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Blankenburg, R. D. Finn, A. Prlic, A. M. Jenkinson, F. Ramirez, D. Emig, S.-E. Schelhorn, J. Buch, T. Lengauer, and M. Albrecht DASMI: exchanging, annotating and assessing molecular interaction data Bioinformatics, May 15, 2009; 25(10): 1321 - 1328. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Kandasamy, S. Keerthikumar, R. Goel, S. Mathivanan, N. Patankar, B. Shafreen, S. Renuse, H. Pawar, Y. L. Ramachandra, P. K. Acharya, et al. Human Proteinpedia: a unified discovery resource for proteomics research Nucleic Acids Res., January 1, 2009; 37(suppl_1): D773 - D781. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Overton, C. A. J. van Niekerk, L. G. Carter, A. Dawson, D. M. A. Martin, S. Cameron, S. A. McMahon, M. F. White, W. N. Hunter, J. H. Naismith, et al. TarO: a target optimisation system for structural biology Nucleic Acids Res., July 1, 2008; 36(suppl_2): W190 - W196. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Siepen, K. Belhajjame, J. N. Selley, S. M. Embury, N. W. Paton, C. A. Goble, S. G. Oliver, R. Stevens, L. Zamboulis, N. Martin, et al. ISPIDER Central: an integrated database web-server for proteomics Nucleic Acids Res., July 1, 2008; 36(suppl_2): W485 - W490. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Amico, M. Finelli, I. Rossi, A. Zauli, A. Elofsson, H. Viklund, G. von Heijne, D. Jones, A. Krogh, P. Fariselli, et al. PONGO: a web server for multiple predictions of all-alpha transmembrane proteins. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W169 - W172. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Birney, D. Andrews, M. Caccamo, Y. Chen, L. Clarke, G. Coates, T. Cox, F. Cunningham, V. Curwen, T. Cutts, et al. Ensembl 2006 Nucleic Acids Res., January 1, 2006; 34(suppl_1): D556 - D561. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Jones, R. G. Cote, L. Martens, A. F. Quinn, C. F. Taylor, W. Derache, H. Hermjakob, and R. Apweiler PRIDE: a public repository of protein and peptide identifications for the proteomics community Nucleic Acids Res., January 1, 2006; 34(suppl_1): D659 - D663. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


