Bioinformatics Advance Access originally published online on August 11, 2008
Bioinformatics 2008 24(18):2119-2121; doi:10.1093/bioinformatics/btn387
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Dasty2, an Ajax protein DAS client



1Proteomics Services Team, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK, 2National Bioinformatics Network, NBN Central, Raapenberg Rd, Pinelands, 7405, Cape Town, South Africa, 3Faculty of Linguistics and Literary Sciences, University of Bremen, Bremen, Germany, 4External Services Team, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK and 5Integromics S.L., Madrid, Spain
*To whom correspondence should be addressed.
| ABSTRACT |
|---|
|
|
|---|
Summary: Dasty2 is a highly interactive web client integrating protein sequence annotations from currently more than 40 sources, using the distributed annotation system (DAS).
Availability: Dasty2 is an open source tool freely available under the terms of the Apache License 2.0, publicly available at http://www.ebi.ac.uk/dasty/
Contact: hhe{at}ebi.ac.uk
| 1 INTRODUCTION |
|---|
|
|
|---|
We have developed Dasty2, a lightweight, modular distributed annotation system (DAS) client for researchers who aim to visualize, share and compare protein sequence annotations using the DAS. DAS is a lightweight system for integrating data from multiple heterogeneous distributed databases (Dowell et al., 2001). DAS provides a client–server communication protocol to exchange biological annotation in XML format via HTTP requests. It allows integrating DNA or protein annotations from multiple sources in a web browser, rather than in large central databases. DAS allows annotations to be visualized in the DAS client while control over the original source remains with the data provider (Prlic et al., 2005). Unlike SPICE, a desktop Java-based DAS client for protein sequence and structure views (Prlic et al., 2005), Dasty2 is a web application that only requires a web browser. Compared to other web clients like Ensembl ProtView (Flicek et al., 2007), Dasty2 increases responsiveness and interactivity by providing a rich user experience and mimics the efficiency of desktop applications. Asynchronous loading and local caching of results enabled by AJAX improves usability and system response time (Garrett et al., 2005).
| 2 RESULTS |
|---|
|
|
|---|
Dasty2 retrieves, integrates and visualizes protein sequence information and facilitates rapid searches to find, share and compare annotations for the protein of interest. The graphical layout of Dasty2 is composed of panels which display different related protein information such as sequence details, positional annotations, literature references and the protein structure, if present in Protein Data Bank (PDB). Additionally it includes other panels which complement the user interface by providing functionality such as protein ID search and ontology-based filtering. Panels are interconnected, presenting an interactive display. For instance, the action of clicking on a positional feature highlights its coverage on the sequence and on the protein structure and it shows a pop-up window with more details about the feature. Dasty2 improves the rich user interface allowing users to expand and collapse sections, zoom and resize the graphic, display extra features, drag and drop feature lines, as well as to highlight and sort features.
The modular architecture of Dasty2 allows the inclusion of further panels to add functionality as well as the use of only selected panels; for instance, OntoDas (O'Neill et al., 2007) allows users to build conceptual queries using the gene ontology (GO). For this purpose an additional panel displaying ontology annotations and enabling query construction was created. As more modules are added the number of panels will increase. Figure 1 illustrates how the initial information is displayed. Some of the panels are:
|
2.1 Search panel
Dasty2 provides a text box to search for proteins using UniProt identifiers and accession numbers. The search can be restricted to a certain number of DAS servers classified in the DAS Registry by a specific label, for example a large, collaborative annotation project like Biosapiens (Valencia, 2005). The search panel includes a drop-down list of registered labels.
2.2 Status panel
Dasty2 typically queries dozens of servers in response to a user query. To optimize the user experience, data will be displayed as soon as the first server replies are received. A progress bar on the left shows the percentage of DAS servers that have been processed, while status, error and warning messages are displayed under the System information section on the right. The Annotation Sources Loaded section shows the status of each DAS source being loaded, with links to the raw XML data.
2.3 Sequence panel
Displays the protein identifier, amino acid sequence and length.
2.4 Configuration panel
Contains options to control the display of the positional features panel. Users can zoom into the sequence, or show/hide columns and annotations from specific DAS sources.
2.5 Positional features panel
Shows annotations which are located on specific regions of the protein, e.g. signal sequences and functional domains. Annotations are aligned against the protein sequence, allowing visual comparison of features, e.g. predicted signal sequences from SignalP (Bendtsen et al., 2004) versus experimentally determined signal sequences from UniProt. The feature colours are defined by the UniProt DAS style sheet, which provides details on how the annotations should be rendered. Separate columns show the feature type and source of a feature. On mouse-over, a pop-up window displays annotation details on the individual feature, usually also containing a link back to the original annotation source for additional detail. To support visual comparison, drag and drop can be used for manual reordering of features.
2.6 Non-positional features panel
Displays annotations with no positional co-ordinates, e.g. protein synonyms or literature references.
2.7 Filtering panel
Presents three tree menus allowing customization of the display by filtering positional and non-positional annotations by type, server name and category. The hierarchical organization of feature types allows efficient focus on annotation types of interest, by including/excluding whole annotation groups like structural annotation with a single mouse click.
2.8 Protein structure panel
Provides a 3D view of the protein structure whenever PDB structures are available for the UniProt ID requested. If Dasty2 finds more than one structure, it includes a drop-down list to choose among them. Users can choose three defined sizes to display the structure and have an option to view it in a pop-up window.
Technically, Dasty2 retrieves annotations mapped to protein sequence co-ordinates from remote DAS servers, taking as input UniProt identifiers or accession numbers (Apweiler et al., 2004). The client establishes an initial connection to the DAS Registry (Prlic et al., 2007) and retrieves a list of available DAS annotation servers. The client then connects to the UniProt DAS reference server (Jones et al., 2005) to retrieve the protein sequence, and then retrieves feature annotation from each DAS annotation server. Dasty2 merges the collected data from all these servers and provides the user with a unified, structured and interactive view of the sequence and its annotated features.
Dasty2 is based on a client–server architecture. The retrieval module on the server side combines annotations from multiple DAS servers and makes them available to the visualization module. The visualization module displays the information obtained from the retrieval module. The visualization module is based on Ajax and runs on all major web browsers. It has been thoroughly tested on Mozilla Firefox 1.5 and 2.0, and Windows® Internet Explorer 6.0 and 7.0. The retrieval module has been written in Perl (CGI) and runs on HTTP servers like Apache. Dasty2 can be accessed at http://www.ebi.ac.uk/dasty, but can also be downloaded and customized for third party web sites.
| 3 CONCLUSIONS AND FUTURE WORK |
|---|
|
|
|---|
Dasty2 is not just a report of annotations but an interactive display to conveniently manipulate and visualize protein sequence information. This facilitates the interaction between the user and the information stored in DAS servers. The use of Ajax technology improves its efficiency when displaying information as it delivers a richer user-experience; this technology also reduces the server workloads and increases the responsiveness of the application.
As the recognition of the DAS for provision of protein information increases, the load on the Dasty2 client increases through more DAS servers and more features per server. Currently, filtering for relevant results is performed on the client side after querying all available sources. Future versions of Dasty2 will include targeted querying facilities to reduce server and client workload, and increase client responsiveness.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Funding: FP6 Programme of the European Commission (LSHG-CT-2003-503265, BioSapiens); the European Bioinformatics Institute; the National Bioinformatics Network of South Africa; the Spanish Ministerio de Educacion y Ciencia.
Conflict of Interest: none declared.
| FOOTNOTES |
|---|
The authors wish it to be known that, in their opinion, the first three authors should be regarded as joint First Authors. Associate Editor: Jonathan Wren
Received on February 11, 2008; revised on July 21, 2008; accepted on July 23, 2008
| REFERENCES |
|---|
|
|
|---|
Apweiler R, et al. UniProt: the universal protein knowledgebase. Nucleic Acids Res. (2004) D115–D119.
Bendtsen JD, et al. Improved prediction of signal peptides: Signal P 3.0. J. Mol. Biol. (2004) 783–795.
Dowell RD, et al. The distributed annotation system. BMC Bioinformatics (2001) 7.
Flicek P, et al. Ensembl 2008. Nucleic Acids Res. (2007) D707–D714.
Garrett JJ. Ajax: a new approach to web applications. (2005) (last accessed, April 17, 2006). http://www.adaptivepath.com/ideas/essays/archives/000385.php.
Jones P, et al. Dasty and UniProt DAS: a perfect pair for protein feature visualisation. Bioinformatics (2005) 3198–3199.
O'Neill K, et al. OntoDas—integrating DAS with ontology-based queries. Bio-ontologies SIG, ISMB 2007 (2007).
Prlic A, et al. Adding some SPICE to DAS. Bioinformatics (2005) 21(Suppl. 2):ii40–ii41.[Abstract]
Prlic A, et al. Integrating sequence and structural biology with DAS. BMC Bioinformatics (2007) 333.
Valencia A. Research networks: BioSapiens: a European network for integrated genome annotation. Eur. J. Hum. Genet. (2005) 994–997.
This article has been cited by other articles:
![]() |
C. J. A. Sigrist, L. Cerutti, E. de Castro, P. S. Langendijk-Genevaux, V. Bulliard, A. Bairoch, and N. Hulo PROSITE, a protein domain database for functional characterization and annotation Nucleic Acids Res., October 25, 2009; (2009) gkp885v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Aranda, P. Achuthan, Y. Alam-Faruque, I. Armean, A. Bridge, C. Derow, M. Feuermann, A. T. Ghanbarian, S. Kerrien, J. Khadake, et al. The IntAct molecular interaction database in 2010 Nucleic Acids Res., October 22, 2009; (2009) gkp878v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Blankenburg, R. D. Finn, A. Prlic, A. M. Jenkinson, F. Ramirez, D. Emig, S.-E. Schelhorn, J. Buch, T. Lengauer, and M. Albrecht DASMI: exchanging, annotating and assessing molecular interaction data Bioinformatics, May 15, 2009; 25(10): 1321 - 1328. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A Reeves, D. Talavera, and J. M Thornton Genome and proteome annotation: organization, interpretation and integration J R Soc Interface, February 6, 2009; 6(31): 129 - 147. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hunter, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bork, U. Das, L. Daugherty, L. Duquenne, et al. InterPro: the integrative protein signature database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D211 - D215. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



