Skip Navigation


Bioinformatics Advance Access originally published online on November 30, 2004
Bioinformatics 2005 21(8):1745-1746; doi:10.1093/bioinformatics/bti170
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/8/1745    most recent
bti170v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Finak, G.
Right arrow Articles by Tang, Z.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Finak, G.
Right arrow Articles by Tang, Z.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2004. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oupjournals.org

BIAS: Bioinformatics Integrated Application Software

G. Finak , N. Godin , M. Hallett *, F. Pepin , Z. Rajabi , V. Srivastava and Z. Tang

McGill Centre for Bioinformatics 3775 University Street McGill University Montreal, Canada H3A 2B4

*To whom correspondence should be addressed.


    Abstract
 TOP
 Abstract
 REFERENCES
 

Motivation: We introduce a development platform especially tailored to Bioinformatics research and software development. BIAS (Bioinformatics Integrated Application Software) provides the tools necessary for carrying out integrative Bioinformatics research requiring multiple datasets and analysis tools. It follows an object-relational strategy for providing persistent objects, allows third-party tools to be easily incorporated within the system and supports standards and data-exchange protocols common to Bioinformatics.

Availability: BIAS is an OpenSource project and is freely available to all interested users at http://www.mcb.mcgill.ca/~bias/. This website also contains a paper containing a more detailed description of BIAS and a sample implementation of a Bayesian network approach for the simultaneous prediction of gene regulation events and of mRNA expression from combinations of gene regulation events.

Contact: hallett{at}mcb.mcgill.ca

There now exists a wide range of ontologies, standards, databases and applications available to the Bioinformatics community. However, the vast majority of these objects exist independently and are not tightly integrated into a single system. This situation makes it difficult for research groups to quickly and effectively use state-of-the-art Bioinformatics tools. Furthermore, it necessitates a great effort on their part to both integrate all the relevant objects together into a usable platform and to maintain this system over time. The slow progress towards a development framework for Bioinformatics is due to a complex set of reasons including the heterogeneity and diversity of biological data (Lacroix, 2002). Additionally, the development of fields such as systems biology bring with it a need to integrate multiple types of information within a coherent statistical model (McMillan, 2002; Haas et al., 2003; Segal et al., 2002). However, there remains a general lack of tools available to help the community to manage these complex analyses. Existing development platforms such as [Acero genomics knowledge platform (2003), http://www.acero.com and Rosetta resolver (2004), http://www.rosettabio.com/products/resolver/] are generally not offered at a price accessible to academic laboratories.

We make available a new development platform entitled BIAS (Bioinformatics Integrated Application Software). The mandate of BIAS is to provide infrastructure to integrate many different Bioinformatics objects together into one easy to use, easy to expand and computationally powerful system. Although the system currently has infrastructure for gene expression, transcription factor binding site and protein–protein interaction studies, BIAS is the seed of a Bioinformatics development framework. We have made solid, pragmatic design decisions and the OpenSource aspect of our system should encourage research groups to tailor this kernel to their specific needs rather than ‘re-creating the wheel’. We provide an intuition of the main components of BIAS. Figure 1 depicts an overview of Bias.



View larger version (30K):
[in this window]
[in a new window]
 
Fig. 1 Basic overview of BIAS.

 
Data sources: internal. The notion of an object-relational (OR) model is essentially an amalgamation of relational and object-oriented models [The apache DB project (2004), http://db.apache.org; Java 2 platform, enterprise edition (2003), http://java.sun.com/j2ee; Stonebraker et al. (1999)] that tries to provide the advantages of both systems. In particular, it allows programmers to work with a modern object-oriented programming environment while preserving the rich toolkit of reliable functionality offered for persistent storage by relational database management systems. BIAS offers a pragmatic OR model that maps objects in the object-oriented world to relations in the database and vice versa. When a programmer defines a new class in Java, the class is reflected as a table in the database. When an instance of the class is created, the instance is reflected as a tuple (row) in the corresponding table. The system guarantees that modifications to classes or instances of classes are automatically reflected in the database (and vice versa). The strengths of our system are its ease of use in comparison with other OR systems (http://dbapache.org and http://java.sun.com/j2ee), and the ease with which many important Bioinformatics objects can be integrated into the model.

Data sources: external. Any Bioinformatics software must consider questions regarding the importation and exportation of data from foreign sources. There are three ways with which this is achieved: (1) parsing external files, (2) distributed access queries and (3) via a Java API. (1) BIAS provides a factory of programs for data import/export and data quality control. These routines provide a skeleton which other research groups can tailor to their specific needs. (2) BIAS also provides tools for using distributed annotation systems (Stein, 2003). The purpose of distributed annotation systems is to facilitate the de-centralization of the curation process of large multiple data resources. When a user queries a DAS server, the server retrieves the relevant information from multiple servers. (3) BIAS provides routines for exploiting Java APIs offered by third party databases. Currently, it is possible to connect to Ensembl directly through a Java API. In all cases, the imported data are automatically placed into BIAS Java objects.

Libraries. One of the greatest strengths of BIAS is its ability to incorporate third party software as external libraries. This allows the developer to use the BIAS data model and routines but at the same time make use of high-quality, specialized software packages. We have integrated the statistical package R (and Bioconductor) due to its use in the analysis of gene expression data (R Development Core Team, 2004). BIAS provides a library of basic functions, data structures, graphical user interface libraries and algorithms that are commonly used in many Bioinformatics analyses. In fact, the functionality offered by the BioJava package (http://www.biojava.org) is incorporated within the system.

Modules, clients and servers. A module is an application written by a developer in BIAS. It can make use of the BIAS internal and external libraries, data source access routines and data model. Essentially, modules are the basic means by which developers add Bioinformatics functionality to the framework. Modules can be used as a stand-alone package where the end-user of the system (i.e. the biologist) need not necessarily know that the application is written in BIAS. When an end-user uses BIAS, they execute the client process on a machine of their choice. The client is responsible for all communication to and from the server. The server component acts as a middleware layer between the modules/clients and the data sources/libraries components of the system and receives requests from the clients. It is primarily responsible for accessing the BIAS libraries and data resources of a module. At present, BIAS remains a single-user system but development of a multi-user client/server architecture is underway.

Discussion and future directions. BIAS currently contains infrastructure for: (1) gene expression studies (cDNA, Agilent and Affymetrix) compliant with the MIAME standard; (2) handling complete genomes and annotations; (3) transcription factor binding site analyses; (4) genetic network inference; and (5) protein–protein interaction experiments. The most pressing needs to the underlying system are better graphic user interface classes, and more advanced support for distribution annotation systems. We plan to integrate BioPerl into BIAS and finish the client/server implementation in the immediate future. Recently, we have started to migrate our OR strategy to the HIBERNATE system (http://www.hibernate.org/).

Received on December 19, 2004; revised on June 21, 2004; accepted on November 18, 2004

    REFERENCES
 TOP
 Abstract
 REFERENCES
 

    Acero genomics knowledge platform. (2003) .

    Biojava. (2004) .

    Haas, L.M., Schwarz, P.M., Kodal, P., Kotlav, E., Rice, J.E., Swope, W.C. (2003) Discoverylink: a system for integrated access to life sciences data sources. IBM Syst. J., 40, 1–23.

    Java 2 platform, enterprise edition. (2003) 123–128.

    Lacroix, Z. (2002) Biological data integration: wrapping data and tools. IEEE Trans. Inform. Technol. Biomed., 6, .

    McMillan, S. (2002) The race to computerise biology. Economist Technol. Quart., 16–18.

    R Foundation for Statistical Computing R Development Core Team. (2004) R: A language and environment for statistical computing. , 3-900051-07-0 Vienna, Austria http://www.R-project.org.

    Rosetta resolver. (2004) .

    Segal, E., Barash, Y., Simon, I., Friedman, N., Koller, D. (2002) From promoter sequence to expression: a probabilistic framework. Proceedings of the Sixth International Conference on Computational BiologyApril 2002Washington, DC , NY ACM Press.

    Stein, L. (2003) Integrating biological databases. Nat. Rev. Genet., 4, , pp. 337–345[Web of Science][Medline].

    Stonebraker, M., Moore, D., Brown, P. Object-Relational DBMSs: Tracking the Next Great Wave, (1999) 2nd edn , San Francisco Morgan Kaufmann.

    The apache DB project. (2004) .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Cancer ResHome page
A. A.N. Rose, F. Pepin, C. Russo, J. E. Abou Khalil, M. Hallett, and P. M. Siegel
Osteoactivin Promotes Breast Cancer Metastasis to Bone
Mol. Cancer Res., October 1, 2007; 5(10): 1001 - 1014.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
B. Schade, S. H.L. Lam, D. Cernea, V. Sanguin-Gendreau, R. D. Cardiff, B. L. Jung, M. Hallett, and W. J. Muller
Distinct ErbB-2 Coupled Signaling Pathways Promote Mammary Tumors with Unique Pathologic and Transcriptional Profiles
Cancer Res., August 15, 2007; 67(16): 7579 - 7588.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/8/1745    most recent
bti170v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Finak, G.
Right arrow Articles by Tang, Z.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Finak, G.
Right arrow Articles by Tang, Z.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?