Skip Navigation


Bioinformatics Advance Access originally published online on April 3, 2008
Bioinformatics 2008 24(10):1300-1304; doi:10.1093/bioinformatics/btn113
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
24/10/1300    most recent
btn113v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Castrignanò, T.
Right arrow Articles by Pesole, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Castrignanò, T.
Right arrow Articles by Pesole, G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2008. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

ASPicDB: A database resource for alternative splicing analysis

T. Castrignanò 1, M. D’Antonio 1, A. Anselmo 2, D. Carrabino 1, A. D’Onorio De Meo 1, A. M. D’Erchia 3, F. Licciulli 4, M. Mangiulli 3, F. Mignone 2, G. Pavesi 2, E. Picardi 3, A. Riva 3,5, R. Rizzi 6, P. Bonizzoni 6 and G. Pesole 3,4,*

1Consorzio Interuniversitario per le Applicazioni di Supercalcolo per Università e Ricerca, CASPUR, Rome, 2University of Milan, Dipartimento di Scienze Biomolecolari e Biotecnologie, via Celoria 26, Milan 20133, 3University of Bari, Dipartimento di Biochimica e Biologia Molecolare, via Orabona, 4, Bari 70126, 4Istituto Tecnologie Biomediche del Consiglio Nazionale delle Ricerche, via Amendola 122/D, Bari 70126, Italy, 5Department of Molecular Genetics and Microbiology, University of Florida, PO Box 103610, Gainesville, FL 32610-3610, USA and 6DISCo, University of Milan Bicocca, via Bicocca degli Arcimboldi, 8, Milan, 20135, Italy

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 

Motivation: Alternative splicing has recently emerged as a key mechanism responsible for the expansion of transcriptome and proteome complexity in human and other organisms. Although several online resources devoted to alternative splicing analysis are available they may suffer from limitations related both to the computational methodologies adopted and to the extent of the annotations they provide that prevent the full exploitation of the available data. Furthermore, current resources provide limited query and download facilities.

Results: ASPicDB is a database designed to provide access to reliable annotations of the alternative splicing pattern of human genes and to the functional annotation of predicted splicing isoforms. Splice-site detection and full-length transcript modeling have been carried out by a genome-wide application of the ASPic algorithm, based on the multiple alignments of gene-related transcripts (typically a Unigene cluster) to the genomic sequence, a strategy that greatly improves prediction accuracy compared to methods based on independent and progressive alignments.

Enhanced query and download facilities for annotations and sequences allow users to select and extract specific sets of data related to genes, transcripts and introns fulfilling a combination of user-defined criteria. Several tabular and graphical views of the results are presented, providing a comprehensive assessment of the functional implication of alternative splicing in the gene set under investigation.

ASPicDB, which is regularly updated on a monthly basis, also includes information on tissue-specific splicing patterns of normal and cancer cells, based on available EST sequences and their library source annotation.

Availability: www.caspur.it/ASPicDB

Contact: graziano.pesole{at}biologia.uniba.it

Supplementary information: Supplementary data are available at Bioinformatics online.


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Alternative splicing and alternative initiation/termination of transcription have recently emerged as the major mechanisms responsible for the expansion of the transcriptome and proteome complexity in human and other organisms (Brett et al., 2002; Kopelman et al., 2005; Talavera et al., 2007). Although the functional potential of splicing variants has not been widely studied so far, several examples are known where they are involved in the creation of novel or specialized functions (Black, 2003; Blencowe, 2006; Lopez, 1998). In fact, recent experimental studies aimed at the characterization of human and mouse transcriptomes have revealed a remarkable heterogeneity in transcription initiation, and have also shown that alternative splicing is a widespread phenomenon affecting more than 60% of human genes (Matlin et al., 2005; Tress et al., 2007), an estimate constantly increasing from the 35% of the first genome-wide assessment (Mironov et al., 1999).

Splicing regulation, resulting from a combination of cis-elements and trans-acting factors, is thus a key mechanism to tune gene expression to a variety of conditions, while its dysfunction may often be at the basis of the onset of genetic diseases and cancer (Garcia-Blanco et al., 2004).

Most of the methods currently available for the investigation and prediction of gene splicing patterns are based on independent and progressive alignments of transcript data (mostly ESTs) to a genomic sequence. Since these methods present some limitations mostly due to the sequence errors frequently occurring in ESTs and to the repetitive structure of the genome sequence, we previously developed a novel methodology implemented in the ASPic algorithm and software that take into account these issues [see Methods and (Bonizzoni et al., 2005; Castrignano et al., 2006)].

Here we present ASPicDB (version 1.2, January 2008), a database resource for alternative splicing analysis that collects and provides access to the results of a genome-wide analysis of human genes carried out by the ASPic software. ASPicDB is regularly updated on a monthly basis with the latest releases of NCBI Entrez gene and Unigene databases.

A large variety of tools and databases is currently available for genome-wide investigation and prediction of alternative splicing in human and other organisms: ASD (Stamm et al., 2006), ECgene (Lee et al., 2007), ASAP2 (Kim et al., 2007), Hollywood (Holste et al., 2006), AceView (Thierry-Mieg and Thierry-Mieg, 2006). However, the data they present are often significantly discordant due to differences in the input data, in the algorithm adopted for splice sites prediction and transcript assembly [see (Bonizzoni et al., 2006) for further discussion] as well as on the level of stringency adopted to predict a splice site (e.g. occurrence of canonical donor/acceptor, number of supporting ESTs, quality of the alignment at the splice boundaries). ASPicDB combines the high-quality predictions produced by the ASPic algorithm with a powerful user-interface that offers several unique features. Enhanced retrieval tools allow for the definition of composite queries for the selection of subsets of genes, transcripts or introns related to specific features, for example introns belonging to the U2 or U12 class, or found in a specific mRNA localization (e.g. 5'UTR), or supported by a minimum number of ESTs, and so on. Furthermore, the database also includes information on tissue specific splicing in normal and cancer cells based on available expressed sequence tags and their source library annotations. Finally, ASPicDB provides download facilities for sequence extraction (e.g. sequence regions surrounding splice site boundaries featuring specific conditions) that could be used to assist bioinformatics analyses aimed at the detection of splicing regulatory elements.


    2 METHODS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
2.1 Data resources
ASPicDB is a relational database populated with the results obtained by the genome-wide application of the ASPic program, whose input consists of the genomic sequence corresponding to a specific gene and the collection of related expressed sequences, typically the expressed sequences contained in the relevant Unigene cluster. Only human genes for which a RefSeq NM curated transcript and a Unigene cluster were available were included in the database.

Genomic and expressed sequences used for the prediction were dynamically extracted from an ad hoc developed MySQL genomic platform, that includes data from the latest version of the human genome assembly (NCBI36) and coordinates resulting from BLAT mapping of RefSeq and Unigene entries updated on January 2008.

The genomic coordinates of each input gene were determined as the leftmost and rightmost mapping position of the relevant Unigene entries that also included the RefSeq sequences. In the case that more than one Unigene cluster was associated with a specific gene we selected the one containing the relevant gene-related RefSeq entries.

In order to investigate the differential expression of genes and alternative splicing isoforms in normal versus cancer tissues we annotated Unigene ESTs based on the standard eVOC ontology (Kelso et al., 2003). The eVOC database was also used as the source of expression pattern information. To avoid biases or inconsistencies in the statistical analysis of the expression pattern we excluded from the analysis the ESTs from normalized libraries and considered only tissues represented by more than 40 000 ESTs.

2.2 Computational method for alternative splicing prediction
ASPic implements an algorithm for splice site prediction that performs a multiple EST sequence comparison and alignment against the genomic sequence (Bonizzoni et al., 2005). The algorithm uses an optimization procedure to minimize the number of detected splice sites thus reducing the number of false splice predictions, a common problem with other methods as we previously showed in (Bonizzoni et al., 2006).

ASPic overcomes the limitations of programs based on independent pairwise alignment of ESTs against the genome that tend to predict artefactual splice sites (Bonizzoni et al., 2006) suggested by alternative individual EST alignments, mainly, because of the repetitive structure of genomic sequences and of sequencing errors frequently occurring in ESTs. Furthermore, a dynamic programming procedure is adopted for further refinement of the alignment at exon–intron boundaries, trying to reconcile, where possible, non-canonical splice sites to canonical ones (Bonizzoni et al., 2005), and taking into account the scoring matrices for donor, acceptor and branch sites of U2 and U12 introns derived from (Sheth et al., 2006). However, all 256 possible splice site pairs are acceptable and used in transcript assembly under the condition they are supported by at least two mRNA/ESTs and no mismatches are observed in a 15-bp long sequence upstream and downstream from the intron. The main result of the strategy of aligning spliced ESTs to the genome is a more reliable prediction of introns. Then, a suitable algorithm based on a directed acyclic graph (DAG) has been then designed to assemble spliced ESTs and predicted introns into a minimum set of non-mergeable transcripts which are also annotated with respect to the location of the coding sequence and to the presence of premature stop codon (Maquat, 2004), polyA signal and polyA tail (Zhang et al., 2005). The annotation of splicing variants is done using a RefSeq mRNA as reference (Pruitt et al., 2007).


    3 RESULTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
3.1 Database content
Table 1 reports some statistics on the data contained in the current version of ASPicDB (version 1.2, January 2008) which currently contains splicing predictions for 18 442 human genes. We estimated that over 91% of multi-exon genes may generate alternative isoforms and that each gene—on average—may generate about 12 different transcripts and 11 different proteins, most of them translated in frame with the RefSeq annotated protein. The resulting 10% of ‘untranslated’ isoforms includes those transcripts for which a reliable open reading frame (ORF) could not be annotated automatically (see the online documentation for the criteria used for ORF annotation).


View this table:
[in this window]
[in a new window]

 
Table 1. Statistics for ASPicDB (version 1.2, January 2008)

 
ASPicDB also contains information about cancer versus normal tissue specificity for 17 tissue types at both gene and splice site level (see an example in Supplementary Fig. 2): breast, cardiovascular system (e.g. heart), central nervous system (e.g. brain), gastrointestinal tract (e.g. colon, stomach), dermal system (e.g. skin), endocrine system (e.g. parathyroid, thyroid, pancreatic islet), hematological system (e.g. blood, bone marrow), liver and biliary system, musculoskeletal system (e.g. muscle), pancreas, peripheral nervous system (e.g. eye), ovary, placenta, uterus, prostate, respiratory system (e.g. lung), stomach, testis, urinary system (e.g. kidney, bladder).

3.2 Database mining
The database can be accessed through simple or advanced query interfaces. The simple query form allows the user to obtain the ASPic output for one or more genes selected according to one of their HGNC (e.g. NSMCE1), Unigene (e.g. Hs.481720), RefSeq (e.g. NM_152713 [GenBank] ), Entrez (e.g. 3248) or MIM (e.g. 202300) IDs (upper panel), or according to a keyword term (e.g. survivin), or to their associated Gene Ontology (GO) IDs (e.g. 0015248) or textual terms (e.g. ‘calcium ion transport’) belonging to the ‘biological process’, ‘molecular function’ or ‘cellular component’ categories (lower panel).

The advanced query form allows the user to search for: (1) genes; (2) transcripts or (3) splice sites, fulfilling different criteria (e.g. type of splicing event, type of donor/acceptor splice site, etc.). Depending on this choice three separate query forms appear.

The ‘Gene’ retrieval form has been designed to select genes fulfilling specific criteria (Fig. 1A) or showing a specific expression pattern in normal/tumor cells belonging to a given tissue (Fig. 1B).


Figure 1
View larger version (52K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. ASPic DB gene advanced query form. The form is split into two parts (A and B) to search for genes fulfilling one or more criteria (A) or showing a specific expression profile in normal/tumor cells of a given tissue (B).

 
Thus, users can select genes that are: (1) exclusively expressed in tumor cells (TT); (2) significantly more expressed in tumor cells (T); (3) equally expressed in tumor and normal cells (E); (4) significantly more expressed in normal cells (N) and (5) exclusively expressed in normal cells (NN). The classification is based on a simple {chi}2 statistic obtained from the comparison of observed and expected number of ESTs in normal and tumor tissues, calculated taking into account the relevant library sizes (for more information see the online Help documentation).

A transcript search can be performed in a similar fashion, with the additional possibility of selecting transcripts potentially targeted to nonsense mediated decay (NMD) for the presence of a premature termination codon (PTC) defined according to (Maquat, 2004) or with a polyA tail and a polyA signal, defined according to (Zhang et al., 2005).

Finally, the splice site retrieval form allows users to retrieve splice sites (introns) fulfilling one ore more criteria, like for example the type of donor/acceptor site, the intron class [U2, U12 or unclassifiable, according to Sheth et al., (2006)], the number of supporting ESTs (defining their reliability degree), and their specific overrepresentation in normal versus tumor cell of a specific tissue, based on the number and the source of supporting ESTs following (Wang et al., 2003).

3.3 Results output
After a ‘Gene’, ‘Transcript’ or ‘Splice site’ query has been completed a Result Table is shown (Supplementary Fig. 1) listing the genes (Supplementary Fig. 1A), transcripts (Supplementary Fig. 1B) or introns (Supplelmentary Fig. 1C) that satisfy the selected criteria. The table contains general information about the query results and provide hyperlinks to access the ASPic results and the expression pattern output at the gene (Supplementary Fig. 2A) or splice site (Supplementary Fig. 2B) level.

A more detailed description of the Aspic output can be found in the online documentation where some example screenshots are also shown.

Briefly, the output is organized in six sections:

  1. Gene Information reports a summary of the genomic and transcript data used by ASPIC to generate the prediction, downloadable by the user, and links to the results of other popular prediction programs [e.g. ASAP2 (Kim et al., 2007), ASTD (Stamm et al., 2006), AceView (Thierry-Mieg and Thierry-Mieg, 2006)].
  2. Gene Structure View provides a schematic graphical view of the gene structure including all predicted exons/introns, using different colors for RefSeq or novel exon/introns.
  3. Predicted Transcripts shows a graphical representation of the assembled transcripts with predicted annotations of 5'UTR, CDS and 3'UTR, PTC and polyA sites.
  4. Predicted Splice Sites shows the multiple sequence alignment between the genomic sequence and the expressed sequences (i.e. mRNAs and ESTs) near the boundaries (splice sites) of all predicted introns. Also included are the donor and acceptor predictive scores, the intron type (U2 or U12), and information on the score and position of the branching site, if available.
  5. Intron Table lists all predicted introns and their relevant features, including their chromosomal location, length, class and donor/acceptor sequences.
  6. Transcript Table lists the details of all predicted alternative transcripts including their length, number of exons, and presence of a predicted coding sequence. The ‘variant type’ columns lists all the alternative splicing events using a RefSeq mRNA as the reference transcript.

All results can also be downloaded by the user in the ‘gene transfer format’ (GTF) (see the Gene Information panel). The user can also download specific sets of sequences in FASTA format for further analyses, e.g. genes, transcripts, proteins, 5'UTRs, coding sequences, 3'UTRs, introns as well as sequence regions surrounding splice site boundaries.

To show the use of ASPicDB on a simple example, we retrieved all splice sites with a non-canonical CC donor and a canonical AG acceptor, supported by at least four ESTs. This search, which is easily performed using the ‘Advanced Search’ on ‘splice sites’, resulted in a total of 26 splice sites from 25 different genes. We then downloaded the 22-bp long sequences, ranging from –10 to +10 with respect to the donor and acceptor splice sites. Quite interestingly the alignment of these regions clearly shows a striking conservation of a repeated AGCC tetramer in both the donor and acceptor splice sites (Fig. 2). This observation would have been hard to obtain without the powerful query options offered by the ASPicdb interface.


Figure 2
View larger version (16K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 2. Logo representation of the sequence alignment of regions spanning positions –10 to +10 surrounding the CC donor and the AG acceptor for 26 splice sites of 25 genes, supported by ≥4 ESTs. The genome-EST alignments supporting such CC/AG splice sites are shown in Supplementary Fig. 3).

 
Of course, these data are inadequate to demonstrate the existence of a new rare non-canonical splice variant that would require suitable experimental demonstration. Indeed, we cannot exclude that this observation could be the result of recombination during library preparation resulting in the loss of internal cDNA sequences or in a deletion present in genomic sequences originating the transcripts supporting the non-canonical splice site but not in the reference genome used in the analysis.


    4 DISCUSSION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
ASPicDB is the result of the application of a recently published algorithm for splice site detection and transcript assembly to all genes in the human genome. The algorithm was shown to be quite reliable on benchmark tests on gene samples from the ENCODE regions (Bonizzoni et al., 2005) against the curated EGASP annotation (Guigo et al., 2006). In particular, due to its enhanced alignment features, the algorithm allows for the detection of very large introns as well as of very short exons that escape the majority of other prediction tools (Holste et al., 2006; Hsu et al., 2005; Huang et al., 2003; Kim et al., 2007; Lee et al., 2007; Takeda et al., 2007; Thanaraj et al., 2004). Unique features of ASPicDB are the classification of introns as U2 or U12, through an appropriate scoring of the donor, acceptor and branch site (Sheth et al., 2006) and a reliable detection of non-canonical splice sites. In this way, for example, users may query for all introns with non-canonical splice sites supported by more than 10 ESTs, and belonging to the U2 or U12 class, optionally restricting the output to those located on a specific genome region or chromosome. Other specific features of ASPicDB are the possibility to query genes narrowing the search to the ones containing alternative splicing events specifically affecting the 5'UTR, CDS or 3'UTR, or presenting a specific type of splicing event (e.g. exon skip, alternative donor or acceptor, and so on), or producing a given number (or range) of alternative transcripts/proteins, as well as any combination of the above criteria.

The possibility to query genes or splice sites showing a specific expression pattern in normal versus tumor cells in one or more tissues is another key feature of the database. Supplementary Figure 2A shows the expression pattern observed for the tp53 gene in 11 different tissues. Notably, a statistically significant cancer-specific expression of this gene is observed in urinary system, prostate, kidney and brain. The normal versus tumor expression pattern is also generated at the splice site level (Supplementary Fig. 2B) as previously described. Indeed, if a gene is exclusively expressed in tumor cells in a given tissue type it is meaningless to mine for tumor-specific isoforms. Such isoforms should instead be characterized by the occurrence of tumor specific introns (or vice versa normal) in genes expressed in both the normal and tumor status. Supplementary Figure 2B shows that splicing of intron #35 (numbering according to the ASPidDB annotation) in the gene MTO1 is remarkably cancer-specific in endocrine system, breast and muscle. Consequently, all transcript isoforms carrying this specific splice site are likely to be tumor-specific as well.

Finally, the download facilities add another unique feature to ASPicDB. In particular, the possibility to extract specific sequence regions surrounding boundaries of splice sites fulfilling specific criteria (e.g. a given assortment of donor and acceptor sites or a specific expression pattern) may greatly aid customized sequence analyses aimed at the identification of splicing regulatory elements (see also the example in Fig. 2).


    5 FUTURE DIRECTIONS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
ASPicDB is an ongoing project and we plan to further develop it in the next releases. The annotation of predicted isoforms will be further enriched by including information on specific regulatory elements in alternative mRNA untranslated regions and the functional features of the predicted protein isoforms (e.g. occurrence of PFAM domains, signal peptides, transmembrane helices, etc.).

We also plan to extend the database to other organisms for which the genome sequence and a suitable amount of expressed sequences are available (e.g. mouse, rat, zebrafish, etc.) and to add facilities to perform comparative analysis of alternative splicing of homologous genes.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
This work was supported by FIRB (MIUR), EU STREP project ‘TRANSCODE’, AIRC, and Telethon.

Conflicts of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Alfonso Valencia

Received on November 30, 2007; revised on February 26, 2008; accepted on March 28, 2008

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS
 3 RESULTS
 4 DISCUSSION
 5 FUTURE DIRECTIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 

    Black DL. Mechanisms of alternative pre-messenger RNA splicing. Annu. Rev. Biochem (2003) 72:291–336.[CrossRef][Web of Science][Medline]

    Blencowe BJ. Alternative splicing: new insights from global analyses. Cell (2006) 126:37–47.[CrossRef][Web of Science][Medline]

    Bonizzoni P, et al. ASPIC: a novel method to predict the exon-intron structure of a gene that is optimally compatible to a set of transcript sequences. BMC Bioinformatics (2005) 6:244.[CrossRef][Medline]

    Bonizzoni P, et al. Computational methods for alternative splicing prediction. Brief Funct. Genomic Proteomic (2006) 5:46–51.[Abstract/Free Full Text]

    Brett D, et al. Alternative splicing and genome complexity. Nat. Genet (2002) 30:29–30.[CrossRef][Web of Science][Medline]

    Castrignano T, et al. ASPIC: a web resource for alternative splicing prediction and transcript isoforms characterization. Nucleic Acids Res (2006) 34:W440–W443.[Abstract/Free Full Text]

    Garcia-Blanco MA, et al. Alternative splicing in disease and therapy. Nat. Biotechnol (2004) 22:535–546.[CrossRef][Web of Science][Medline]

    Guigo R, et al. EGASP: the human ENCODE Genome Annotation Assessment Project. Genome Biol (2006) 7(Suppl. 1):S2 1–31.[CrossRef]

    Holste D, et al. HOLLYWOOD: a comparative relational database of alternative splicing. Nucleic Acids Res (2006) 34:D56–D62.[Abstract/Free Full Text]

    Hsu FR, et al. AVATAR: A database for genome-wide alternative splicing event detection using large scale ESTs and mRNAs. Bioinformation (2005) 1:16–18.[Medline]

    Huang HD, et al. ProSplicer: a database of putative alternative splicing information derived from protein, mRNA and expressed sequence tag sequence data. Genome Biol (2003) 4:R29.[CrossRef][Medline]

    Kelso J, et al. eVOC: a controlled vocabulary for unifying gene expression data. Genome Res (2003) 13:1222–1230.[Abstract/Free Full Text]

    Kim N, et al. The ASAP II database: analysis and comparative genomics of alternative splicing in 15 animal species. Nucleic Acids Res (2007) 35:D93–D98.[Abstract/Free Full Text]

    Kopelman NM, et al. Alternative splicing and gene duplication are inversely correlated evolutionary mechanisms. Nat. Genet (2005) 37:588–589.[CrossRef][Web of Science][Medline]

    Lee Y, et al. ECgene: an alternative splicing database update. Nucleic Acids Res (2007) 35:D99–D103.[Abstract/Free Full Text]

    Lopez AJ. Alternative splicing of pre-mRNA: developmental consequences and mechanisms of regulation. Annu. Rev. Genet (1998) 32:279–305.[CrossRef][Web of Science][Medline]

    Maquat LE. Nonsense-mediated mRNA decay: splicing, translation and mRNP dynamics. Nat. Rev. Mol. Cell. Biol (2004) 5:89–99.[CrossRef][Web of Science][Medline]

    Matlin AJ, et al. Understanding alternative splicing: towards a cellular code. Nat. Rev. Mol. Cell. Biol (2005) 6:386–398.[CrossRef][Web of Science][Medline]

    Mironov AA, et al. Frequent alternative splicing of human genes. Genome Res (1999) 9:1288–1293.[Abstract/Free Full Text]

    Pruitt KD, et al. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res (2007) 35:D61–D65.[Abstract/Free Full Text]

    Sheth N, et al. Comprehensive splice-site analysis using comparative genomics. Nucleic Acids Res (2006) 34:3955–3967.[Abstract/Free Full Text]

    Stamm S, et al. ASD: a bioinformatics resource on alternative splicing. Nucleic Acids Res (2006) 34:D46–D55.[Abstract/Free Full Text]

    Takeda J, et al. H-DBAS: alternative splicing database of completely sequenced and manually annotated full-length cDNAs based on H-Invitational. Nucleic Acids Res (2007) 35:D104–D109.[Abstract/Free Full Text]

    Talavera D, et al. The (in)dependence of alternative splicing and gene duplication. PLoS Comput. Biol (2007) 3:e33.[CrossRef][Medline]

    Thanaraj TA, et al. ASD: the Alternative Splicing Database. Nucleic Acids Res (2004) 32:D64–D69.[Abstract/Free Full Text]

    Thierry-Mieg D, Thierry-Mieg J. AceView: a comprehensive cDNA-supported gene and transcripts annotation. Genome Biol (2006) 7(Suppl. 1):S12 1–14.

    Tress ML, et al. The implications of alternative splicing in the ENCODE protein complement. Proc. Natl Acad. Sci. USA (2007) 104:5495–5500.[Abstract/Free Full Text]

    Wang Z, et al. Computational analysis and experimental validation of tumor-associated alternative RNA splicing in human cancer. Cancer Res (2003) 63:655–657.[Abstract/Free Full Text]

    Zhang H, et al. PolyA_DB: a database for mammalian mRNA polyadenylation. Nucleic Acids Res (2005) 33:D116–D120.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J R Soc InterfaceHome page
G. A Reeves, D. Talavera, and J. M Thornton
Genome and proteome annotation: organization, interpretation and integration
J R Soc Interface, February 6, 2009; 6(31): 129 - 147.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
24/10/1300    most recent
btn113v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Castrignanò, T.
Right arrow Articles by Pesole, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Castrignanò, T.
Right arrow Articles by Pesole, G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?