Skip Navigation


Bioinformatics Advance Access originally published online on November 22, 2006
Bioinformatics 2007 23(3):397-399; doi:10.1093/bioinformatics/btl593
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
23/3/397    most recent
btl593v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Google Scholar
Right arrow Articles by Han, A.
Right arrow Articles by Park, S.-M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Han, A.
Right arrow Articles by Park, S.-M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2006 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

SNP2NMD: A database of human single nucleotide polymorphisms causing nonsense-mediated mRNA decay

Areum Han 1,{dagger}, Woo-Yeon Kim 1,{dagger} and Seong-Min Park 1,2,*

1 Korean Bioinformation Center, KRIBB Daejeon 305-806, Korea
2 Functional Genomics Research Center, KRIBB Daejeon 305-806, Korea

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS AND RESULTS
 3 WEB INTERFACES
 REFERENCES
 

Summary: Elucidating the effects of genetic polymorphisms on genes and gene networks is an important step in disease association studies. We developed the SNP2NMD database for human SNPs (single nucleotide polymorphisms) that result in PTCs (premature termination codons) and trigger nonsense-mediated mRNA decay (NMD). The SNP2NMD Web interfaces provide extensive genetic information on and graphical views of the queried SNP, gene, and disease terms.

Availability: SNP2NMD is available from http://variome.net, or directly from http://bioportal.kobic.re.kr/SNP2NMD

Contact: kimplove{at}kribb.re.kr, lastmhc{at}kribb.re.kr

Supplementary information: http://bioportal.kobic.re.kr/SNP2NMD/Wiki.jsp?page=Statistics


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS AND RESULTS
 3 WEB INTERFACES
 REFERENCES
 
Annotating SNPs (single nucleotide polymorphisms) is becoming important in biology since information on their effects on genes and gene networks enables biologists to select and interpret disease-associated SNPs. An expert-curated SNP catalog (SNP@Web; http://bioportal.kobic.re.kr/SNPatWEB/Wiki.jsp?page=Annotation) has reported over 20 SNP-annotation services. However, no application is available for nonsense-mediated mRNA decay (NMD) triggered by SNPs so far. Among SNPs, nonsense SNPs resulting in premature termination codons (PTCs) should be identified systemically since they go beyond altering protein sequences, by being able to trigger the NMD pathway and eliminate the production of proteins (Hentze and Kulozik, 1999; Maquat, 2004). Recent research suggests that NMD controls cellular function as well as corrects biosynthetic errors, since one-third of human transcripts contain PTCs and are potential targets of NMD (Lewis et al., 2003). Savas's group also reported 28 NMD triggering SNPs that are common in human populations (Savas et al., 2006).

Here, we developed an extensive database, SNP2NMD, consisting of SNPs that are able to trigger NMD. The SNP2NMD integrated other genetic information such as gene and disease annotations, and will help researchers to identify SNP-driven NMD and predict their effects on genetic networks. Moreover, we designed user-friendly SNP2NMD interfaces to accept user-defined rules and display graphical views of SNPs and gene structures. The SNP2NMD database can highlight the function of SNPs as sources of NMD and the evolutionary consequences of gene regulation that are able to affect phenotypes, including diseases (Noensie and Dietz, 2001).


    2 METHODS AND RESULTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS AND RESULTS
 3 WEB INTERFACES
 REFERENCES
 
The SNP2NMD database was developed in the three steps described below (Fig. 1a).


Figure 1
View larger version (39K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1 (a) Flowchart of SNP2NMD database construction (*default NMD rule: SNP location >50 nt upstream of the 3'-most exon–exon junction). (b) Search interface of SNP2NMD. A user can search the SNP2NMD database using SNP, gene and disease terms. (c) Example of SNP2NMD output (‘rs3917594’ within ‘PON1’). NMD-related information and a corresponding 2D image are shown.

 
2.1 Identification of nonsense SNPs
We identified human SNPs resulting in PTCs by integrating transcript structure annotation and positional information of the SNPs of human genes. The chromosomal positions and alleles of SNPs were parsed from a public SNP database (dbSNP ver. 125; Sherry et al., 2001) and mapped to exon and intron structures downloaded from the gene annotation files of UCSC (refGene, hg17; Karolchik et al., 2003) whilst considering the version of the human genome build (ver. 34) and the chromosomal strand. As a result, we identified 1301 (1%) SNPs producing stop codons from 123 908 coding SNPs.

2.2 Application of an NMD rule
According to mammalian NMD research, PTCs followed by an exon–exon junction that is located more than ~50–55 nt downstream in a spliced mRNA generally elicit NMD (Nagy and Maquat, 1998). We therefore denote the ‘NMD distance’ as the distance between an SNP and the 3'-most exon–exon junction, and is set to a default NMD rule as an SNP with an NMD distance >50 nt (Notice that search interfaces of SNP2NMD are also able to accept a user-defined NMD distance). With the default NMD rule, we detected 765 SNP-mRNA pairs with 635 mRNAs as NMD candidates. The mean, median, and standard deviation of the NMD distance were ~1999.4, 775 and 5079.7 nt, respectively.

2.3 Integration with functional annotation
The SNP2NMD database adopted various gene annotations including pathways (KEGG; Kanehisa and Goto, 2000), gene ontology (GOA; Camon et al., 2004), and disease information (GAD, Becker et al., 2004; HGMD, David et al., 2005; and OMIM, McKusick, 1998). The raw data files were integrated into the SNP2NMD database based on a gene synonym table from HGNC (HUGO Gene Nomenclature Committee; Eyre et al., 2006). These annotations provide insight into the effects of SNP2 resulting in NMD and help to characterize target genes of NMD caused from SNPs. To find significant associations of Gene Ontology terms with target genes, we assigned genes into gene ontology categories and selected the categories, which were significantly enriched (P < 0.01) with minimum number of observed gene number >5 (GOTM; Zhang et al., 2004). The analysis showed that NMD-candidate genes were associated with gene ontology categories including ‘cell adhesion’, ‘physical interaction between organisms’, ‘reproductive physiological process’, ‘nucleotide binding’, ‘protein binding’, and ‘extra cellular matrix’ (More information is available on the website supplement page).


    3 WEB INTERFACES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS AND RESULTS
 3 WEB INTERFACES
 REFERENCES
 
As shown in Figure 1b, users can search the SNP2NMD database using three entries: (1) SNP identifier (rs number from dbSNP), (2) gene ID (refSeq ID starting with ‘NM’) or name/symbol and (3) a disease term. In the case of a user submitting a gene or disease term, SNP2NMD returns a gene list related to the query that provides summary and gene details. The summary shows the number of NMD candidates within the resultant gene set. When users access the detailed view of NMD candidates by searching an SNP or selecting a gene, NMD information including the NMD distance and 2D views as well as SNP and gene details are shown. The 2D view was developed using Gbrowse (Stein et al., 2002), and provides a graphical view of the gene structure and SNP position simultaneously (Fig. 1c). The summary and external links to SNP, gene and disease information are designed to assist in further research.


    Acknowledgments
 
The authors thank Jong Bhak for editing the manuscript and the SNP Pipeline Team at KOBIC (Korean Bioinformation Center). Also, we thank Young Il Yeom for discussion about the validation of SNP2NMD data. This project was supported by the Korean Ministry of Science and Technology (MOST) under grant number M10407010001-06N0701-00110, which also provided funding to pay for Open Access publication charges.

Conflict of Interest: none declared.


    FOOTNOTES
 
{dagger}The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors. Back

Associate Editor: Joaquin Dopazo

Received on September 21, 2006; revised on November 18, 2006; accepted on November 18, 2006

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 METHODS AND RESULTS
 3 WEB INTERFACES
 REFERENCES
 

    Becker, K.G., et al. (2004) The genetic association database. Nat. Genet, . 36, 431–432[CrossRef][Web of Science][Medline].

    Camon, E., et al. (2004) The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Res, . 32, D262–D266[Abstract/Free Full Text].

    Eyre, T.A., et al. (2006) The HUGO Gene Nomenclature Database, 2006 updates. Nucleic Acids Res, . 34, D319–D321[Abstract/Free Full Text].

    David, N.C., et al. (2005) The human gene mutation database (HGMD) and its exploitation in the study of mutational mechanisms. Curr. Prot. Bioinformatics, Unit 1.13.

    Hentze, M.W. and Kulozik, A.E. (1999) A perfect message: RNA surveillance and nonsense-mediated decay. Cell, 96, 307–310[CrossRef][Web of Science][Medline].

    Kanehisa, M. and Goto, S. (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res, . 28, 27–30[Abstract/Free Full Text].

    Karolchik, D.R., et al. (2003) The UCSC Genome Browser Database. Nucleic Acids Res, . 31, 51–54[Abstract/Free Full Text].

    Lewis, B.P., et al. (2003) Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc. Natl Acad. Sci. USA, 100, 189–192[Abstract/Free Full Text].

    Maquat, L.E. (2004) Nonsense-mediated mRNA decay: splicing, translation and mRNP dynamics. Nat. Rev. Mol. Cell Biol, . 5, 89–99[CrossRef][Web of Science][Medline].

    McKusick, V.A. (1998) Mendelian inheritance in man: a catalog of human genes and genetic disorders. Johns Hopkins University Press.

    Nagy, E. and Maquat, L.E. (1998) A rule for termination-codon position within intron-containing genes: when nonsense affects RNA abundance. Trends Biochem. Sci, . 23, 198–199[CrossRef][Web of Science][Medline].

    Noensie, E.N. and Dietz, H.C. (2001) A strategy for disease gene identification through nonsense-mediated mRNA decay inhibition. Nat. Biotechnol, . 19, 434–439[CrossRef][Web of Science][Medline].

    Savas, S., et al. (2006) Human SNPs resulting in premature stop codons and protein truncation. Hum. Genomics, 2, 274–286[Medline].

    Sherry, S.T., et al. (2001) dbSNP: the NCBI database of genetic variation. Nucleic Acids Res, . 29, 308–311[Abstract/Free Full Text].

    Stein, L.D., et al. (2002) The generic genome browser: a building block for a model organism system database. Genome Res, . 12, 1599–1610[Abstract/Free Full Text].

    Zhang, B., et al. (2004) GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies. BMC Bioinformatics, 5, 16[CrossRef][Medline].


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
The OncologistHome page
S. Savas and G. Liu
Studying Genetic Variations in Cancer Prognosis (and Risk): A Primer for Clinicians
Oncologist, July 1, 2009; 14(7): 657 - 666.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. G. Wilming, J. G. R. Gilbert, K. Howe, S. Trevanion, T. Hubbard, and J. L. Harrow
The vertebrate genome annotation (Vega) database
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D753 - D760.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
23/3/397    most recent
btl593v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Google Scholar
Right arrow Articles by Han, A.
Right arrow Articles by Park, S.-M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Han, A.
Right arrow Articles by Park, S.-M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?