Bioinformatics Advance Access originally published online on June 15, 2009
Bioinformatics 2009 25(17):2244-2250; doi:10.1093/bioinformatics/btp369
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A CitationRank algorithm inheriting Google technology designed to highlight genes responsible for serious adverse drug reaction
1 Bio-X Center, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai 200030, 2 Institutes of Biomedical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, 3 College of Life Sciences, Nanjing Agricultural University, 1 Weigang, Nanjing, 210095 and 4 Institute for Nutritional Sciences, Shanghai Institutes of Biological Sciences, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, 200031, PR China
* To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Serious adverse drug reaction (SADR) is an urgent, world-wide problem. In the absence of any well-organized gene-oriented SADR information pool, a database should be constructed. Since the importance of a gene to a particular SADR cannot simply be defined in terms of how frequently the two are cited together in the literature, an algorithm should be devised to sort genes according to their relevance to the SADR topics.
Results: The SADR-Gengle database, which is made up of gene–SADR relationships extracted from Pubmed, has been constructed, covering six major SADRs, namely cholestasis, deafness, muscle toxicity, QT prolongation, Stevens–Johnson syndrome and torsades de points. The CitationRank algorithm, which inherits the principle of the Google PageRank algorithm that a gene should be highly ranked when biologically related to other highly ranked genes, is devised. The algorithm performs robustly in recovering SADR-related genes in the presence of extraneous noise, and the use of the algorithm has been extended to sorting genes in our database. Users can browse genes in a Google-type system where genes are ordered according to their descending relevance to the SADR topic selected by the user. The database also provides users with visualized gene–gene knowledge chain networks, helping them to systematize their gene-oriented knowledge chain whilst navigating these networks.
Availability: The SADR-Gengle is freely available at http://Gengle.Bio-X.cn/SADR/.
Contact: helinhelin{at}gmail.com
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Limsoon Wong
Received on March 12, 2009; revised on June 9, 2009; accepted on June 9, 2009