Bioinformatics Advance Access originally published online on March 23, 2007
Bioinformatics 2007 23(11):1348-1355; doi:10.1093/bioinformatics/btm102
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Quantitating tissue specificity of human genes to facilitate biomarker discovery


1Mayo Clinic Comprehensive Cancer Center and Division of Experimental Pathology, Department of Laboratory Medicine and Pathology, 200 First St. SW, Rochester, MN 55905, USA and 2Division of Biostatistics, Health Sciences Research, Mayo Clinic, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
We describe a method to identify candidate cancer biomarkers by analyzing numeric approximations of tissue specificity of human genes. These approximations were calculated by analyzing predicted tissue expression distributions of genes derived from mapping expressed sequence tags (ESTs) to the human genome sequence using a binary indexing algorithm. Tissue-specificity values facilitated high-throughput analysis of the human genes and enabled the identification of genes highly specific to different tissues. Tissue expression distributions for several genes were compared to estimates obtained from other public gene expression datasets and experimentally validated using quantitative RT-PCR on RNA isolated from several human tissues. Our results demonstrate that most human genes (
98%) are expressed in many tissues (low specificity), and only a small number of genes possess very specific tissue expression profiles. These genes comprise a rich dataset from which novel therapeutic targets and novel diagnostic serum biomarkers may be selected.
Contact: vasm{at}mayo.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Alfonso Valencia
The authors wish it to be known that, in their opinion, the first two authors should regarded as joint First Authors.
Received on July 11, 2006; revised on January 16, 2007; accepted on March 10, 2007