Bioinformatics Vol. 18 no. 4 2002
Pages 576-584
© 2002 Oxford University Press
Making sense of microarray data distributions
1 School of Biological Sciences, University
of Manchester, Stopford Building, Oxford Rd, Manchester M13 9PT, UK
2 Department of Computer Science, University
of Manchester, Kilburn Building, Oxford Rd, Manchester M13 9PL, UK
3 Aventis Pharmaceuticals, 4041 Route
202-206, PO Box 6800, Bridgewater, NJ 08807, USA
Received on September 5, 2001
; revised on October 31, 2001
; accepted on November 16, 2001
Motivation: Typical analysis of microarray data has focusedon spot by spot comparisons within a single organism. Less analysis has been done on the comparison of the entire distribution of spot intensities between experiments and between organisms.
Results: Here we show that mRNA transcription data from a wide
range of organisms and measured with a range of experimental
platforms show close agreement with Benfords law (Benford,
Proc. Am. Phil. Soc. , 78, 551572, 1938) and
Zipfs law (Zipf, The Psycho-biology of Language: an
Introduction to Dynamic Philology , 1936 and Human
Behaviour and the Principle of Least Effort , 1949). The
distribution of the bulk of microarray spot intensities is well
approximated by a log-normal with the tail of the distribution being
closer to power law. The variance,
2, of log
spot intensity shows a positive correlation with genome size (in
terms of number of genes) and is therefore relatively fixed within
some range for a given organism. The measured value of
2 can be significantly smaller than the
expected value if the mRNA is extracted from a sample of mixed cell
types. Our research demonstrates that useful biological findings may
result from analyzing microarray data at the level of entire
intensity distributions.
Contact: david.c.hoyle{at}man.ac.uk
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Pavelka, M. L. Fournier, S. K. Swanson, M. Pelizzola, P. Ricciardi-Castagnoli, L. Florens, and M. P. Washburn Statistical Similarities between Transcriptomics and Quantitative Shotgun Proteomics Data Mol. Cell. Proteomics, April 1, 2008; 7(4): 631 - 644. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Du, J. S. Rozowsky, J. O. Korbel, Z. D. Zhang, T. E. Royce, M. H. Schultz, M. Snyder, and M. Gerstein A supervised hidden markov model framework for efficiently segmenting tiling array data in transcriptional and chIP-chip experiments: systematically incorporating validated biological knowledge Bioinformatics, December 15, 2006; 22(24): 3016 - 3024. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Motakis, G. P. Nason, P. Fryzlewicz, and G. A. Rutter Variance stabilization and normalization for one-color microarray data using a data-driven multiscale approach Bioinformatics, October 15, 2006; 22(20): 2547 - 2553. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Bishop, T. Shah, R. Pelle, D. Hoyle, T. Pearson, L. Haines, A. Brass, H. Hulme, S. P. Graham, E. L. N. Taracha, et al. Analysis of the transcriptome of the protozoan Theileria parva using MPSS reveals that the majority of genes are transcriptionally active in the schizont stage Nucleic Acids Res., September 25, 2005; 33(17): 5503 - 5511. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Diraison, E. Motakis, L. E. Parton, G. P. Nason, I. Leclerc, and G. A. Rutter Impact of Adenoviral Transduction With SREBP1c or AMPK on Pancreatic Islet Gene Expression Profile: Analysis With Oligonucleotide Microarrays Diabetes, December 1, 2004; 53(suppl_3): S84 - S91. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. K. Jordan, L. Marino-Ramirez, Y. I. Wolf, and E. V. Koonin Conservation and Coevolution in the Scale-Free Human Gene Coexpression Network Mol. Biol. Evol., November 1, 2004; 21(11): 2058 - 2070. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. R. Ueda, S. Hayashi, S. Matsuyama, T. Yomo, S. Hashimoto, S. A. Kay, J. B. Hogenesch, and M. Iino Universality and flexibility in gene expression from bacteria to human PNAS, March 16, 2004; 101(11): 3765 - 3769. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-A. Tsai, Y.-J. Chen, and J. J. Chen Testing for differentially expressed genes with microarray data Nucleic Acids Res., May 1, 2003; 31(9): e52 - e52. [Abstract] [Full Text] [PDF] |
||||





