Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


Open AccessHighly AccessResearch

Profound effect of normalization on detection of differentially expressed genes in oligonucleotide microarray data analysis

Reinhard Hoffmann1 email, Thomas Seidl2 and Martin Dugas3

1Department of Bacteriology, Max von Pettenkofer Institut, Pettenkoferstrasse 9a, 80336 Munich, Germany

2Section of Gene Function and Regulation, Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Road, London SW3 6JB, UK

3Department of Medical Informatics, Biometrics and Epidemiology, University of Munich, Marchioninistrasse 15, 81377 Munich, Germany

author email corresponding author email

Genome Biology 2002, 3:research0033.1-0033.11doi:10.1186/gb-2002-3-7-research0033

Published: 14 June 2002

Subject areas: Bioinformatics, Methods, Genome studies

Abstract

Background

Oligonucleotide microarrays measure the relative transcript abundance of thousands of mRNAs in parallel. A large number of procedures for normalization and detection of differentially expressed genes have been proposed. However, the relative impact of these methods on the detection of differentially expressed genes remains to be determined.

Results

We have employed four different normalization methods and all possible combinations with three different statistical algorithms for detection of differentially expressed genes on a prototype dataset. The number of genes detected as differentially expressed differs by a factor of about three. Analysis of lists of genes detected as differentially expressed, and rank correlation coefficients for probability of differential expression shows that a high concordance between different methods can only be achieved by using the same normalization procedure.

Conclusions

Normalization has a profound influence of detection of differentially expressed genes. This influence is higher than that of three subsequent statistical analysis procedures examined. Algorithms incorporating more array-derived information than gene-expression values alone are urgently needed.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.