How does quantile normalization work?

15

In gene expression studies using microarrays, intensity data has to be normalized so that intensities can be compared between individuals, between genes. Conceptually, and algorithmically, how does "quantile normalization" work, and how would you explain this to a non-statistician?

Stephen Turner
source
2
Simple: it's "grading on a curve." :-) I give an algorithm at quantdec.com/envstats/notes/class_03/probability.htm ("Reading Q-Q plots").
whuber
This PDF explains quantile normalization with a simple worked example: plexdb.org/modules/documentation/RMAexplained.pdf The paper is explaining a larger process (RMA) but quantile normalization is performed as one of the steps.
JHubbard80

Answers:

7

"A comparison of normalization methods for high density oligonucleotide array data based on variance and bias" by Bolstad et al. introduces quantile normalization for array data and compares it to other methods. It has a pretty clear description of the algorithm.

The conceptual understanding is that it is a transformation of array $j$ using a function $\hat{F}^{-1} \circ \hat{G}_j$, where $\hat{G}_j$ is an estimated distribution function and $\hat{F}^{-1}$ is the inverse of an estimated distribution function. It has the consequence that the normalized distributions become identical for all the arrays. For quantile normalization, $\hat{G}_j$ is the empirical distribution of array $j$ and $\hat{F}$ is the empirical distribution for the averaged quantiles across arrays.

At the end of the day it is a method for transforming all the arrays to have a common distribution of intensities.
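To make the transformation concrete, here is a minimal NumPy sketch of the idea described above. It is an illustration under stated assumptions, not the reference implementation from Bolstad et al.: the function name `quantile_normalize` is made up for this example, the input is assumed to be a matrix with one row per probe and one column per array, and ties within an array are broken by sort order rather than averaged.

```python
import numpy as np

def quantile_normalize(x):
    """Quantile-normalize the columns of x (rows = probes, columns = arrays).

    Assumption: ties within a column are broken by sort order, not averaged.
    """
    x = np.asarray(x, dtype=float)
    # Sort order of each column: order[k, j] is the row holding the
    # k-th smallest intensity of array j (this plays the role of G_j).
    order = np.argsort(x, axis=0, kind="stable")
    sorted_x = np.take_along_axis(x, order, axis=0)
    # Average the k-th smallest values across arrays: the common
    # reference distribution (this plays the role of F).
    mean_quantiles = sorted_x.mean(axis=1)
    # Put the k-th reference value back where the k-th smallest value was.
    out = np.empty_like(x)
    np.put_along_axis(out, order, mean_quantiles[:, None], axis=0)
    return out

# Hypothetical toy data: 4 probes measured on 3 arrays.
intensities = np.array([[5.0, 4.0, 3.0],
                        [2.0, 1.0, 4.0],
                        [3.0, 3.0, 6.0],
                        [4.0, 2.0, 8.0]])
normalized = quantile_normalize(intensities)
print(normalized)
```

After the call, every column of `normalized` contains the same set of values (the row means of the sorted columns), while each probe keeps its rank within its own array. In "grading on a curve" terms: the shape of the curve is shared, and only each array's ordering is kept.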

NRH
sumber