| estimateDisp {edgeR} | R Documentation |
Maximizes the negative binomial likelihood to give the estimate of the common, trended and tagwise dispersions across all tags.
estimateDisp(y, design=NULL, offset=NULL, prior.df=NULL, trend.method="locfit", span=NULL, grid.length=21, grid.range=c(-10,10), robust=FALSE, winsor.tail.p=c(0.05,0.1), tol=1e-06)
y |
|
design |
numeric design matrix |
offset |
numeric vector or matrix of offsets for the log-linear models |
prior.df |
prior degrees of freedom. It is used in calculating |
trend.method |
method for estimating dispersion trend. Possible values are |
span |
width of the smoothing window, as a proportion of the data set. |
grid.length |
the number of points on which the interpolation is applied for each tag. |
grid.range |
the range of the grid points around the trend on a log2 scale. |
robust |
logical, should the estimation of |
winsor.tail.p |
numeric vector of length 1 or 2, giving left and right tail proportions of the deviances to Winsorize when estimating |
tol |
the desired accuracy, passed to |
This function calculates a matrix of likelihoods for each gene at a set of dispersion grid points, and then applies weighted likelihood empirical Bayes method to obtain posterior dispersion estimates. If there is no design matrix, it calculates the quantile conditional likelihood for each gene (tag) and then maximize it. The method is same as in the function estimateCommonDisp and estimateTagwiseDisp. If a design matrix is given, it then calculates the adjusted profile log-likelihood for each gene (tag) and then maximize it. It is similar to the functions estimateGLMCommonDisp, estimateGLMTrendedDisp and estimateGLMTagwiseDisp.
Returns object with the following added components:
common.dispersion |
estimate of the common dispersion. |
trended.dispersion |
estimates of the trended dispersions. |
tagwise.dispersion |
tag- or gene-wise estimates of the dispersion parameter. |
logCPM |
the tag abundance in log average counts per million. |
prior.df |
prior degrees of freedom. It is a vector when robust method is used. |
prior.n |
estimate of the prior weight, i.e. the smoothing parameter that indicates the weight to put on the common likelihood compared to the individual tag's likelihood. |
span |
width of the smoothing window used in estimating dispersions. |
Yunshun Chen, Gordon Smyth
McCarthy, DJ, Chen, Y, Smyth, GK (2012). Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research 40, 4288-4297. http://nar.oxfordjournals.org/content/40/10/4288
Robinson, MD, and Smyth, GK (2007). Moderated statistical tests for assessing differences in tag abundance. Bioinformatics 23, 2881-2887. http://bioinformatics.oxfordjournals.org/content/23/21/2881
estimateCommonDisp, estimateTagwiseDisp, estimateGLMCommonDisp, estimateGLMTrendedDisp, estimateGLMTagwiseDisp
# True dispersion is 1/5=0.2 y <- matrix(rnbinom(1000, mu=10, size=5), ncol=4) group <- c(1,1,2,2) design <- model.matrix(~group) d <- DGEList(counts=y, group=group) d1 <- estimateDisp(d) d2 <- estimateDisp(d, design)