R fitdistr 單變量分布的最大似然擬合

R語言 fitdistr 位於 MASS 包(package)。

說明

Maximum-likelihood 單變量分布擬合，如果需要，允許參數保持固定。

用法

fitdistr(x, densfun, start, ...)

參數

`x`	長度至少為 1 且僅包含 finite 值的數值向量。
`densfun`	字符串或返回在其第一個參數處計算的密度的函數。分布 `"beta"` , `"cauchy"` , `"chi-squared"` , `"exponential"` , `"gamma"` , `"geometric"` , `"log-normal"` , `"lognormal"` , `"logistic"` , `"negative binomial"` , `"normal"` , `"Poisson"` , `"t"` 和 `"weibull"` 被識別，大小寫被忽略。
`start`	給出要使用初始值優化的參數的命名列表。對於某些指定的發行版來說可以省略，而對於其他發行版則必須省略(請參閱詳細信息)。
`...`	其他參數，適用於 `densfun` 或 `optim` 。特別是，它可用於通過 `lower` 或 `upper` 或兩者指定邊界。如果包含`densfun`(或對應於字符串規範的密度函數)的參數，它們將保持固定。

細節

對於正態分布、log-Normal、幾何分布、指數分布和泊鬆分布，使用封閉形式 MLE(和精確標準誤差)，並且不應提供start。

對於所有其他分布，使用 optim 執行對數似然的直接優化。估計的標準誤差取自觀察到的信息矩陣，通過數值近似計算。對於一維問題，使用 Nelder-Mead 方法；對於多維問題，使用 BFGS 方法，除非提供了名為 lower 或 upper 的參數(當使用 L-BFGS-B 時)或顯式提供了 method。

對於 "t" 命名分布，密度被視為具有位置 m 和尺度 s 的位置尺度族。

對於以下命名分布，如果省略 start 或僅部分指定，將計算合理的起始值： "cauchy" 、 "gamma" 、 "logistic" 、 "negative binomial" (由 mu 和 size 參數化)， "t" 和 "weibull" 。請注意，如果擬合較差，這些起始值可能不夠好：特別是它們不能抵抗異常值，除非擬合分布是長尾的。

類 "fitdistr" 有 print 、 coef 、 vcov 和 logLik 方法。

值

類 "fitdistr" 的對象，一個包含四個組件的列表，

`estimate`	參數估計，
`sd`	估計的標準誤差，
`vcov`	估計的方差-協方差矩陣，以及
`loglik`	對數似然。

注意

數值優化無法創造奇跡：請注意 optim 中有關縮放數據的注釋。如果擬合的參數遠離 1，請考慮指定控製參數 parscale 重新擬合。

例子

## avoid spurious accuracy
op <- options(digits = 3)
set.seed(123)
x <- rgamma(100, shape = 5, rate = 0.1)
fitdistr(x, "gamma")
## now do this directly with more control.
fitdistr(x, dgamma, list(shape = 1, rate = 0.1), lower = 0.001)

set.seed(123)
x2 <- rt(250, df = 9)
fitdistr(x2, "t", df = 9)
## allow df to vary: not a very good idea!
fitdistr(x2, "t")
## now do fixed-df fit directly with more control.
mydt <- function(x, m, s, df) dt((x-m)/s, df)/s
fitdistr(x2, mydt, list(m = 0, s = 1), df = 9, lower = c(-Inf, 0))

set.seed(123)
x3 <- rweibull(100, shape = 4, scale = 100)
fitdistr(x3, "weibull")

set.seed(123)
x4 <- rnegbin(500, mu = 5, theta = 4)
fitdistr(x4, "Negative Binomial")
options(op)

參考

Venables, W. N. and Ripley, B. D. (2002) Modern Applied Statistics with S. Fourth edition. Springer.

相關用法

注：本文由純淨天空篩選整理自R-devel大神的英文原創作品 Maximum-likelihood Fitting of Univariate Distributions。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。