R empinf 經驗影響值

R語言 empinf 位於 boot 包(package)。

說明

此函數計算應用於數據集的統計量的經驗影響值。它允許四種類型的計算，即無窮小折刀(使用數值微分)、通常的折刀估計、‘positive’折刀估計以及使用統計量的引導複製回歸來估計經驗影響值的方法。所有方法均可用於一個或多個樣品。

用法

empinf(boot.out = NULL, data = NULL, statistic = NULL,
       type = NULL, stype = NULL ,index = 1, t = NULL,
       strata = rep(1, n), eps = 0.001, ...)

參數

`boot.out`	由函數 `boot` 創建的引導對象。如果`type` 是`"reg"`，則需要此參數。對於任何其他類型，它是可選參數。如果在可選時包含它，則 `data` 、 `statistic` 、 `stype` 和 `strata` 的值將從 `boot.out` 的組件中獲取，並且直接傳遞給 `empinf` 的任何值都將被忽略。
`data`	包含需要經驗影響值的數據的向量、矩陣或 DataFrame 。如果未提供`boot.out`，則它是必需的參數。如果提供了 `boot.out`，則 `data` 將設置為 `boot.out$data`，並且忽略提供的任何值。
`statistic`	需要經驗影響值的統計數據。它必須是至少兩個參數的函數，即數據集和權重、頻率或指數的向量。第二個參數的性質由 `stype` 的值給出。它采用的任何其他參數都必須提供給`empinf`，並將原樣傳遞給`statistic`。如果未提供`boot.out`，則這是必需的參數，否則其值取自`boot.out`，並且此處提供的任何值都將被忽略。
`type`	用於經驗影響值的計算類型。可能的值`type`是`"inf"`(無窮小折刀)，`"jack"`(通常是折刀)，`"pos"`(正折刀)，以及`"reg"`(回歸估計)。默認值取決於其他參數。如果`t`則提供默認值`type`是`"reg"`和`boot.out`應該存在，以便可以找到其頻率陣列。它`t`不提供，那麽如果`stype`是`"w"`，默認值`type`是`"inf"`;否則，如果`boot.out`存在默認值是`"reg"`。如果這些條件都不適用，則默認值為`"jack"`。請注意，這是一個錯誤`type`成為`"reg"`如果`boot.out`缺失或存在`"inf"`如果`stype`不是`"w"`.
`stype`	一個字符變量，賦予 `statistic` 第二個參數的性質。它可以采用三個值：`"w"`(權重)、`"f"`(頻率)或`"i"`(索引)。如果提供了 `boot.out`，則 `stype` 的值將設置為 `boot.out$stype`，並且此處提供的任何值都將被忽略。否則它是一個可選參數，默認為 `"w"` 。如果 `type` 是 `"inf"` 則 `stype` 必須是 `"w"` 。
`index`	一個整數，給出 `statistic` 輸出中感興趣的變量的位置。
`t`	長度為 `boot.out$R` 的向量，給出感興趣的統計數據的引導複製。 `t` 僅當 `type` 為 `reg` 時使用，默認為 `boot.out$t[,index]` 。
`strata`	指定多樣本問題的層的整數向量或因子。如果提供 `boot.out` ，則 `strata` 的值將設置為 `boot.out$strata` 。否則它是一個可選參數，其默認值對應於單個樣本情況。
`eps`	僅當 `type` 為 `"inf"` 時才使用此參數。在這種情況下，用於數值微分的 epsilon 值將是 `eps` 除以 `data` 中的觀測值數量。
`...`	`statistic` 采用的任何其他參數。每次調用時，它們都會原封不動地傳遞給`statistic`。

細節

如果type 是"inf"，則使用數值微分來近似經驗影響值。這僅對於以加權形式寫入的統計數據有意義(即 stype 是 "w" )。如果type 是"jack"，則返回經驗影響的通常留一折刀估計。如果type 是"pos"，則使用正(include-one-twice) 折刀值。如果type 是"reg"，則必須提供引導對象。然後，回歸方法通過在衍生它們的頻率數組上回歸 statistic 的引導複製來工作。自舉頻率數組是通過調用 boot.array 獲得的。這些方法的更多細節在 Davison 和 Hinkley (1997) 的 2.7 節中給出。

經驗影響值經常在非參數引導應用程序中使用。因此，許多其他函數在需要時調用empinf。它們的一些使用示例包括方差的非參數 delta 估計、BCa 區間以及查找統計數據的線性近似以用作控製變量。它們還用於對立引導重采樣。

值

應用於 data 的 statistic 經驗影響值的向量。這些值的順序與數據中觀察值的順序相同。

警告

empinf 的所有參數都必須使用 name = value 約定傳遞。如果不遵循這一點，則可能會發生不可預測的錯誤。

例子

# The empirical influence values for the ratio of means in
# the city data.
ratio <- function(d, w) sum(d$x *w)/sum(d$u*w)
empinf(data = city, statistic = ratio)
city.boot <- boot(city, ratio, 499, stype="w")
empinf(boot.out = city.boot, type = "reg")

# A statistic that may be of interest in the difference of means
# problem is the t-statistic for testing equality of means.  In
# the bootstrap we get replicates of the difference of means and
# the variance of that statistic and then want to use this output
# to get the empirical influence values of the t-statistic.
grav1 <- gravity[as.numeric(gravity[,2]) >= 7,]
grav.fun <- function(dat, w) {
     strata <- tapply(dat[, 2], as.numeric(dat[, 2]))
     d <- dat[, 1]
     ns <- tabulate(strata)
     w <- w/tapply(w, strata, sum)[strata]
     mns <- as.vector(tapply(d * w, strata, sum)) # drop names
     mn2 <- tapply(d * d * w, strata, sum)
     s2hat <- sum((mn2 - mns^2)/ns)
     c(mns[2] - mns[1], s2hat)
}

grav.boot <- boot(grav1, grav.fun, R = 499, stype = "w",
                  strata = grav1[, 2])

# Since the statistic of interest is a function of the bootstrap
# statistics, we must calculate the bootstrap replicates and pass
# them to empinf using the t argument.
grav.z <- (grav.boot$t[,1]-grav.boot$t0[1])/sqrt(grav.boot$t[,2])
empinf(boot.out = grav.boot, t = grav.z)

參考

Davison, A.C. and Hinkley, D.V. (1997) Bootstrap Methods and Their Application. Cambridge University Press.

Efron, B. (1982) The Jackknife, the Bootstrap and Other Resampling Plans. CBMS-NSF Regional Conference Series in Applied Mathematics, 38, SIAM.

Fernholtz, L.T. (1983) von Mises Calculus for Statistical Functionals. Lecture Notes in Statistics, 19, Springer-Verlag.

也可以看看

boot , boot.array , boot.ci , control , jack.after.boot , linear.approx , var.linear

相關用法

注：本文由純淨天空篩選整理自R-devel大神的英文原創作品 Empirical Influence Values。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。