R cor.test 配對樣本之間的關聯/相關性測試

R語言 cor.test 位於 stats 包(package)。

說明

使用 Pearson 的乘積矩相關係數、Kendall 的 \tau 或 Spearman 的 \rho 之一測試配對樣本之間的關聯。

用法

cor.test(x, ...)

## Default S3 method:
cor.test(x, y,
         alternative = c("two.sided", "less", "greater"),
         method = c("pearson", "kendall", "spearman"),
         exact = NULL, conf.level = 0.95, continuity = FALSE, ...)

## S3 method for class 'formula'
cor.test(formula, data, subset, na.action, ...)

參數

`x, y`	數據值的數值向量。 `x` 和`y` 必須具有相同的長度。
`alternative`	表示備擇假設，並且必須是 `"two.sided"` 、 `"greater"` 或 `"less"` 之一。您可以僅指定首字母。 `"greater"` 對應於正關聯，`"less"` 對應於負關聯。
`method`	指示要使用哪個相關係數進行測試的字符串。 `"pearson"` 、 `"kendall"` 或 `"spearman"` 之一可以縮寫。
`exact`	指示是否應計算精確 p 值的邏輯。用於 Kendall 的 `\tau` 和 Spearman 的 `\rho` 。有關`NULL`(默認值)的含義，請參閱“詳細信息”。
`conf.level`	返回的置信區間的置信水平。目前僅在至少有 4 個完整的觀測值對時才用於 Pearson 積矩相關係數。
`continuity`	邏輯：如果為 true，則在未精確計算時，會對 Kendall 的 `\tau` 和 Spearman 的 `\rho` 使用連續性校正。
`formula`	`~ u + v` 形式的公式，其中 `u` 和 `v` 都是數值變量，給出一個樣本的數據值。樣本的長度必須相同。
`data`	包含公式 `formula` 中的變量的可選矩陣或 DataFrame (或類似的：請參閱 `model.frame` )。默認情況下，變量取自`environment(formula)`。
`subset`	一個可選向量，指定要使用的觀測子集。
`na.action`	一個函數，指示當數據包含 `NA` 時應該發生什麽。默認為 `getOption("na.action")` 。
`...`	要傳遞給方法或從方法傳遞的更多參數。

細節

這三種方法各自估計配對樣本之間的關聯並計算值為零的測試。它們使用不同的關聯度量，均在 [-1, 1] 範圍內，其中 0 表示沒有關聯。這些有時被稱為無相關性測試，但該術語通常僅限於默認方法。

如果 method 為 "pearson" ，則檢驗統計量基於 Pearson 乘積矩相關係數 cor(x, y)，並且如果樣本遵循獨立正態分布，則遵循具有 length(x)-2 自由度的 t 分布。如果至少有 4 對完整的觀測值，則基於 Fisher Z 變換給出漸近置信區間。

如果 method 是 "kendall" 或 "spearman" ，則使用 Kendall 的 \tau 或 Spearman 的 \rho 統計量來估計基於排名的關聯度量。如果數據不一定來自二元正態分布，則可以使用這些檢驗。

對於 Kendall 檢驗，默認情況下(如果 exact 為 NULL)，如果包含有限值且不存在聯係的配對樣本少於 50 個，則會計算精確的 p 值。否則，檢驗統計量是縮放至零均值和單位方差的估計值，並且近似正態分布。

對於 Spearman 檢驗，p 值是使用 n < 1290 和 exact = TRUE 的算法 AS 89 計算的，否則通過漸近 t 近似計算。請注意，這些是 n < 10 的 ‘exact’ ，並使用 Edgeworth 級數近似來獲得更大的樣本量(截止值已從原始論文中更改)。

值

類"htest" 的列表包含以下組件：

`statistic`	檢驗統計量的值。
`parameter`	檢驗統計量服從 t 分布時的自由度。
`p.value`	檢驗的 p 值。
`estimate`	估計的關聯度量，名稱為 `"cor"` 、 `"tau"` 或 `"rho"` 對應於所采用的方法。
`null.value`	原假設下關聯度量的值，始終為 `0` 。
`alternative`	說明備擇假設的字符串。
`method`	指示如何測量關聯的字符串。
`data.name`	給出數據名稱的字符串。
`conf.int`	關聯度量的置信區間。目前僅在至少 4 個完整觀測值對的情況下給出 Pearson 乘積矩相關係數。

例子

## Hollander & Wolfe (1973), p. 187f.
## Assessment of tuna quality.  We compare the Hunter L measure of
##  lightness to the averages of consumer panel scores (recoded as
##  integer values from 1 to 6 and averaged over 80 such values) in
##  9 lots of canned tuna.

x <- c(44.4, 45.9, 41.9, 53.3, 44.7, 44.1, 50.7, 45.2, 60.1)
y <- c( 2.6,  3.1,  2.5,  5.0,  3.6,  4.0,  5.2,  2.8,  3.8)

##  The alternative hypothesis of interest is that the
##  Hunter L value is positively associated with the panel score.

cor.test(x, y, method = "kendall", alternative = "greater")
## => p=0.05972

cor.test(x, y, method = "kendall", alternative = "greater",
         exact = FALSE) # using large sample approximation
## => p=0.04765

## Compare this to
cor.test(x, y, method = "spearm", alternative = "g")
cor.test(x, y,                    alternative = "g")

## Formula interface.
require(graphics)
pairs(USJudgeRatings)
cor.test(~ CONT + INTG, data = USJudgeRatings)

參考

D. J. Best & D. E. Roberts (1975). Algorithm AS 89: The Upper Tail Probabilities of Spearman's \rho. Applied Statistics, 24, 377-379. doi:10.2307/2347111.

Myles Hollander & Douglas A. Wolfe (1973), Nonparametric Statistical Methods. New York: John Wiley & Sons. Pages 185-194 (Kendall and Spearman tests).

也可以看看

Kendall 位於包 Kendall 中。

pKendall 和 pSpearman 位於包 SuppDists 中，spearman.test 位於包 pspearman 中，它們提供不同的(通常更準確)近似值。

相關用法

注：本文由純淨天空篩選整理自R-devel大神的英文原創作品 Test for Association/Correlation Between Paired Samples。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。