R survreg 參數生存模型的回歸

R語言 survreg 位於 survival 包(package)。

說明

擬合參數生存回歸模型。這些是時間變量任意變換的位置尺度模型；最常見的情況是使用對數轉換，從而導致模型的故障時間加速。

用法

survreg(formula, data, weights, subset, 
        na.action, dist="weibull", init=NULL, scale=0, 
        control,parms=NULL,model=FALSE, x=FALSE,
        y=TRUE, robust=FALSE, cluster, score=FALSE, ...)

參數

`formula`	與其他回歸模型一樣的公式表達式。響應通常是 `Surv` 函數返回的生存對象。有關詳細信息，請參閱 `Surv` 、 `lm` 和 `formula` 的文檔。
`data`	一個數據幀，用於解釋 `formula` 、 `weights` 或 `subset` 參數中指定的變量。
`weights`	案例權重的可選向量
`subset`	擬合中使用的觀測值的子集
`na.action`	缺失數據過濾器函數，在使用任何 `subset` 參數後應用於 model.frame。默認為 `options()\$na.action` 。
`dist`	y 變量的假設分布。如果參數是字符串，則假定命名 `survreg.distributions` 中的元素。其中包括 `"weibull"` 、 `"exponential"` 、 `"gaussian"` 、 `"logistic"` 、 `"lognormal"` 和 `"loglogistic"` 。否則，假定是符合 `survreg.distributions` 中說明的格式的用戶定義列表。
`parms`	固定參數列表。例如，對於 t 分布，這是自由度；大多數分布沒有參數。
`init`	參數初始值的可選向量。
`scale`	比例的可選固定值。如果設置為 <=0，則估計比例。
`control`	控製值列表，采用 `survreg.control` 生成的格式。默認值為`survreg.control()`
`model` , `x` , `y`	控製返回內容的標誌。如果其中任何一個為真，則模型框架、模型矩陣和/或響應時間向量將作為最終結果的組成部分返回，並具有與標誌參數相同的名稱。
`score`	返回分數向量。 (成功收斂後，預計該值為零。)
`robust`	使用魯棒夾心誤差代替漸近公式。如果存在 `cluster` 參數，則默認為 TRUE。
`cluster`	用於識別受試者組的可選變量，用於計算穩健方差。與 `model` 變量一樣，它是在 `data` 參數指向的數據集中搜索的。
`...`	其他參數將傳遞給 `survreg.control` 。

細節

所有分布都被納入基於 Kalbfleisch 和 Prentice 第 2.2 章的位置尺度框架中。所得分布的參數化有時(例如高斯)與統計教科書中常見的形式相同，但有時(例如威布爾)則不然。詳細公式請參見書本。

使用權重時，請注意複製權重和采樣權重之間的差異。在前者中，權重 '2' 意味著有兩個相同的觀測值，它們已合並為單行數據。對於抽樣權重，有一個單一的觀察值，權重用於實現某些總體的平衡。要獲得複製權重的適當方差，請使用默認方差，對於采樣權重，請使用穩健方差。複製權重曾經很常見(當計算機內存小得多時)，但現在很少見。

值

返回類survreg 的對象。

例子

# Fit an exponential model: the two fits are the same
survreg(Surv(futime, fustat) ~ ecog.ps + rx, ovarian, dist='weibull',
                                    scale=1)
survreg(Surv(futime, fustat) ~ ecog.ps + rx, ovarian,
        dist="exponential")

#
# A model with different baseline survival shapes for two groups, i.e.,
#   two different scale parameters
survreg(Surv(time, status) ~ ph.ecog + age + strata(sex), lung)

# There are multiple ways to parameterize a Weibull distribution. The survreg 
# function embeds it in a general location-scale family, which is a 
# different parameterization than the rweibull function, and often leads
# to confusion.
#   survreg's scale  =    1/(rweibull shape)
#   survreg's intercept = log(rweibull scale)
#   For the log-likelihood all parameterizations lead to the same value.
y <- rweibull(1000, shape=2, scale=5)
survreg(Surv(y)~1, dist="weibull")

# Economists fit a model called `tobit regression', which is a standard
# linear regression with Gaussian errors, and left censored data.
tobinfit <- survreg(Surv(durable, durable>0, type='left') ~ age + quant,
	            data=tobin, dist='gaussian')

參考

Kalbfleisch, J. D. and Prentice, R. L., The statistical analysis of failure time data, Wiley, 2002.

也可以看看

survreg.object , survreg.distributions , pspline , frailty , ridge

相關用法

注：本文由純淨天空篩選整理自R-devel大神的英文原創作品 Regression for a Parametric Survival Model。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。