R survreg 参数生存模型的回归

R语言 survreg 位于 survival 包(package)。

说明

拟合参数生存回归模型。这些是时间变量任意变换的位置尺度模型；最常见的情况是使用对数转换，从而导致模型的故障时间加速。

用法

survreg(formula, data, weights, subset, 
        na.action, dist="weibull", init=NULL, scale=0, 
        control,parms=NULL,model=FALSE, x=FALSE,
        y=TRUE, robust=FALSE, cluster, score=FALSE, ...)

参数

`formula`	与其他回归模型一样的公式表达式。响应通常是 `Surv` 函数返回的生存对象。有关详细信息，请参阅 `Surv` 、 `lm` 和 `formula` 的文档。
`data`	一个数据帧，用于解释 `formula` 、 `weights` 或 `subset` 参数中指定的变量。
`weights`	案例权重的可选向量
`subset`	拟合中使用的观测值的子集
`na.action`	缺失数据过滤器函数，在使用任何 `subset` 参数后应用于 model.frame。默认为 `options()\$na.action` 。
`dist`	y 变量的假设分布。如果参数是字符串，则假定命名 `survreg.distributions` 中的元素。其中包括 `"weibull"` 、 `"exponential"` 、 `"gaussian"` 、 `"logistic"` 、 `"lognormal"` 和 `"loglogistic"` 。否则，假定是符合 `survreg.distributions` 中说明的格式的用户定义列表。
`parms`	固定参数列表。例如，对于 t 分布，这是自由度；大多数分布没有参数。
`init`	参数初始值的可选向量。
`scale`	比例的可选固定值。如果设置为 <=0，则估计比例。
`control`	控制值列表，采用 `survreg.control` 生成的格式。默认值为`survreg.control()`
`model` , `x` , `y`	控制返回内容的标志。如果其中任何一个为真，则模型框架、模型矩阵和/或响应时间向量将作为最终结果的组成部分返回，并具有与标志参数相同的名称。
`score`	返回分数向量。 (成功收敛后，预计该值为零。)
`robust`	使用鲁棒夹心误差代替渐近公式。如果存在 `cluster` 参数，则默认为 TRUE。
`cluster`	用于识别受试者组的可选变量，用于计算稳健方差。与 `model` 变量一样，它是在 `data` 参数指向的数据集中搜索的。
`...`	其他参数将传递给 `survreg.control` 。

细节

所有分布都被纳入基于 Kalbfleisch 和 Prentice 第 2.2 章的位置尺度框架中。所得分布的参数化有时(例如高斯)与统计教科书中常见的形式相同，但有时(例如威布尔)则不然。详细公式请参见书本。

使用权重时，请注意复制权重和采样权重之间的差异。在前者中，权重 '2' 意味着有两个相同的观测值，它们已合并为单行数据。对于抽样权重，有一个单一的观察值，权重用于实现某些总体的平衡。要获得复制权重的适当方差，请使用默认方差，对于采样权重，请使用稳健方差。复制权重曾经很常见(当计算机内存小得多时)，但现在很少见。

值

返回类survreg 的对象。

例子

# Fit an exponential model: the two fits are the same
survreg(Surv(futime, fustat) ~ ecog.ps + rx, ovarian, dist='weibull',
                                    scale=1)
survreg(Surv(futime, fustat) ~ ecog.ps + rx, ovarian,
        dist="exponential")

#
# A model with different baseline survival shapes for two groups, i.e.,
#   two different scale parameters
survreg(Surv(time, status) ~ ph.ecog + age + strata(sex), lung)

# There are multiple ways to parameterize a Weibull distribution. The survreg 
# function embeds it in a general location-scale family, which is a 
# different parameterization than the rweibull function, and often leads
# to confusion.
#   survreg's scale  =    1/(rweibull shape)
#   survreg's intercept = log(rweibull scale)
#   For the log-likelihood all parameterizations lead to the same value.
y <- rweibull(1000, shape=2, scale=5)
survreg(Surv(y)~1, dist="weibull")

# Economists fit a model called `tobit regression', which is a standard
# linear regression with Gaussian errors, and left censored data.
tobinfit <- survreg(Surv(durable, durable>0, type='left') ~ age + quant,
	            data=tobin, dist='gaussian')

参考

Kalbfleisch, J. D. and Prentice, R. L., The statistical analysis of failure time data, Wiley, 2002.

也可以看看

survreg.object , survreg.distributions , pspline , frailty , ridge

相关用法

注：本文由纯净天空筛选整理自R-devel大神的英文原创作品 Regression for a Parametric Survival Model。非经特殊声明，原始代码版权归原作者所有，本译文未经允许或授权，请勿转载或复制。