R predict.gam 根据拟合的 GAM 模型进行预测

R语言 predict.gam 位于 mgcv 包(package)。

说明

采用 gam() 生成的拟合 gam 对象，并在给定模型协变量的一组新值或用于模型拟合的原始值的情况下生成预测。基于模型系数的后验分布，预测可能伴随有标准误差。该例程可以选择返回矩阵，模型系数必须预先乘以该矩阵，以便在提供的协变量值处产生线性预测变量的值：这对于获得从模型导出的数量的可信区域(例如，平滑)，以及 R 之外的查找表预测(请参见下面的示例代码)。

用法

## S3 method for class 'gam'
predict(object,newdata,type="link",se.fit=FALSE,terms=NULL,
        exclude=NULL,block.size=NULL,newdata.guaranteed=FALSE,
        na.action=na.pass,unconditional=FALSE,iterms.type=NULL,...)

参数

`object`	由 `gam()` 生成的拟合 `gam` 对象。
`newdata`	包含需要预测的模型协变量值的 DataFrame 或列表。如果未提供，则返回与原始数据相对应的预测。如果提供了`newdata`，那么它应该包含预测所需的所有变量：如果没有，则会生成警告。请参阅与 `link{linear.functional.terms}` 一起使用的详细信息。
`type`	当其值为`"link"`(默认)时，返回线性预测器(可能具有相关的标准误差)。当`type="terms"` 线性预测器的每个分量单独返回(可能带有标准误差)：这包括参数模型分量，后面是每个平滑分量，但不包括任何偏移和任何截距。 `type="iterms"` 是相同的，只是为平滑分量返回的任何标准误差将包括截距/总体平均值的不确定性。当返回 `type="response"` 对响应规模的预测时(可能具有近似标准误差)。当 `type="lpmatrix"` 时，返回一个矩阵，该矩阵在后乘于参数向量时产生线性预测器的值(减去任何偏移)(在这种情况下 `se.fit` 被忽略)。后一个选项对于获取从模型导出的数量的方差估计最有用：例如积分数量或平滑的导数。线性预测矩阵也可用于在 `R` 之外实现近似预测(请参见下面的示例代码)。
`se.fit`	当此值为 TRUE(非默认值)时，将为每个预测返回标准误差估计值。
`terms`	如果`type=="terms"` 或`type="iterms"` 则仅返回此数组中指定的项(平滑或参数)的结果。否则，该数组中未命名的任何术语都将设置为零。如果`NULL`则包含所有术语。
`exclude`	如果 `type=="terms"` 或 `type="iterms"` 则不会返回此数组中指定的项(平滑或参数)。否则，该数组中命名的任何术语都将设置为零。如果`NULL`，则不排除任何术语。请注意，这是模型摘要中显示的术语名称，请参阅示例。您可以通过设置 `newdata.guaranteed=TRUE` 来避免为排除的平滑项提供协变量，这将避免对 `newdata` 进行所有检查(不能跳过参数项的协变量)。
`block.size`	每次调用底层代码时要处理的最大预测数：越大速度越快，但内存消耗更大。设置为 < 1 以使用预测总数。如果`NULL`，则如果提供了新数据，则块大小为 1000，否则为模型框架中的行数。
`newdata.guaranteed`	设置为 `TRUE` 以关闭对 `newdata` 的所有检查，除了因子水平的健全性之外：这可以加快大型预测任务的速度，但 `newdata` 必须完整，并且不需要预测变量的 `NA` 值。模型。
`na.action`	如何处理 `newdata` 中的 `NA` 值。使用默认值 `na.pass` ，`newdata` 中包含所需预测变量的 `NA` 值的任何行都会产生 `NA` 预测(即使相关术语没有 `NA` 预测变量)。 `na.exclude` 或 `na.omit` 会导致删除 `newdata` 行(如果它们包含所需预测变量的任何 `NA` 值)。如果 `newdata` 缺失，则 `NA` 处理由 `object$na.action` 确定。
`unconditional`	如果`TRUE`，则使用平滑参数不确定性校正协方差矩阵(如果可用)，否则使用以估计平滑参数为条件的协方差矩阵。
`iterms.type`	如果 `type="iterms"` 则标准误差可以包括总体平均值的不确定性(默认、包括固定效应和随机效应)或仅包含非平滑固定效应平均值的不确定性(`iterms.type=2`)。
`...`	其他论点。

细节

predict.gam 产生的标准误差基于拟合 gam 对象中参数 Vp 的贝叶斯后验协方差矩阵。

当使用 linear.functional.terms 模型进行预测时，有两种可能性。如果要在预测中使用求和约定，就像在拟合中一样，那么 newdata 应该是一个列表，其中命名的矩阵参数对应于拟合中作为矩阵的任何变量。或者，人们可以选择简单地评估特定值的构成平滑，在这种情况下，矩阵参数可以用向量替换(并且 newdata 可以是数据帧)。有关示例代码，请参阅linear.functional.terms。

为了便于使用 termplot 进行绘图，如果 object 拥有属性 "para.only" 和 type=="terms"，则仅返回 1 阶参数项(即 termplot 可以处理的参数项)。

请注意，与其他预测函数一样，与 gam 模型公式中指定的偏移量不同，在预测时始终忽略作为参数提供给 gam 的任何偏移量。

请参阅示例，了解如何使用 lpmatrix 获取从模型导出的数量的可信区域。

值

如果type=="lpmatrix"，则返回一个矩阵，当应用于模型系数向量时，该矩阵将在所提供的协变量值处给出线性预测变量值(减去任何偏移)的向量。否则，如果 se.fit 是 TRUE，则返回 2 项列表，其中项(两个数组)fit 和 se.fit 包含预测和关联的标准误差估计，否则返回预测数组。返回数组的维度取决于 type 是否是 "terms"：如果是，则数组是二维的，线性预测器中的每个项都是单独的，否则数组是一维的，包含线性预测器/预测值(或相应的 s.e.s)。按项返回的线性预测器将不包括偏移量或截距。

newdata 可以是 DataFrame 架、列表或 model.frame：如果它是模型框架，则必须提供所有变量。

警告

如果在平滑调用中使用协变量的数据相关变换，则预测可能不正确。请参阅示例。

请注意，此函数的行为与 Splus 中的predict.gam() 不同。

type=="terms" 与 predict.lm 对参数模型组件所做的并不完全匹配。

例子

library(mgcv)
n <- 200
sig <- 2
dat <- gamSim(1,n=n,scale=sig)

b <- gam(y~s(x0)+s(I(x1^2))+s(x2)+offset(x3),data=dat)

newd <- data.frame(x0=(0:30)/30,x1=(0:30)/30,x2=(0:30)/30,x3=(0:30)/30)
pred <- predict.gam(b,newd)
pred0 <- predict(b,newd,exclude="s(x0)") ## prediction excluding a term
## ...and the same, but without needing to provide x0 prediction data...
newd1 <- newd;newd1$x0 <- NULL ## remove x0 from `newd1'
pred1 <- predict(b,newd1,exclude="s(x0)",newdata.guaranteed=TRUE)

## custom perspective plot...

m1 <- 20;m2 <- 30; n <- m1*m2
x1 <- seq(.2,.8,length=m1);x2 <- seq(.2,.8,length=m2) ## marginal grid points
df <- data.frame(x0=rep(.5,n),x1=rep(x1,m2),x2=rep(x2,each=m1),x3=rep(0,n))
pf <- predict(b,newdata=df,type="terms")
persp(x1,x2,matrix(pf[,2]+pf[,3],m1,m2),theta=-130,col="blue",zlab="")

#############################################
## difference between "terms" and "iterms"
#############################################
nd2 <- data.frame(x0=c(.25,.5),x1=c(.25,.5),x2=c(.25,.5),x3=c(.25,.5))
predict(b,nd2,type="terms",se=TRUE)
predict(b,nd2,type="iterms",se=TRUE)

#########################################################
## now get variance of sum of predictions using lpmatrix
#########################################################

Xp <- predict(b,newd,type="lpmatrix") 

## Xp %*% coef(b) yields vector of predictions

a <- rep(1,31)
Xs <- t(a) %*% Xp ## Xs %*% coef(b) gives sum of predictions
var.sum <- Xs %*% b$Vp %*% t(Xs)


#############################################################
## Now get the variance of non-linear function of predictions
## by simulation from posterior distribution of the params
#############################################################

rmvn <- function(n,mu,sig) { ## MVN random deviates
  L <- mroot(sig);m <- ncol(L);
  t(mu + L%*%matrix(rnorm(m*n),m,n)) 
}

br <- rmvn(1000,coef(b),b$Vp) ## 1000 replicate param. vectors
res <- rep(0,1000)
for (i in 1:1000)
{ pr <- Xp %*% br[i,] ## replicate predictions
  res[i] <- sum(log(abs(pr))) ## example non-linear function
}
mean(res);var(res)

## loop is replace-able by following .... 

res <- colSums(log(abs(Xp %*% t(br))))



##################################################################
## The following shows how to use use an "lpmatrix" as a lookup 
## table for approximate prediction. The idea is to create 
## approximate prediction matrix rows by appropriate linear 
## interpolation of an existing prediction matrix. The additivity 
## of a GAM makes this possible. 
## There is no reason to ever do this in R, but the following 
## code provides a useful template for predicting from a fitted 
## gam *outside* R: all that is needed is the coefficient vector 
## and the prediction matrix. Use larger `Xp'/ smaller `dx' and/or 
## higher order interpolation for higher accuracy.  
###################################################################

xn <- c(.341,.122,.476,.981) ## want prediction at these values
x0 <- 1         ## intercept column
dx <- 1/30      ## covariate spacing in `newd'
for (j in 0:2) { ## loop through smooth terms
  cols <- 1+j*9 +1:9      ## relevant cols of Xp
  i <- floor(xn[j+1]*30)  ## find relevant rows of Xp
  w1 <- (xn[j+1]-i*dx)/dx ## interpolation weights
  ## find approx. predict matrix row portion, by interpolation
  x0 <- c(x0,Xp[i+2,cols]*w1 + Xp[i+1,cols]*(1-w1))
}
dim(x0)<-c(1,28) 
fv <- x0%*%coef(b) + xn[4];fv    ## evaluate and add offset
se <- sqrt(x0%*%b$Vp%*%t(x0));se ## get standard error
## compare to normal prediction
predict(b,newdata=data.frame(x0=xn[1],x1=xn[2],
        x2=xn[3],x3=xn[4]),se=TRUE)

##################################################################
# illustration of unsafe scale dependent transforms in smooths....
##################################################################

b0 <- gam(y~s(x0)+s(x1)+s(x2)+x3,data=dat) ## safe
b1 <- gam(y~s(x0)+s(I(x1/2))+s(x2)+scale(x3),data=dat) ## safe
b2 <- gam(y~s(x0)+s(scale(x1))+s(x2)+scale(x3),data=dat) ## unsafe
pd <- dat; pd$x1 <- pd$x1/2; pd$x3 <- pd$x3/2
par(mfrow=c(1,2))
plot(predict(b0,pd),predict(b1,pd),main="b0 and b1 predictions match")
abline(0,1,col=2)
plot(predict(b0,pd),predict(b2,pd),main="b2 unsafe, doesn't match")
abline(0,1,col=2)


####################################################################
## Differentiating the smooths in a model (with CIs for derivatives)
####################################################################

## simulate data and fit model...
dat <- gamSim(1,n=300,scale=sig)
b<-gam(y~s(x0)+s(x1)+s(x2)+s(x3),data=dat)
plot(b,pages=1)

## now evaluate derivatives of smooths with associated standard 
## errors, by finite differencing...
x.mesh <- seq(0,1,length=200) ## where to evaluate derivatives
newd <- data.frame(x0 = x.mesh,x1 = x.mesh, x2=x.mesh,x3=x.mesh)
X0 <- predict(b,newd,type="lpmatrix") 

eps <- 1e-7 ## finite difference interval
x.mesh <- x.mesh + eps ## shift the evaluation mesh
newd <- data.frame(x0 = x.mesh,x1 = x.mesh, x2=x.mesh,x3=x.mesh)
X1 <- predict(b,newd,type="lpmatrix")

Xp <- (X1-X0)/eps ## maps coefficients to (fd approx.) derivatives
colnames(Xp)      ## can check which cols relate to which smooth

par(mfrow=c(2,2))
for (i in 1:4) {  ## plot derivatives and corresponding CIs
  Xi <- Xp*0 
  Xi[,(i-1)*9+1:9+1] <- Xp[,(i-1)*9+1:9+1] ## Xi%*%coef(b) = smooth deriv i
  df <- Xi%*%coef(b)              ## ith smooth derivative 
  df.sd <- rowSums(Xi%*%b$Vp*Xi)^.5 ## cheap diag(Xi%*%b$Vp%*%t(Xi))^.5
  plot(x.mesh,df,type="l",ylim=range(c(df+2*df.sd,df-2*df.sd)))
  lines(x.mesh,df+2*df.sd,lty=2);lines(x.mesh,df-2*df.sd,lty=2)
}

作者

Simon N. Wood simon.wood@r-project.org

The design is inspired by the S function of the same name described in Chambers and Hastie (1993) (but is not a clone).

参考

Chambers and Hastie (1993) Statistical Models in S. Chapman & Hall.

Marra, G and S.N. Wood (2012) Coverage Properties of Confidence Intervals for Generalized Additive Model Components. Scandinavian Journal of Statistics, 39(1), 53-74. doi:10.1111/j.1467-9469.2011.00760.x

Wood S.N. (2017, 2nd ed) Generalized Additive Models: An Introduction with R. Chapman and Hall/CRC Press. doi:10.1201/9781315370279

也可以看看

gam , gamm , plot.gam

相关用法

注：本文由纯净天空筛选整理自R-devel大神的英文原创作品 Prediction from fitted GAM model。非经特殊声明，原始代码版权归原作者所有，本译文未经允许或授权，请勿转载或复制。