R yardstick roc_aunp 使用先驗類別分布，每個類別相對於其他類別的 ROC 曲線下麵積

roc_aunp() 是一個多類度量，它使用先驗類分布來計算每個類相對於其他類的 ROC 曲線下的麵積。這相當於 roc_auc(estimator = "macro_weighted") 。

用法

roc_aunp(data, ...)

# S3 method for data.frame
roc_aunp(data, truth, ..., na_rm = TRUE, case_weights = NULL, options = list())

roc_aunp_vec(
  truth,
  estimate,
  na_rm = TRUE,
  case_weights = NULL,
  options = list(),
  ...
)

參數

data

包含 truth 和 ... 指定的列的 data.frame 。

...

一組不帶引號的列名稱或一個或多個 dplyr 選擇器函數，用於選擇哪些變量包含類概率。列數應與 truth 的因子級別一樣多。

truth

真實類結果的列標識符(即 factor )。這應該是一個不帶引號的列名，盡管此參數是通過表達式傳遞的並且支持quasiquotation(您可以不帶引號的列名)。對於 _vec() 函數，一個 factor 向量。

na_rm

logical 值，指示在計算繼續之前是否應剝離 NA 值。

case_weights

案例權重的可選列標識符。這應該是一個不帶引號的列名稱，其計算結果為 data 中的數字列。對於 _vec() 函數，一個數值向量。

options

[deprecated]

從尺度 1.0.0 開始不再支持。如果您在此處傳遞某些內容，它將被忽略並發出警告。

以前，這些選項傳遞給 pROC::roc() 。如果您需要對此的支持，請直接使用 pROC 包。

estimate

具有與因子級別一樣多的列的矩陣truth.假設它們的順序與 truth 的級別相同。

值

tibble 包含列 .metric 、 .estimator 和 .estimate 以及 1 行值。

對於分組 DataFrame ，返回的行數將與組數相同。

對於 roc_aunp_vec() ，單個 numeric 值(或 NA )。

多級

這種用於計算 ROC 曲線下麵積的多類方法使用先驗類分布，相當於 roc_auc(estimator = "macro_weighted") 。

參考

Ferri, C.、Hernández-Orallo、J. 和 Modroiu, R. (2009)。 “分類性能測量的實驗比較”。模式識別字母。 30 (1)，第 27-38 頁。

也可以看看

roc_aunu() 用於使用均勻類別分布計算每個類別相對於其餘類別的 ROC 曲線下麵積。

其他類概率指標：average_precision() , brier_class() , classification_cost() , gain_capture() , mn_log_loss() , pr_auc() , roc_auc() , roc_aunu()

作者

Julia·西爾格

例子

# Multiclass example

# `obs` is a 4 level factor. The first level is `"VF"`, which is the
# "event of interest" by default in yardstick. See the Relevant Level
# section above.
data(hpc_cv)

# You can use the col1:colN tidyselect syntax
library(dplyr)
hpc_cv %>%
  filter(Resample == "Fold01") %>%
  roc_aunp(obs, VF:L)
#> # A tibble: 1 × 3
#>   .metric  .estimator .estimate
#>   <chr>    <chr>          <dbl>
#> 1 roc_aunp macro          0.880

# Change the first level of `obs` from `"VF"` to `"M"` to alter the
# event of interest. The class probability columns should be supplied
# in the same order as the levels.
hpc_cv %>%
  filter(Resample == "Fold01") %>%
  mutate(obs = relevel(obs, "M")) %>%
  roc_aunp(obs, M, VF:L)
#> # A tibble: 1 × 3
#>   .metric  .estimator .estimate
#>   <chr>    <chr>          <dbl>
#> 1 roc_aunp macro          0.880

# Groups are respected
hpc_cv %>%
  group_by(Resample) %>%
  roc_aunp(obs, VF:L)
#> # A tibble: 10 × 4
#>    Resample .metric  .estimator .estimate
#>    <chr>    <chr>    <chr>          <dbl>
#>  1 Fold01   roc_aunp macro          0.880
#>  2 Fold02   roc_aunp macro          0.873
#>  3 Fold03   roc_aunp macro          0.906
#>  4 Fold04   roc_aunp macro          0.867
#>  5 Fold05   roc_aunp macro          0.866
#>  6 Fold06   roc_aunp macro          0.865
#>  7 Fold07   roc_aunp macro          0.868
#>  8 Fold08   roc_aunp macro          0.865
#>  9 Fold09   roc_aunp macro          0.841
#> 10 Fold10   roc_aunp macro          0.869

# Vector version
# Supply a matrix of class probabilities
fold1 <- hpc_cv %>%
  filter(Resample == "Fold01")

roc_aunp_vec(
  truth = fold1$obs,
  matrix(
    c(fold1$VF, fold1$F, fold1$M, fold1$L),
    ncol = 4
  )
)
#> [1] 0.8795121

源代碼：R/prob-roc_aunp.R

相關用法

注：本文由純淨天空篩選整理自Max Kuhn等大神的英文原創作品 Area under the ROC curve of each class against the rest, using the a priori class distribution。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。

用法

參數

值

相關級別

多級

參考

也可以看看

作者

例子