R dplyr n_distinct 計算獨特的組合

n_distinct() 計算一組一個或多個向量中唯一/不同組合的數量。它更快、更簡潔，相當於 nrow(unique(data.frame(...))) 。

用法

n_distinct(..., na.rm = FALSE)

參數

...: 未命名的向量。如果提供多個向量，那麽它們應該具有相同的長度。
na.rm: 如果是 TRUE ，則從計數中排除缺失的觀測值。如果 ... 中有多個向量，如果缺少任何值，則將排除觀察結果。

值

一個數字。

例子

x <- c(1, 1, 2, 2, 2)
n_distinct(x)
#> [1] 2

y <- c(3, 3, NA, 3, 3)
n_distinct(y)
#> [1] 2
n_distinct(y, na.rm = TRUE)
#> [1] 1

# Pairs (1, 3), (2, 3), and (2, NA) are distinct
n_distinct(x, y)
#> [1] 3

# (2, NA) is dropped, leaving 2 distinct combinations
n_distinct(x, y, na.rm = TRUE)
#> [1] 2

# Also works with data frames
n_distinct(data.frame(x, y))
#> [1] 3

源代碼：R/n-distinct.R

相關用法

注：本文由純淨天空篩選整理自Hadley Wickham等大神的英文原創作品 Count unique combinations。非經特殊聲明，原始代碼版權歸原作者所有，本譯文未經允許或授權，請勿轉載或複製。