R stringr str_length 计算长度/宽度

str_length() 返回字符串中的代码点数量。这些是可以使用 str_sub() 提取的各个元素(通常是但不总是字母)。

str_width() 返回以固定宽度字体打印时(即在控制台中打印时)字符串将占用多少空间。

用法

str_length(string)

str_width(string)

参数

string: 输入向量。或者是一个字符向量，或者是可强制转换为一个的东西。

值

与 string 长度相同的数值向量。

也可以看看

该函数包装的stringi::stri_length()。

例子

str_length(letters)
#>  [1] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
str_length(NA)
#> [1] NA
str_length(factor("abc"))
#> [1] 3
str_length(c("i", "like", "programming", NA))
#> [1]  1  4 11 NA

# Some characters, like emoji and Chinese characters (hanzi), are square
# which means they take up the width of two Latin characters
x <- c("\u6c49\u5b57", "\U0001f60a")
str_view(x)
#> [1] │ 汉字
#> [2] │ 😊
str_width(x)
#> [1] 4 2
str_length(x)
#> [1] 2 1

# There are two ways of representing a u with an umlaut
u <- c("\u00fc", "u\u0308")
# They have the same width
str_width(u)
#> [1] 1 1
# But a different length
str_length(u)
#> [1] 1 2
# Because the second element is made up of a u + an accent
str_sub(u, 1, 1)
#> [1] "ü" "u"

源代码：R/length.R

相关用法

注：本文由纯净天空筛选整理自Hadley Wickham等大神的英文原创作品 Compute the length/width。非经特殊声明，原始代码版权归原作者所有，本译文未经允许或授权，请勿转载或复制。