R stringr word 从句子中提取单词

从句子中提取单词

用法

word(string, start = 1L, end = start, sep = fixed(" "))

参数

string

输入向量。或者是一个字符向量，或者是可强制转换为一个的东西。

start, end

给出要提取的单词范围(包括)的整数向量对。如果为负数，则从最后一个单词开始倒数。

默认值选择第一个单词。

sep

单词之间的分隔符。默认为单个空格。

值

与 string /start /end 长度相同的字符向量。

例子

sentences <- c("Jane saw a cat", "Jane sat down")
word(sentences, 1)
#> [1] "Jane" "Jane"
word(sentences, 2)
#> [1] "saw" "sat"
word(sentences, -1)
#> [1] "cat"  "down"
word(sentences, 2, -1)
#> [1] "saw a cat" "sat down" 

# Also vectorised over start and end
word(sentences[1], 1:3, -1)
#> [1] "Jane saw a cat" "saw a cat"      "a cat"         
word(sentences[1], 1, 1:4)
#> [1] "Jane"           "Jane saw"       "Jane saw a"     "Jane saw a cat"

# Can define words by other separators
str <- 'abc.def..123.4568.999'
word(str, 1, sep = fixed('..'))
#> [1] "abc.def"
word(str, 2, sep = fixed('..'))
#> [1] "123.4568.999"

源代码：R/word.R

相关用法

注：本文由纯净天空筛选整理自Hadley Wickham等大神的英文原创作品 Extract words from a sentence。非经特殊声明，原始代码版权归原作者所有，本译文未经允许或授权，请勿转载或复制。