本文整理汇总了Java中org.apache.lucene.analysis.en.EnglishMinimalStemFilter类的典型用法代码示例。如果您正苦于以下问题:Java EnglishMinimalStemFilter类的具体用法?Java EnglishMinimalStemFilter怎么用?Java EnglishMinimalStemFilter使用的例子?那么恭喜您, 这里精选的类代码示例或许可以为您提供帮助。
EnglishMinimalStemFilter类属于org.apache.lucene.analysis.en包,在下文中一共展示了EnglishMinimalStemFilter类的2个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Java代码示例。
示例1: annotate
import org.apache.lucene.analysis.en.EnglishMinimalStemFilter; //导入依赖的package包/类
@Override
public List<Annotation> annotate(String text) throws Exception {
text = SimpleTokenizer.format(text);
Analyzer analyser = new EnglishAnalyzer(Version.LUCENE_47, CharArraySet.EMPTY_SET);
TokenFilter filter = new EnglishMinimalStemFilter(analyser.tokenStream("text", new StringReader(text)));
List<Annotation> out = Lists.newArrayList();
while (filter.incrementToken()) {
CharTermAttribute az = filter.getAttribute(CharTermAttribute.class);
OffsetAttribute o = filter.getAttribute(OffsetAttribute.class);
String token = text.substring(o.startOffset(), o.endOffset());
String lemma = az.toString();
Annotation t = new Annotation();
t.setForm(token);
t.setLemma(lemma);
out.add(t);
}
if (out.size() == 0) {
log.debug("Input string is empty");
}
filter.close();
analyser.close();
return out;
}
示例2: create
import org.apache.lucene.analysis.en.EnglishMinimalStemFilter; //导入依赖的package包/类
@Override
public TokenStream create(TokenStream input) {
return new EnglishMinimalStemFilter(input);
}