本文整理汇总了Java中edu.stanford.nlp.io.NumberRangeFileFilter类的典型用法代码示例。如果您正苦于以下问题:Java NumberRangeFileFilter类的具体用法?Java NumberRangeFileFilter怎么用?Java NumberRangeFileFilter使用的例子?那么恭喜您, 这里精选的类代码示例或许可以为您提供帮助。
NumberRangeFileFilter类属于edu.stanford.nlp.io包,在下文中一共展示了NumberRangeFileFilter类的2个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Java代码示例。
示例1: getTrees
import edu.stanford.nlp.io.NumberRangeFileFilter; //导入依赖的package包/类
public static List<Tree> getTrees(String path, int low, int high, int minLength, int maxLength) {
Treebank treebank = new DiskTreebank(new TreeReaderFactory() {
public TreeReader newTreeReader(Reader in) {
return new PennTreeReader(in, new LabeledScoredTreeFactory(new WordFactory()), new BobChrisTreeNormalizer());
}
});
treebank.loadPath(path, new NumberRangeFileFilter(low, high, true));
List<Tree> trees = new ArrayList<Tree>();
for (Tree tree : treebank) {
if (tree.yield().size() <= maxLength && tree.yield().size() >= minLength) {
trees.add(tree);
}
}
return trees;
}
示例2: getEnglishSplitCategories
import edu.stanford.nlp.io.NumberRangeFileFilter; //导入依赖的package包/类
/**
* This is hardwired to calculate the split categories from English
* Penn Treebank sections 2-21 with a default cutoff of 300 (as used
* in ACL03PCFG). It was added to upgrading of code in cases where no
* Treebank was available, and the pre-stored list was being used).
*/
public static Set<String> getEnglishSplitCategories(String treebankRoot) {
TreebankLangParserParams tlpParams = new EnglishTreebankParserParams();
Treebank trees = tlpParams.memoryTreebank();
trees.loadPath(treebankRoot, new NumberRangeFileFilter(200, 2199, true));
return getSplitCategories(trees, 300.0, tlpParams.treebankLanguagePack());
}