当前位置: 首页>>代码示例>>Java>>正文


Java ChineseLexiconAndWordSegmenter类代码示例

本文整理汇总了Java中edu.stanford.nlp.parser.lexparser.ChineseLexiconAndWordSegmenter的典型用法代码示例。如果您正苦于以下问题:Java ChineseLexiconAndWordSegmenter类的具体用法?Java ChineseLexiconAndWordSegmenter怎么用?Java ChineseLexiconAndWordSegmenter使用的例子?那么恭喜您, 这里精选的类代码示例或许可以为您提供帮助。


ChineseLexiconAndWordSegmenter类属于edu.stanford.nlp.parser.lexparser包,在下文中一共展示了ChineseLexiconAndWordSegmenter类的2个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Java代码示例。

示例1: parse

import edu.stanford.nlp.parser.lexparser.ChineseLexiconAndWordSegmenter; //导入依赖的package包/类
/**
 * Tokenizes the highlighted text (using a tokenizer appropriate for the
 * selected language, and initiates the ParseThread to parse the tokenized
 * text.
 */
public void parse() {
  if (textPane.getText().length() == 0) {
    return;
  }

  // use endIndex+1 because substring subtracts 1
  String text = textPane.getText().substring(startIndex, endIndex + 1).trim();

  if (parser != null && text.length() > 0) {
    if (segmentWords) {
      ChineseLexiconAndWordSegmenter lex = (ChineseLexiconAndWordSegmenter) parser.getLexicon();
      ChineseTreebankLanguagePack.setTokenizerFactory(WordSegmentingTokenizer.factory(lex));
    }
    Tokenizer<? extends HasWord> toke = tlp.getTokenizerFactory().getTokenizer(new CharArrayReader(text.toCharArray()));
    List<? extends HasWord> wordList = toke.tokenize();
    parseThread = new ParseThread(wordList);
    parseThread.start();
    startProgressMonitor("Parsing", PARSE_TIME);
  }
}
 
开发者ID:FabianFriedrich,项目名称:Text2Process,代码行数:26,代码来源:ParserPanel.java

示例2: parse

import edu.stanford.nlp.parser.lexparser.ChineseLexiconAndWordSegmenter; //导入依赖的package包/类
/**
 * Tokenizes the highlighted text (using a tokenizer appropriate for the
 * selected language, and initiates the ParseThread to parse the tokenized
 * text.
 */
public void parse() {
  if (textPane.getText().length() == 0) {
    return;
  }

  // use endIndex+1 because substring subtracts 1
  String text = textPane.getText().substring(startIndex, endIndex + 1).trim();

  if (parser != null && text.length() > 0) {
    if (segmentWords) {
      ChineseLexiconAndWordSegmenter lex = (ChineseLexiconAndWordSegmenter) parser.getLexicon();
      ChineseTreebankLanguagePack.setTokenizerFactory(WordSegmentingTokenizer.factory(lex));
    }
    //Tokenizer<? extends HasWord> toke = tlp.getTokenizerFactory().getTokenizer(new CharArrayReader(text.toCharArray()));
    Tokenizer<? extends HasWord> toke = tlp.getTokenizerFactory().getTokenizer(new StringReader(text));
    List<? extends HasWord> wordList = toke.tokenize();
    parseThread = new ParseThread(wordList);
    parseThread.start();
    startProgressMonitor("Parsing", PARSE_TIME);
  }
}
 
开发者ID:amark-india,项目名称:eventspotter,代码行数:27,代码来源:ParserPanel.java


注:本文中的edu.stanford.nlp.parser.lexparser.ChineseLexiconAndWordSegmenter类示例由纯净天空整理自Github/MSDocs等开源代码及文档管理平台,相关代码片段筛选自各路编程大神贡献的开源项目,源码版权归原作者所有,传播和使用请参考对应项目的License;未经允许,请勿转载。