当前位置: 首页>>代码示例>>Java>>正文


Java AdditionalTextCombiner类代码示例

本文整理汇总了Java中org.galagosearch.core.parse.AdditionalTextCombiner的典型用法代码示例。如果您正苦于以下问题:Java AdditionalTextCombiner类的具体用法?Java AdditionalTextCombiner怎么用?Java AdditionalTextCombiner使用的例子?那么, 这里精选的类代码示例或许可以为您提供帮助。


AdditionalTextCombiner类属于org.galagosearch.core.parse包,在下文中一共展示了AdditionalTextCombiner类的1个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Java代码示例。

示例1: getParsePostingsStage

import org.galagosearch.core.parse.AdditionalTextCombiner; //导入依赖的package包/类
public Stage getParsePostingsStage() {
    Stage stage = new Stage("parsePostings");

    stage.add(new StageConnectionPoint(
            ConnectionPointType.Input,
            "splits", new DocumentSplit.FileNameStartKeyOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "postings", new DocumentWordPosition.DocumentWordPositionOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "extents", new DocumentExtent.IdentifierOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "documentData", new DocumentData.IdentifierOrder()));
    if (stemming) {
        stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "stemmedPostings", new DocumentWordPosition.DocumentWordPositionOrder()));
    }
    if (useLinks) {
        stage.add(new StageConnectionPoint(
            ConnectionPointType.Input,
            "anchorText", new AdditionalDocumentText.IdentifierOrder()));
    }

    stage.add(new InputStep("splits"));
    stage.add(new Step(UniversalParser.class));
    if (useLinks) {
        Parameters p = new Parameters();
        p.add("textSource", "anchorText");
        stage.add(new Step(AdditionalTextCombiner.class, p));
    }
    stage.add(new Step(TagTokenizer.class));

    MultiStep multi = new MultiStep();
    ArrayList<Step> text =
            getExtractionSteps("postings", PostingsPositionExtractor.class,
                               new DocumentWordPosition.DocumentWordPositionOrder());
    ArrayList<Step> extents =
            getExtractionSteps("extents", ExtentExtractor.class,
                               new DocumentExtent.IdentifierOrder());
    ArrayList<Step> documentData =
            getExtractionSteps("documentData", DocumentDataExtractor.class,
                               new DocumentData.IdentifierOrder());

    multi.groups.add(text);
    multi.groups.add(extents);
    multi.groups.add(documentData);

    if (stemming) {
        ArrayList<Step> stemmedSteps = new ArrayList<Step>();
        stemmedSteps.add(new Step(Porter2Stemmer.class));
        stemmedSteps.add(new Step(PostingsPositionExtractor.class));
        stemmedSteps.add(Utility.getSorter(new DocumentWordPosition.DocumentWordPositionOrder()));
        stemmedSteps.add(new OutputStep("stemmedPostings"));
        multi.groups.add(stemmedSteps);
    }

    stage.add(multi);
    return stage;
}
 
开发者ID:jjfiv,项目名称:galagosearch,代码行数:63,代码来源:BuildIndex.java


注:本文中的org.galagosearch.core.parse.AdditionalTextCombiner类示例由纯净天空整理自Github/MSDocs等开源代码及文档管理平台,相关代码片段筛选自各路编程大神贡献的开源项目,源码版权归原作者所有,传播和使用请参考对应项目的License;未经允许,请勿转载。