当前位置: 首页>>代码示例>>Java>>正文


Java MultiStep类代码示例

本文整理汇总了Java中org.galagosearch.tupleflow.execution.MultiStep的典型用法代码示例。如果您正苦于以下问题:Java MultiStep类的具体用法?Java MultiStep怎么用?Java MultiStep使用的例子?那么, 这里精选的类代码示例或许可以为您提供帮助。


MultiStep类属于org.galagosearch.tupleflow.execution包,在下文中一共展示了MultiStep类的2个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Java代码示例。

示例1: getParseLinksStage

import org.galagosearch.tupleflow.execution.MultiStep; //导入依赖的package包/类
public Stage getParseLinksStage() {
    Stage stage = new Stage("parseLinks");

    stage.add(new StageConnectionPoint(
            ConnectionPointType.Input,
            "splits", new DocumentSplit.FileNameStartKeyOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "links", new ExtractedLink.DestUrlOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "documentUrls", new DocumentData.UrlOrder()));

    stage.add(new InputStep("splits"));
    stage.add(new Step(UniversalParser.class));
    stage.add(new Step(TagTokenizer.class));

    MultiStep multi = new MultiStep();
    ArrayList<Step> links =
            getExtractionSteps("links", LinkExtractor.class, new ExtractedLink.DestUrlOrder());
    ArrayList<Step> data =
            getExtractionSteps("documentUrls", DocumentDataExtractor.class,
                               new DocumentData.UrlOrder());

    multi.groups.add(links);
    multi.groups.add(data);
    stage.add(multi);

    return stage;
}
 
开发者ID:jjfiv,项目名称:galagosearch,代码行数:31,代码来源:BuildIndex.java

示例2: getParsePostingsStage

import org.galagosearch.tupleflow.execution.MultiStep; //导入依赖的package包/类
public Stage getParsePostingsStage() {
    Stage stage = new Stage("parsePostings");

    stage.add(new StageConnectionPoint(
            ConnectionPointType.Input,
            "splits", new DocumentSplit.FileNameStartKeyOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "postings", new DocumentWordPosition.DocumentWordPositionOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "extents", new DocumentExtent.IdentifierOrder()));
    stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "documentData", new DocumentData.IdentifierOrder()));
    if (stemming) {
        stage.add(new StageConnectionPoint(
            ConnectionPointType.Output,
            "stemmedPostings", new DocumentWordPosition.DocumentWordPositionOrder()));
    }
    if (useLinks) {
        stage.add(new StageConnectionPoint(
            ConnectionPointType.Input,
            "anchorText", new AdditionalDocumentText.IdentifierOrder()));
    }

    stage.add(new InputStep("splits"));
    stage.add(new Step(UniversalParser.class));
    if (useLinks) {
        Parameters p = new Parameters();
        p.add("textSource", "anchorText");
        stage.add(new Step(AdditionalTextCombiner.class, p));
    }
    stage.add(new Step(TagTokenizer.class));

    MultiStep multi = new MultiStep();
    ArrayList<Step> text =
            getExtractionSteps("postings", PostingsPositionExtractor.class,
                               new DocumentWordPosition.DocumentWordPositionOrder());
    ArrayList<Step> extents =
            getExtractionSteps("extents", ExtentExtractor.class,
                               new DocumentExtent.IdentifierOrder());
    ArrayList<Step> documentData =
            getExtractionSteps("documentData", DocumentDataExtractor.class,
                               new DocumentData.IdentifierOrder());

    multi.groups.add(text);
    multi.groups.add(extents);
    multi.groups.add(documentData);

    if (stemming) {
        ArrayList<Step> stemmedSteps = new ArrayList<Step>();
        stemmedSteps.add(new Step(Porter2Stemmer.class));
        stemmedSteps.add(new Step(PostingsPositionExtractor.class));
        stemmedSteps.add(Utility.getSorter(new DocumentWordPosition.DocumentWordPositionOrder()));
        stemmedSteps.add(new OutputStep("stemmedPostings"));
        multi.groups.add(stemmedSteps);
    }

    stage.add(multi);
    return stage;
}
 
开发者ID:jjfiv,项目名称:galagosearch,代码行数:63,代码来源:BuildIndex.java


注:本文中的org.galagosearch.tupleflow.execution.MultiStep类示例由纯净天空整理自Github/MSDocs等开源代码及文档管理平台,相关代码片段筛选自各路编程大神贡献的开源项目,源码版权归原作者所有,传播和使用请参考对应项目的License;未经允许,请勿转载。