當前位置: 首頁>>代碼示例>>Java>>正文


Java TokenizerFactory類代碼示例

本文整理匯總了Java中org.elasticsearch.index.analysis.TokenizerFactory的典型用法代碼示例。如果您正苦於以下問題:Java TokenizerFactory類的具體用法?Java TokenizerFactory怎麽用?Java TokenizerFactory使用的例子?那麽, 這裏精選的類代碼示例或許可以為您提供幫助。


TokenizerFactory類屬於org.elasticsearch.index.analysis包,在下文中一共展示了TokenizerFactory類的15個代碼示例,這些例子默認根據受歡迎程度排序。您可以為喜歡或者感覺有用的代碼點讚,您的評價將有助於係統推薦出更棒的Java代碼示例。

示例1: getTokenizerFactory

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public synchronized TokenizerFactory getTokenizerFactory(final Version version) {
TokenizerFactory tokenizerFactory = cache.get(version);
if (tokenizerFactory == null) {
    final String finalName = name().toLowerCase(Locale.ROOT);
    if (getMultiTermComponent(version) != null) {
        tokenizerFactory = new MultiTermAwareTokenizerFactory() {
            @Override
            public String name() {
                return finalName;
            }

            @Override
            public Tokenizer create() {
                return PreBuiltTokenizers.this.create(version);
            }

            @Override
            public Object getMultiTermComponent() {
                return PreBuiltTokenizers.this.getMultiTermComponent(version);
            }
        };
    } else {
        tokenizerFactory = new TokenizerFactory() {
 
開發者ID:justor,項目名稱:elasticsearch_my,代碼行數:24,代碼來源:PreBuiltTokenizers.java

示例2: setupTokenizers

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
private NamedRegistry<AnalysisProvider<TokenizerFactory>> setupTokenizers(List<AnalysisPlugin> plugins) {
    NamedRegistry<AnalysisProvider<TokenizerFactory>> tokenizers = new NamedRegistry<>("tokenizer");
    tokenizers.register("standard", StandardTokenizerFactory::new);
    tokenizers.register("uax_url_email", UAX29URLEmailTokenizerFactory::new);
    tokenizers.register("path_hierarchy", PathHierarchyTokenizerFactory::new);
    tokenizers.register("PathHierarchy", PathHierarchyTokenizerFactory::new);
    tokenizers.register("keyword", KeywordTokenizerFactory::new);
    tokenizers.register("letter", LetterTokenizerFactory::new);
    tokenizers.register("lowercase", LowerCaseTokenizerFactory::new);
    tokenizers.register("whitespace", WhitespaceTokenizerFactory::new);
    tokenizers.register("nGram", NGramTokenizerFactory::new);
    tokenizers.register("ngram", NGramTokenizerFactory::new);
    tokenizers.register("edgeNGram", EdgeNGramTokenizerFactory::new);
    tokenizers.register("edge_ngram", EdgeNGramTokenizerFactory::new);
    tokenizers.register("pattern", PatternTokenizerFactory::new);
    tokenizers.register("classic", ClassicTokenizerFactory::new);
    tokenizers.register("thai", ThaiTokenizerFactory::new);
    tokenizers.extractAndRegister(plugins, AnalysisPlugin::getTokenizers);
    return tokenizers;
}
 
開發者ID:justor,項目名稱:elasticsearch_my,代碼行數:21,代碼來源:AnalysisModule.java

示例3: testOpenNLPAnalysis

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
@Test
public void testOpenNLPAnalysis() throws Exception {
    AnalysisService analysisService = MapperTestUtils.analysisService(getResource());
    TokenizerFactory tokenizerFactory = analysisService.tokenizer("opennlp");
    MatcherAssert.assertThat(tokenizerFactory, instanceOf(OpenNLPTokenizerFactory.class));
    this.tokenizerFactory = (OpenNLPTokenizerFactory) tokenizerFactory;
    TokenFilterFactory filterFactory = analysisService.tokenFilter("opennlp");
    MatcherAssert.assertThat(filterFactory, instanceOf(OpenNLPTokenFilterFactory.class));
    this.filterFactory = (OpenNLPTokenFilterFactory) filterFactory;

    TokenFilterFactory filterpayloadsfilterFactory = analysisService.tokenFilter("filterpayloads");
    MatcherAssert.assertThat(filterpayloadsfilterFactory, instanceOf(FilterPayloadsTokenFilterFactory.class));
    TokenFilterFactory strippayloadsfilterFactory = analysisService.tokenFilter("strippayloads");
    MatcherAssert.assertThat(strippayloadsfilterFactory, instanceOf(StripPayloadsTokenFilterFactory.class));

    analyze();
}
 
開發者ID:jprante,項目名稱:elasticsearch-analysis-opennlp,代碼行數:18,代碼來源:BaseOpenNLPAnalysis.java

示例4: ReloadableJapaneseTokenizerFactory

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public ReloadableJapaneseTokenizerFactory(final IndexSettings indexSettings, final Environment env, final String name,
        final Settings settings, final FessAnalysisService fessAnalysisService) {
    super(indexSettings, name, settings);

    for (final String factoryClass : FACTORIES) {
        final Class<?> tokenizerFactoryClass = fessAnalysisService.loadClass(factoryClass);
        if (tokenizerFactoryClass != null) {
            if (logger.isInfoEnabled()) {
                logger.info("{} is found.", factoryClass);
            }
            tokenizerFactory = AccessController.doPrivileged((PrivilegedAction<TokenizerFactory>) () -> {
                try {
                    final Constructor<?> constructor =
                            tokenizerFactoryClass.getConstructor(IndexSettings.class, Environment.class, String.class, Settings.class);
                    return (TokenizerFactory) constructor.newInstance(indexSettings, env, name, settings);
                } catch (final Exception e) {
                    throw new ElasticsearchException("Failed to load " + factoryClass, e);
                }
            });
            break;
        } else if (logger.isDebugEnabled()) {
            logger.debug("{} is not found.", factoryClass);
        }
    }
}
 
開發者ID:codelibs,項目名稱:elasticsearch-analysis-fess,代碼行數:26,代碼來源:ReloadableJapaneseTokenizerFactory.java

示例5: ChineseTokenizerFactory

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public ChineseTokenizerFactory(final IndexSettings indexSettings, final Environment env, final String name, final Settings settings,
        final FessAnalysisService fessAnalysisService) {
    super(indexSettings, name, settings);

    final Class<?> tokenizerFactoryClass = fessAnalysisService.loadClass(FACTORY);
    if (tokenizerFactoryClass != null) {
        if (logger.isInfoEnabled()) {
            logger.info("{} is found.", FACTORY);
        }
        tokenizerFactory = AccessController.doPrivileged((PrivilegedAction<TokenizerFactory>) () -> {
            try {
                final Constructor<?> constructor =
                        tokenizerFactoryClass.getConstructor(IndexSettings.class, Environment.class, String.class, Settings.class);
                return (TokenizerFactory) constructor.newInstance(indexSettings, env, name, settings);
            } catch (final Exception e) {
                throw new ElasticsearchException("Failed to load " + FACTORY, e);
            }
        });
    } else if (logger.isDebugEnabled()) {
        logger.debug("{} is not found.", FACTORY);
    }
}
 
開發者ID:codelibs,項目名稱:elasticsearch-analysis-fess,代碼行數:23,代碼來源:ChineseTokenizerFactory.java

示例6: VietnameseTokenizerFactory

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public VietnameseTokenizerFactory(final IndexSettings indexSettings, final Environment env, final String name, final Settings settings,
        final FessAnalysisService fessAnalysisService) {
    super(indexSettings, name, settings);

    final Class<?> tokenizerFactoryClass = fessAnalysisService.loadClass(FACTORY);
    if (tokenizerFactoryClass != null) {
        if (logger.isInfoEnabled()) {
            logger.info("{} is found.", FACTORY);
        }
        tokenizerFactory = AccessController.doPrivileged((PrivilegedAction<TokenizerFactory>) () -> {
            try {
                final Constructor<?> constructor =
                        tokenizerFactoryClass.getConstructor(IndexSettings.class, Environment.class, String.class, Settings.class);
                return (TokenizerFactory) constructor.newInstance(indexSettings, env, name, settings);
            } catch (final Exception e) {
                throw new ElasticsearchException("Failed to load " + FACTORY, e);
            }
        });
    } else if (logger.isDebugEnabled()) {
        logger.debug("{} is not found.", FACTORY);
    }
}
 
開發者ID:codelibs,項目名稱:elasticsearch-analysis-fess,代碼行數:23,代碼來源:VietnameseTokenizerFactory.java

示例7: JapaneseTokenizerFactory

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public JapaneseTokenizerFactory(final IndexSettings indexSettings, final Environment env, final String name, final Settings settings,
        final FessAnalysisService fessAnalysisService) {
    super(indexSettings, name, settings);

    for (final String factoryClass : FACTORIES) {
        final Class<?> tokenizerFactoryClass = fessAnalysisService.loadClass(factoryClass);
        if (tokenizerFactoryClass != null) {
            if (logger.isInfoEnabled()) {
                logger.info("{} is found.", factoryClass);
            }
            tokenizerFactory = AccessController.doPrivileged((PrivilegedAction<TokenizerFactory>) () -> {
                try {
                    final Constructor<?> constructor =
                            tokenizerFactoryClass.getConstructor(IndexSettings.class, Environment.class, String.class, Settings.class);
                    return (TokenizerFactory) constructor.newInstance(indexSettings, env, name, settings);
                } catch (final Exception e) {
                    throw new ElasticsearchException("Failed to load " + factoryClass, e);
                }
            });
            break;
        } else if (logger.isDebugEnabled()) {
            logger.debug("{} is not found.", factoryClass);
        }
    }
}
 
開發者ID:codelibs,項目名稱:elasticsearch-analysis-fess,代碼行數:26,代碼來源:JapaneseTokenizerFactory.java

示例8: KoreanTokenizerFactory

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public KoreanTokenizerFactory(final IndexSettings indexSettings, final Environment env, final String name, final Settings settings,
        final FessAnalysisService fessAnalysisService) {
    super(indexSettings, name, settings);

    final Class<?> tokenizerFactoryClass = fessAnalysisService.loadClass(FACTORY);
    if (tokenizerFactoryClass != null) {
        if (logger.isInfoEnabled()) {
            logger.info("{} is found.", FACTORY);
        }
        tokenizerFactory = AccessController.doPrivileged((PrivilegedAction<TokenizerFactory>) () -> {
            try {
                final Constructor<?> constructor =
                        tokenizerFactoryClass.getConstructor(IndexSettings.class, Environment.class, String.class, Settings.class);
                return (TokenizerFactory) constructor.newInstance(indexSettings, env, name, settings);
            } catch (final Exception e) {
                throw new ElasticsearchException("Failed to load " + FACTORY, e);
            }
        });
    } else if (logger.isDebugEnabled()) {
        logger.debug("{} is not found.", FACTORY);
    }
}
 
開發者ID:codelibs,項目名稱:elasticsearch-analysis-fess,代碼行數:23,代碼來源:KoreanTokenizerFactory.java

示例9: getTokenizers

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
@Override
public Map<String, AnalysisProvider<TokenizerFactory>> getTokenizers() {
    final Map<String, AnalysisProvider<TokenizerFactory>> extra = new HashMap<>();
    extra.put("fess_japanese_tokenizer", (indexSettings, env, name, settings) -> new JapaneseTokenizerFactory(indexSettings, env, name,
            settings, pluginComponent.getFessAnalysisService()));
    extra.put("fess_japanese_reloadable_tokenizer",
            (indexSettings, env, name, settings) -> new ReloadableJapaneseTokenizerFactory(indexSettings, env, name, settings,
                    pluginComponent.getFessAnalysisService()));
    extra.put("fess_korean_tokenizer", (indexSettings, env, name, settings) -> new KoreanTokenizerFactory(indexSettings, env, name,
            settings, pluginComponent.getFessAnalysisService()));
    extra.put("fess_vietnamese_tokenizer", (indexSettings, env, name, settings) -> new VietnameseTokenizerFactory(indexSettings, env,
            name, settings, pluginComponent.getFessAnalysisService()));
    extra.put("fess_simplified_chinese_tokenizer", (indexSettings, env, name, settings) -> new ChineseTokenizerFactory(indexSettings,
            env, name, settings, pluginComponent.getFessAnalysisService()));
    return extra;
}
 
開發者ID:codelibs,項目名稱:elasticsearch-analysis-fess,代碼行數:17,代碼來源:FessAnalysisPlugin.java

示例10: testDefaultsIcuAnalysis

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
@Test
public void testDefaultsIcuAnalysis() throws IOException {

    CharFilterFactory charFilterFactory = charFilterFactory("icu_normalizer");
    assertThat(charFilterFactory, instanceOf(IcuNormalizerCharFilterFactory.class));

    TokenizerFactory tf = tokenizerFactory("icu_tokenizer");
    assertThat(tf, instanceOf(IcuTokenizerFactory.class));

    TokenFilterFactory filterFactory = tokenFilterFactory("icu_normalizer");
    assertThat(filterFactory, instanceOf(IcuNormalizerTokenFilterFactory.class));

    filterFactory = tokenFilterFactory("icu_folding");
    assertThat(filterFactory, instanceOf(IcuFoldingTokenFilterFactory.class));

    filterFactory = tokenFilterFactory("icu_transform");
    assertThat(filterFactory, instanceOf(IcuTransformTokenFilterFactory.class));

    Analyzer analyzer = MapperTestUtils.analyzer( "icu_collation");
    assertThat(analyzer, instanceOf(NamedAnalyzer.class));
}
 
開發者ID:jprante,項目名稱:elasticsearch-plugin-bundle,代碼行數:22,代碼來源:IcuAnalysisTests.java

示例11: getTokenizers

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
@Override
public Map<String, AnalysisModule.AnalysisProvider<TokenizerFactory>> getTokenizers() {
  Map<String, AnalysisModule.AnalysisProvider<TokenizerFactory>> extra = new HashMap<>();

  extra.put("jieba_search", JiebaTokenizerFactory::getJiebaSearchTokenizerFactory);
  extra.put("jieba_index", JiebaTokenizerFactory::getJiebaIndexTokenizerFactory);

  return extra;
}
 
開發者ID:sing1ee,項目名稱:elasticsearch-jieba-plugin,代碼行數:10,代碼來源:AnalysisJiebaPlugin.java

示例12: TestAnalysis

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public TestAnalysis(IndexAnalyzers indexAnalyzers,
                    Map<String, TokenFilterFactory> tokenFilter,
                    Map<String, TokenizerFactory> tokenizer,
                    Map<String, CharFilterFactory> charFilter) {
    this.indexAnalyzers = indexAnalyzers;
    this.tokenFilter = tokenFilter;
    this.tokenizer = tokenizer;
    this.charFilter = charFilter;
}
 
開發者ID:justor,項目名稱:elasticsearch_my,代碼行數:10,代碼來源:ESTestCase.java

示例13: AnalysisModule

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
public AnalysisModule(Environment environment, List<AnalysisPlugin> plugins) throws IOException {
    NamedRegistry<AnalysisProvider<CharFilterFactory>> charFilters = setupCharFilters(plugins);
    NamedRegistry<org.apache.lucene.analysis.hunspell.Dictionary> hunspellDictionaries = setupHunspellDictionaries(plugins);
    hunspellService = new HunspellService(environment.settings(), environment, hunspellDictionaries.getRegistry());
    NamedRegistry<AnalysisProvider<TokenFilterFactory>> tokenFilters = setupTokenFilters(plugins, hunspellService);
    NamedRegistry<AnalysisProvider<TokenizerFactory>> tokenizers = setupTokenizers(plugins);
    NamedRegistry<AnalysisProvider<AnalyzerProvider<?>>> analyzers = setupAnalyzers(plugins);
    NamedRegistry<AnalysisProvider<AnalyzerProvider<?>>> normalizers = setupNormalizers(plugins);
    analysisRegistry = new AnalysisRegistry(environment, charFilters.getRegistry(), tokenFilters.getRegistry(), tokenizers
        .getRegistry(), analyzers.getRegistry(), normalizers.getRegistry());
}
 
開發者ID:justor,項目名稱:elasticsearch_my,代碼行數:12,代碼來源:AnalysisModule.java

示例14: createStackedTokenStream

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
private static TokenStream createStackedTokenStream(String source, CharFilterFactory[] charFilterFactories, TokenizerFactory tokenizerFactory, TokenFilterFactory[] tokenFilterFactories, int current) {
    Reader reader = new FastStringReader(source);
    for (CharFilterFactory charFilterFactory : charFilterFactories) {
        reader = charFilterFactory.create(reader);
    }
    Tokenizer tokenizer = tokenizerFactory.create();
    tokenizer.setReader(reader);
    TokenStream tokenStream = tokenizer;
    for (int i = 0; i < current; i++) {
        tokenStream = tokenFilterFactories[i].create(tokenStream);
    }
    return tokenStream;
}
 
開發者ID:justor,項目名稱:elasticsearch_my,代碼行數:14,代碼來源:TransportAnalyzeAction.java

示例15: getTokenizers

import org.elasticsearch.index.analysis.TokenizerFactory; //導入依賴的package包/類
@Override
public Map<String, AnalysisProvider<TokenizerFactory>> getTokenizers() {
    Map<String, AnalysisProvider<TokenizerFactory>> extra = new HashMap<>();
    extra.put("smartcn_tokenizer", SmartChineseTokenizerTokenizerFactory::new);
    // This is an alias to "smartcn_tokenizer"; it's here for backwards compat
    extra.put("smartcn_sentence", SmartChineseTokenizerTokenizerFactory::new);
    return extra;
}
 
開發者ID:justor,項目名稱:elasticsearch_my,代碼行數:9,代碼來源:AnalysisSmartChinesePlugin.java


注:本文中的org.elasticsearch.index.analysis.TokenizerFactory類示例由純淨天空整理自Github/MSDocs等開源代碼及文檔管理平台,相關代碼片段篩選自各路編程大神貢獻的開源項目,源碼版權歸原作者所有,傳播和使用請參考對應項目的License;未經允許,請勿轉載。