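Java Tika Class Code Examples

This article collects typical usage examples of the org.apache.tika.Tika class in Java. If you are wondering how the Tika class is used in practice, or are looking for working examples, the selected code examples below may help.


The Tika class belongs to the org.apache.tika package. Fifteen code examples of the class are shown below, sorted by popularity by default.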

Example 1: BinaryImportDestinationChooser

import org.apache.tika.Tika; // import the required package/class
/**
 * Creates a new {@link RepositoryLocationChooser} that allows to specify a media or MIME type
 * for the given data source.
 *
 * @param source
 *            the data source
 * @param initialDestination
 *            the initial location (optional)
 */
public BinaryImportDestinationChooser(BinaryDataSource source, String initialDestination) {
    super(null, null, initialDestination, true, false, true, true, Colors.WHITE);

    // Use generic mime type as default and try to guess more specific.
    String type = "application/octet-stream";
    Tika tika = new Tika();
    try {
        type = tika.detect(source.getLocation());
    } catch (IOException e) {
        // ignore
    }

    JLabel mediaTypelabel = new ResourceLabel("repository_chooser.mime_type");
    mediaType = new JTextArea(type);

    GridBagConstraints c = new GridBagConstraints();
    c.insets = new Insets(ButtonDialog.GAP, 0, 0, ButtonDialog.GAP);
    c.gridwidth = GridBagConstraints.RELATIVE;
    add(mediaTypelabel, c);

    c.insets = new Insets(ButtonDialog.GAP, 0, 0, 0);
    c.gridwidth = GridBagConstraints.REMAINDER;
    c.fill = GridBagConstraints.HORIZONTAL;
    add(mediaType, c);
}
 
Developer ID: transwarpio, project: rapidminer, lines of code: 35, source: BinaryImportDestinationChooser.java

Example 2: setUp

import org.apache.tika.Tika; // import the required package/class
@Before
public void setUp() throws TikaException, IOException, SAXException {
    VelocityEngine engine = new VelocityEngine();
    engine.setProperty(RuntimeConstants.RESOURCE_LOADER, "classpath");
    engine.setProperty("classpath.resource.loader.class", ClasspathResourceLoader.class.getName());
    engine.init();

    Templater templater = new Templater();
    templater.setEngine(engine);

    exporter = new HtmlExporter();
    exporter.setTemplater(templater);

    TikaProvider provider = new TikaProvider();
    Tika tika = provider.tika();

    transformer = new TikaTransformer();
    transformer.setTika(tika);
}
 
Developer ID: LIBCAS, project: ARCLib, lines of code: 20, source: HtmlExporterTest.java

Example 3: setUp

import org.apache.tika.Tika; // import the required package/class
@Before
public void setUp() throws TikaException, IOException, SAXException {
    VelocityEngine engine = new VelocityEngine();
    engine.setProperty(RuntimeConstants.RESOURCE_LOADER, "classpath");
    engine.setProperty("classpath.resource.loader.class", ClasspathResourceLoader.class.getName());
    engine.init();

    Templater templater = new Templater();
    templater.setEngine(engine);

    exporter = new PdfExporter();
    exporter.setTemplater(templater);

    TikaProvider provider = new TikaProvider();
    Tika tika = provider.tika();

    transformer = new TikaTransformer();
    transformer.setTika(tika);
}
 
Developer ID: LIBCAS, project: ARCLib, lines of code: 20, source: PdfExporterTest.java

Example 4: detect

import org.apache.tika.Tika; // import the required package/class
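/**
 * Uses Tika to detect the MIME type of a file.
 * Tika has to read the File or URL data stream, so detection takes some time.
 * Do not guess from the file extension instead: an extension cannot be checked
 * against the actual content, so the result is unreliable.
 *
 * @param file the file to inspect
 * @return the detected media type
 * @throws Exception if the file does not exist or cannot be read
 */
public static String detect(File file) throws Exception {

    // The file does not exist
    if (!file.exists()) {
        throw new Exception("exception ! " + file.getAbsoluteFile() + " does not exist.");
    }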
    Tika t = new Tika();
    return t.detect(file);

}
 
Developer ID: h819, project: spring-boot, lines of code: 20, source: MyMimeTypeUtils.java
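
The comment in Example 4 warns against trusting the file extension. The sketch below illustrates that point with JDK classes only, so that it runs without Tika on the classpath. It is not how Tika works internally and recognizes far fewer formats; the class name SniffSketch and its two helper methods are made up for this illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.net.URLConnection;

/** JDK-only illustration (hypothetical class); not Tika, and much less thorough. */
class SniffSketch {

    /** Guesses a type from the file name alone; returns null for unknown extensions. */
    static String byName(String fileName) {
        return URLConnection.guessContentTypeFromName(fileName);
    }

    /** Guesses a type from the leading bytes; returns null when no known signature matches. */
    static String byContent(byte[] data) {
        // ByteArrayInputStream supports mark/reset, which guessContentTypeFromStream needs.
        try (InputStream in = new ByteArrayInputStream(data)) {
            return URLConnection.guessContentTypeFromStream(in);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // The eight-byte PNG signature, stored under a misleading name.
        byte[] png = {(byte) 0x89, 'P', 'N', 'G', '\r', '\n', 0x1A, '\n'};
        System.out.println("by name:    " + byName("notes.txt"));
        System.out.println("by content: " + byContent(png));
    }
}
```

The name-based guess only sees "notes.txt", while the content-based guess inspects the PNG signature. That mismatch is the situation the comment in Example 4 describes.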

Example 5: failedMimetypeDetectionShouldWork

import org.apache.tika.Tika; // import the required package/class
@Test
@DirtiesContext
public void failedMimetypeDetectionShouldWork() throws Exception {
    final Reflect controllerReflect = Reflect.on(this.controller);
    // Much more evil isn't possible, i guess... DirtiesContext!!!!
    Tika tika = controllerReflect.field("tika").get();
    tika = spy(tika);
    when(tika.detect(any(InputStream.class), any(String.class))).thenThrow(IOException.class);
    controllerReflect.set("tika", tika);

    final MockMultipartFile multipartFile = new MockMultipartFile("assetData", "asset.png", null, this.getClass().getResourceAsStream("/eu/euregjug/site/assets/asset.png"));
    when(this.gridFsTemplate.findOne(any(Query.class))).thenReturn(null);

    mvc
            .perform(
                    fileUpload("/api/assets")
                    .file(multipartFile)
            )
            .andExpect(status().isCreated())
            .andExpect(content().string("asset.png"));

    verify(this.gridFsTemplate).findOne(any(Query.class));
    verify(this.gridFsTemplate).store(any(InputStream.class), eq("asset.png"), isNull(String.class));
    verifyNoMoreInteractions(this.gridFsTemplate);
}
 
Developer ID: EuregJUG-Maas-Rhine, project: site, lines of code: 26, source: AssetApiControllerTest.java

Example 6: getMimeType

import org.apache.tika.Tika; // import the required package/class
/**
 * Returns the MIME type of the given data, detected from the content rather
 * than from a file extension.
 * @param barr the content to inspect
 * @param defaultValue the value returned when detection fails
 * @return the detected MIME type, or defaultValue on failure
 */
public static String getMimeType(byte[] barr, String defaultValue) {

    //String mt = getMimeType(new ByteArrayInputStream(barr), null);
    //if(!StringUtil.isEmpty(mt,true)) return mt;

    try {
        Tika tika = new Tika();
        return tika.detect(barr);
    }
    catch(Throwable t) {
        ExceptionUtil.rethrowIfNecessary(t);
        return defaultValue;
    }
}
 
Developer ID: lucee, project: Lucee, lines of code: 22, source: IOUtil.java

Example 7: initialise

import org.apache.tika.Tika; // import the required package/class
@PostConstruct
public void initialise() {
    // find all kiwi-module.properties and check whether they contain a baseurl property to map module web
    // resources to a certain path prefix; if yes, store the prefix and jar URL in the map for lookup and
    // resource resolving by the filter
    this.resourceMap = new HashMap<>();

    for(String module : moduleService.listModules()) {
        ModuleConfiguration config = moduleService.getModuleConfiguration(module);
        if(config.getConfiguration().containsKey("baseurl")) {
            String path = config.getConfiguration().getString("baseurl");
            resourceMap.put(path.startsWith("/")?path:"/"+path,moduleService.getModuleJar(module).toString());
        }
    }

    this.tika = new Tika();
}
 
Developer ID: apache, project: marmotta, lines of code: 18, source: MarmottaResourceServiceImpl.java

Example 8: DownloadedFiles

import org.apache.tika.Tika; // import the required package/class
public DownloadedFiles(int thread) {
  File file = new File("./download_cache" + thread);
  Collection<File> list = FileUtils.listFiles(file, null, false);
  if (!list.isEmpty()) {
    try {
      File download = list.iterator().next();
      byte[] bytes = FileUtils.readFileToByteArray(download);
      content = Base64.encodeBase64String(bytes);
      filename = download.getName();
      mimeType = new Tika().detect(bytes, filename);
      int index = filename.lastIndexOf(".");
      if (index > -1 && index < filename.length()) {
        extension = filename.substring(index + 1).toLowerCase();
        filename = filename.substring(0, index);
      }
    } catch (Throwable t) {
      Log.exception(t);
    } finally {
      for (File cur : list) {
        FileUtils.deleteQuietly(cur);
      }
    }
  }
}
 
Developer ID: MachinePublishers, project: ScreenSlicer, lines of code: 25, source: Scrape.java
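
Example 8 splits the downloaded file name into a base name and a lower-cased extension inline. The sketch below shows one way to factor that step into a standalone helper. The class FileNameParts and its edge-case rules (a name with no dot, or with only a leading dot, has no extension) are assumptions made for this illustration and differ slightly from the inline code above.

```java
import java.util.Locale;

/** Hypothetical helper: splits "Report.PDF" into base name "Report" and extension "pdf". */
final class FileNameParts {
    final String baseName;
    final String extension; // lower-cased; empty when the name has no extension

    private FileNameParts(String baseName, String extension) {
        this.baseName = baseName;
        this.extension = extension;
    }

    static FileNameParts of(String filename) {
        int index = filename.lastIndexOf('.');
        // Assumption: no dot, or only a leading dot (".bashrc"), means "no extension".
        if (index <= 0) {
            return new FileNameParts(filename, "");
        }
        // Locale.ROOT keeps lower-casing independent of the JVM's default locale.
        String ext = filename.substring(index + 1).toLowerCase(Locale.ROOT);
        return new FileNameParts(filename.substring(0, index), ext);
    }

    public static void main(String[] args) {
        FileNameParts parts = FileNameParts.of("archive.tar.GZ");
        System.out.println(parts.baseName + " | " + parts.extension);
    }
}
```

Only the last dot is treated as the separator, which matches the lastIndexOf call in Example 8.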

Example 9: makeMedia

import org.apache.tika.Tika; // import the required package/class
private Media makeMedia(String filename) throws FileTikaException {
    File file = new File(filename);

    if (!isBeneathParent(root, file)) {
        throw new FileTikaException(file + " is not a child of " + root);
    }

    Tika tika = new Tika();
    Metadata metadata = new Metadata();
    logger.trace("Reading metadata for {}", filename);
    // try-with-resources closes both the stream and the reader even if parsing fails
    try (InputStream in = TikaInputStream.get(file, metadata);
         Reader reader = tika.parse(in, metadata)) {
        logger.trace("Metadata read to {}", metadata);
        return new FileTikaMediaImpl(file, metadata);
    } catch (Exception ex) {
        throw new FileTikaException(ex);
    }
}
 
Developer ID: KolonelKustard, project: discodj, lines of code: 22, source: FileTikaCatalogueImpl.java

Example 10: onTrigger

import org.apache.tika.Tika; // import the required package/class
@Override
public void onTrigger(final ProcessContext context, final ProcessSession session) throws ProcessException {
    FlowFile flowFile = session.get();
    if (flowFile == null) {
        flowFile = session.create();
    }
    try {
        flowFile.getAttributes();
        flowFile = session.putAttribute(flowFile, "mime.type", "application/json");
        flowFile = session.write(flowFile, new StreamCallback() {
            @Override
            public void process(InputStream inputStream, OutputStream outputStream) throws IOException {
                Tika tika = new Tika();
                String text = "";
                try {
                    text = tika.parseToString(inputStream);
                } catch (TikaException e) {
                    getLogger().error("Apache Tika failed to parse input " + e.getLocalizedMessage());
                    e.printStackTrace();
                }
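                // TODO: wrap in JSON???
                // Encode explicitly; getBytes() without a charset uses the platform default.
                outputStream.write(text.getBytes(java.nio.charset.StandardCharsets.UTF_8));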
            }
        });
        session.transfer(flowFile, REL_SUCCESS);
        session.commit();
    } catch (final Throwable t) {
        getLogger().error("Unable to process ExtractTextProcessor file " + t.getLocalizedMessage());
        getLogger().error("{} failed to process due to {}; rolling back session", new Object[] { this, t });
        throw t;
    }
}
 
Developer ID: tspannhw, project: nifi-extracttext-processor, lines of code: 33, source: ExtractTextProcessor.java
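
Example 10 sets mime.type to application/json but writes the extracted text unchanged, and its TODO asks whether the text should be wrapped in JSON. The sketch below shows one possible way to do that without a JSON library. The class JsonTextSketch and the field name "text" are hypothetical and not part of the original processor. A real project would more likely use an existing JSON library.

```java
/** Hypothetical helper: wraps plain text in a one-field JSON object. */
final class JsonTextSketch {

    /** Returns the input as a JSON string literal, including the surrounding quotes. */
    static String quote(String s) {
        StringBuilder sb = new StringBuilder(s.length() + 2).append('"');
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n");  break;
                case '\r': sb.append("\\r");  break;
                case '\t': sb.append("\\t");  break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters become six-character unicode escapes.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.append('"').toString();
    }

    /** Builds a JSON object with a single field named "text" (an assumed field name). */
    static String wrap(String text) {
        return "{\"text\":" + quote(text) + "}";
    }

    public static void main(String[] args) {
        System.out.println(wrap("He said \"hi\"\nand left."));
    }
}
```

In the processor, the bytes written to the output stream would then be the UTF-8 encoding of wrap(text) instead of the raw text.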

Example 11: open

import org.apache.tika.Tika; // import the required package/class
public void open(File file) {

    Tika tika = new Tika();
    tika.setMaxStringLength(999999);
    StringWriter sw = new StringWriter();
    PrintWriter pw = new PrintWriter(sw);

    fileName.setText(file.getName());
    int dot = file.getName().lastIndexOf('.');
    // lastIndexOf returns -1 for names without a dot; keep the whole name in that case
    String baseName = dot >= 0 ? file.getName().substring(0, dot) : file.getName();
    String saveName = baseName + ".txt";
    try {
        //long start = System.currentTimeMillis();
        String text = tika.parseToString(file);
        //long end = System.currentTimeMillis();
        pw.println(text);
        //mimeType.setText(tika.detect(file) + " (" + NumberFormat.getNumberInstance().format(end-start) + "ms)");
    } catch (Exception ex) {
        ex.printStackTrace(pw);
    }

    pw.flush();

    plain.setText(sw.toString());
    saveFile(sw.toString(), saveName);
    plain.setCaretPosition(0);
}
 
Developer ID: GeneZC, project: Apache-tika-gui, lines of code: 28, source: TikaGUI.java

Example 12: setUp

import org.apache.tika.Tika; // import the required package/class
@Before
public void setUp() throws IOException, TikaException, SAXException {
    MockitoAnnotations.initMocks(this);

    createDirectories(Paths.get("fileTestFiles"));

    docxExporter = new DocxExporter();

    xlsxExporter = new XlsxExporter();

    VelocityEngine engine = new VelocityEngine();
    engine.setProperty(RuntimeConstants.RESOURCE_LOADER, "classpath");
    engine.setProperty("classpath.resource.loader.class", ClasspathResourceLoader.class.getName());
    engine.init();

    templater = new Templater();
    templater.setEngine(engine);

    pdfExporter = new PdfExporter();
    pdfExporter.setTemplater(templater);

    ObjectMapperProducer objectMapperProducer = new ObjectMapperProducer();
    mapper = objectMapperProducer.objectMapper(false, false);

    TikaProvider provider = new TikaProvider();
    Tika tika = provider.tika();

    transformer = new TikaTransformer();
    transformer.setTika(tika);
}
 
Developer ID: LIBCAS, project: ARCLib, lines of code: 31, source: ReportGeneratorTest.java

Example 13: setUp

import org.apache.tika.Tika; // import the required package/class
@Before
public void setUp() throws TikaException, IOException, SAXException {
    TikaProvider provider = new TikaProvider();
    Tika tika = provider.tika();

    transformer = new TikaTransformer();
    transformer.setTika(tika);
}
 
Developer ID: LIBCAS, project: ARCLib, lines of code: 9, source: TikaTransformerTest.java

Example 14: getMimeType

import org.apache.tika.Tika; // import the required package/class
public String getMimeType() {
    try {
        Tika tika = new Tika();
        return tika.detect(Paths.get(getFilePath()));
    } catch (IOException e) {
        throw new RuntimeException(e);
    }
}
 
Developer ID: NHS-digital-website, project: hippo, lines of code: 9, source: Attachment.java

Example 15: parse

import org.apache.tika.Tika; // import the required package/class
@Override
public AldermanAttendance parse(Path targetFile) {
    try {
        Tika tika = new Tika();
        String content = tika.parseToString(targetFile.toFile());
        AldermanAttendance aldermanAttendance = ParserUtils.fromText(content);
        return aldermanAttendance;
    } catch (IOException | TikaException e) {
        e.printStackTrace();
        return null;
    }
}
 
Developer ID: sjcdigital, project: presenca-vereadores-sjc, lines of code: 13, source: AnyDocumentAttendanceParser.java


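Note: The org.apache.tika.Tika class examples in this article were compiled by 純淨天空 from open-source code and documentation platforms such as GitHub and MSDocs. The snippets were selected from open-source projects, and copyright in the source code remains with the original authors. For distribution and use, refer to the license of the corresponding project. Do not reproduce without permission.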