This article collects typical usage examples of the Java method org.archive.io.warc.WARCRecordInfo.getRecordId. If you are wondering how WARCRecordInfo.getRecordId is used in practice, the selected code examples below may help. You can also look at further usage examples for the enclosing class, org.archive.io.warc.WARCRecordInfo.
Two code examples of WARCRecordInfo.getRecordId are shown below, sorted by popularity by default.
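Example 1: writeRequest
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.net.URI;
import java.text.ParseException;
import org.archive.io.warc.WARCRecordInfo; // class that declares getRecordId
// Other names used below (WARCConstants, ANVLRecord, DateUtils, GENERATOR,
// metadata, writer, getUrl) come from the webarchive library or the enclosing class.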
protected URI writeRequest(URI id) throws IOException, ParseException {
    WARCRecordInfo record = new WARCRecordInfo();
    record.setType(WARCConstants.WARCRecordType.request);
    record.setUrl(getUrl());
    // The fetch time is stored as epoch milliseconds in the metadata
    record.setCreate14DigitDate(DateUtils
        .getLog14Date(Long.parseLong(metadata.get("nutch.fetch.time"))));
    record.setMimetype(WARCConstants.HTTP_REQUEST_MIMETYPE);
    record.setRecordId(GENERATOR.getRecordID());

    // Link this record to the related record whose ID was passed in
    if (id != null) {
        ANVLRecord headers = new ANVLRecord();
        headers.addLabelValue(WARCConstants.HEADER_KEY_CONCURRENT_TO,
            '<' + id.toString() + '>');
        record.setExtraHeaders(headers);
    }

    // Note: getBytes() without an argument uses the platform default charset
    ByteArrayOutputStream output = new ByteArrayOutputStream();
    output.write(metadata.get("_request_").getBytes());
    record.setContentLength(output.size());
    record.setContentStream(new ByteArrayInputStream(output.toByteArray()));

    writer.writeRecord(record);
    // Return the ID that was assigned above via setRecordId
    return record.getRecordId();
}
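The way the record ID travels in Example 1 can be sketched with the JDK alone. This is only an illustration, not the webarchive implementation: newRecordId is a hypothetical stand-in for GENERATOR.getRecordID(), and the assumption that record IDs are urn:uuid URIs is mine, not something the example states. The angle-bracket wrapping mirrors the '<' + id.toString() + '>' expression in writeRequest.

```java
import java.net.URI;
import java.util.UUID;

public class RecordIdSketch {

    // Hypothetical stand-in for GENERATOR.getRecordID();
    // assumes record IDs are urn:uuid URIs.
    static URI newRecordId() {
        return URI.create("urn:uuid:" + UUID.randomUUID());
    }

    // Mirrors '<' + id.toString() + '>' from writeRequest.
    static String concurrentToValue(URI id) {
        return "<" + id + ">";
    }

    public static void main(String[] args) {
        // The ID returned by one record (what getRecordId hands back)...
        URI firstId = newRecordId();
        // ...is passed to the next writer and referenced in its extra headers.
        URI secondId = newRecordId();
        System.out.println("WARC-Record-ID: <" + secondId + ">");
        System.out.println("WARC-Concurrent-To: " + concurrentToValue(firstId));
    }
}
```

Returning the ID from each write method, as both examples do, is what lets a caller pass it on as the id argument of a later call.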
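Example 2: writeResponse
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.net.URI;
import java.text.ParseException;
import org.archive.io.warc.WARCRecordInfo; // class that declares getRecordId
// Other names used below (WARCConstants, DateUtils, StringUtils, ParseSegment,
// GENERATOR, metadata, content, writer) come from libraries or the enclosing class.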
protected URI writeResponse() throws IOException, ParseException {
    WARCRecordInfo record = new WARCRecordInfo();
    record.setType(WARCConstants.WARCRecordType.response);
    record.setUrl(getUrl());
    record.setCreate14DigitDate(DateUtils
        .getLog14Date(Long.parseLong(metadata.get("nutch.fetch.time"))));
    record.setMimetype(WARCConstants.HTTP_RESPONSE_MIMETYPE);
    record.setRecordId(GENERATOR.getRecordID());

    // Optional headers: server IP address and truncation marker
    String ip = getResponseAddress();
    if (StringUtils.isNotBlank(ip)) {
        record.addExtraHeader(WARCConstants.HEADER_KEY_IP, ip);
    }
    if (ParseSegment.isTruncated(content)) {
        record.addExtraHeader(WARCConstants.HEADER_KEY_TRUNCATED, "unspecified");
    }

    // Note: getBytes() without an argument uses the platform default charset
    ByteArrayOutputStream output = new ByteArrayOutputStream();
    String httpHeaders = metadata.get("_response.headers_");
    if (StringUtils.isNotBlank(httpHeaders)) {
        output.write(httpHeaders.getBytes());
    } else {
        // change the record type to resource, as we do not have any
        // information about the headers
        record.setType(WARCConstants.WARCRecordType.resource);
        record.setMimetype(content.getContentType());
    }
    output.write(getResponseContent().getBytes());

    record.setContentLength(output.size());
    record.setContentStream(new ByteArrayInputStream(output.toByteArray()));

    if (output.size() > 0) {
        // avoid generating a 0-sized record, as the webarchive library will
        // complain about it
        writer.writeRecord(record);
    }
    // The ID is returned even when the record was skipped above
    return record.getRecordId();
}
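The payload handling in Example 2 (headers first, then body, a fallback record type when no headers are stored, and no write for an empty payload) can be sketched without the webarchive library. Everything here is hypothetical scaffolding: RecordType, Payload and assemble are illustrative names, UTF-8 is an assumed charset (the original uses the platform default), and the blank check only approximates StringUtils.isNotBlank.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

public class PayloadSketch {

    // Hypothetical stand-in for WARCConstants.WARCRecordType.
    enum RecordType { RESPONSE, RESOURCE }

    // Hypothetical holder for the assembled record content.
    static final class Payload {
        final RecordType type;
        final byte[] bytes;

        Payload(RecordType type, byte[] bytes) {
            this.type = type;
            this.bytes = bytes;
        }

        // Mirrors the output.size() > 0 guard before writeRecord.
        boolean shouldWrite() {
            return bytes.length > 0;
        }
    }

    static Payload assemble(String httpHeaders, String body) {
        ByteArrayOutputStream output = new ByteArrayOutputStream();
        RecordType type = RecordType.RESPONSE;
        // Approximation of StringUtils.isNotBlank(httpHeaders)
        if (httpHeaders != null && !httpHeaders.trim().isEmpty()) {
            byte[] h = httpHeaders.getBytes(StandardCharsets.UTF_8);
            output.write(h, 0, h.length);
        } else {
            // Without stored headers, fall back to a resource record.
            type = RecordType.RESOURCE;
        }
        byte[] b = body.getBytes(StandardCharsets.UTF_8);
        output.write(b, 0, b.length);
        return new Payload(type, output.toByteArray());
    }

    public static void main(String[] args) {
        Payload withHeaders = assemble("HTTP/1.1 200 OK\r\n\r\n", "hi");
        Payload empty = assemble(null, "");
        System.out.println(withHeaders.type + " " + withHeaders.bytes.length);
        System.out.println(empty.type + " write=" + empty.shouldWrite());
    }
}
```

The content length is taken from the assembled bytes rather than computed separately, which matches how the example derives it from output.size().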