本文整理汇总了Java中net.htmlparser.jericho.CharacterReference.decode方法的典型用法代码示例。如果您正苦于以下问题:Java CharacterReference.decode方法的具体用法?Java CharacterReference.decode怎么用?Java CharacterReference.decode使用的例子?那么恭喜您, 这里精选的方法代码示例或许可以为您提供帮助。您也可以进一步了解该方法所在类net.htmlparser.jericho.CharacterReference
的用法示例。
在下文中一共展示了CharacterReference.decode方法的4个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Java代码示例。
示例1: reencodeTextSegment
import net.htmlparser.jericho.CharacterReference; //导入方法依赖的package包/类
private static void reencodeTextSegment(Source source, OutputDocument outputDocument, int begin, int end, boolean formatWhiteSpace) {
if (begin>=end) return;
Segment textSegment=new Segment(source,begin,end);
String decodedText=CharacterReference.decode(textSegment);
String encodedText=formatWhiteSpace ? CharacterReference.encodeWithWhiteSpaceFormatting(decodedText) : CharacterReference.encode(decodedText);
outputDocument.replace(textSegment,encodedText);
}
示例2: reencodeTextSegment
import net.htmlparser.jericho.CharacterReference; //导入方法依赖的package包/类
private void reencodeTextSegment(Source source, OutputDocument output, int begin, int end) {
if (begin >= end)
return;
Segment textSegment = new Segment(source, begin, end);
String decodedText = CharacterReference.decode(textSegment);
String encodedText = CharacterReference.encode(decodedText);
output.replace(textSegment, encodedText);
}
示例3: reencodeTextSegment
import net.htmlparser.jericho.CharacterReference; //导入方法依赖的package包/类
private void reencodeTextSegment(Source source, OutputDocument output,
int begin, int end) {
if (begin >= end)
return;
Segment textSegment = new Segment(source, begin, end);
String decodedText = CharacterReference.decode(textSegment);
String encodedText = CharacterReference.encode(decodedText);
output.replace(textSegment, encodedText);
}
示例4: mosesPostprocess
import net.htmlparser.jericho.CharacterReference; //导入方法依赖的package包/类
private String mosesPostprocess(String text, Language targetLanguage) {
String result = CharacterReference.decode(text);
result = DeNormalize.processSingleLine(result).replaceAll("\\s+", " ").trim();
if (!targetLanguage.isSpaceDelimited()) {
result = result.replaceAll("(?<=[\u3001-\u9fa0])\\s+(?=[\u3001-\u9fa0])", "");
}
return cleanSpacesAroundTags(result, text);
}