本文整理汇总了Python中readability.readability.Document.write方法的典型用法代码示例。如果您正苦于以下问题:Python Document.write方法的具体用法?Python Document.write怎么用?Python Document.write使用的例子?那么恭喜您, 这里精选的方法代码示例或许可以为您提供帮助。您也可以进一步了解该方法所在类readability.readability.Document
的用法示例。
在下文中一共展示了Document.write方法的1个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Python代码示例。
示例1: _update
# 需要导入模块: from readability.readability import Document [as 别名]
# 或者: from readability.readability.Document import write [as 别名]
def _update(self, response):
app.logger.debug("Updating %s" % response.url)
data = Document(response.text).summary()
doc = lxml.html.fromstring(data)
images = []
imageElems = doc.xpath("//img")
app.logger.debug("%d images for %s",len(imageElems), response.url)
for img in imageElems:
src = urlparse.urljoin(response.url, img.get("src"))
imgResp = requests.get(src)
encoded = base64.b64encode(imgResp.content)
if len(encoded) < 3000:
src = "data:" + imgResp.headers["content-type"] + ";base64," + encoded
else:
md5 = hashlib.sha1()
md5.update(encoded)
name = md5.hexdigest()
src = name +"." + src.rpartition(".")[2]
images.append((src, encoded))
img.set("src", src)
data = StringIO()
data.write(lxml.etree.tostring(doc, pretty_print=True))
for (name, imageData) in images:
data.write("\n--data:"+name+"\n"+imageData)
data.seek(0)
self.article = data.read()
self.save()