本文整理汇总了Python中readability.readability.Document.summary_with_metadata方法的典型用法代码示例。如果您正苦于以下问题:Python Document.summary_with_metadata方法的具体用法?Python Document.summary_with_metadata怎么用?Python Document.summary_with_metadata使用的例子?那么恭喜您, 这里精选的方法代码示例或许可以为您提供帮助。您也可以进一步了解该方法所在类readability.readability.Document
的用法示例。
在下文中一共展示了Document.summary_with_metadata方法的1个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Python代码示例。
示例1: get_webpage_by_html
# 需要导入模块: from readability.readability import Document [as 别名]
# 或者: from readability.readability.Document import summary_with_metadata [as 别名]
def get_webpage_by_html(url, html=None):
html = get_html_str(url, html)
summary_obj = predefined_site(url, html)
article = video_site(url)
if summary_obj is None:
doc = Document(html, url=url, debug=True, multipage=False)
summary_obj = doc.summary_with_metadata(enclose_with_html_tag=False)
title = summary_obj.short_title
if article is None:
article = summary_obj.html
from urllib.parse import urlparse
webpage = Webpage()
webpage.url = url
webpage.domain = urlparse(url).hostname
webpage.title = title
webpage.favicon = ""
webpage.top_image = None
webpage.excerpt = summary_obj.description
webpage.author = None
webpage.content = article
webpage.tags = get_suggest_tags(title, article, summary_obj.keywords)
webpage.movies = []
webpage.raw_html = html
webpage.publish_date = None
webpage.segmentation = get_segmentation(title, article)
return webpage.__dict__