Python dammit.UnicodeDammit code examples

This article collects typical usage examples of the bs4.dammit.UnicodeDammit class in Python. If you are wondering how dammit.UnicodeDammit is used in practice or what calling it looks like in real code, the selected examples below may help. You can also look further into usage examples of its containing module, bs4.dammit.


A total of 10 code examples of dammit.UnicodeDammit are shown below, sorted by popularity by default.

Example 1: prepare_markup

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def prepare_markup(self, markup, user_specified_encoding=None,
                   document_declared_encoding=None, exclude_encodings=None):
    """
    :return: A 4-tuple (markup, original encoding, encoding
    declared within markup, whether any characters had to be
    replaced with REPLACEMENT CHARACTER).
    """
    # Text needs no encoding detection. `unicode` is the Python 2 text
    # type; on Python 3 this check would use `str`.
    if isinstance(markup, unicode):
        yield (markup, None, None, False)
        return

    # Encodings to try first, in priority order; UnicodeDammit falls back
    # to its own detection if none of them works.
    try_encodings = [user_specified_encoding, document_declared_encoding]
    dammit = UnicodeDammit(markup, try_encodings, is_html=True,
                           exclude_encodings=exclude_encodings)
    yield (dammit.markup, dammit.original_encoding,
           dammit.declared_html_encoding,
           dammit.contains_replacement_characters)
Developer ID: MarcelloLins, project: ServerlessCrawler-VancouverRealState, lines of code: 19, source file: _htmlparser.py
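
The prepare_markup example above passes a prioritized list of candidate encodings to UnicodeDammit and reports which one worked and whether replacement characters were needed. The following is a minimal standard-library sketch of that general idea, not bs4's actual implementation: the helper name `decode_with_candidates` and the choice of windows-1252 as the last-resort fallback are assumptions made for illustration.

```python
def decode_with_candidates(data, candidates, fallback="windows-1252"):
    """Hypothetical helper: try each candidate encoding in order.

    Returns (text, encoding_used, had_replacements).
    """
    if isinstance(data, str):
        # Already text: nothing to detect.
        return data, None, False
    for encoding in candidates:
        if not encoding:
            continue  # skip None entries, e.g. no user-specified encoding
        try:
            return data.decode(encoding), encoding, False
        except (UnicodeDecodeError, LookupError):
            continue  # wrong or unknown encoding: try the next candidate
    # Last resort (an assumption of this sketch): decode with the fallback
    # and substitute U+FFFD for any byte it cannot map.
    text = data.decode(fallback, errors="replace")
    return text, fallback, "\ufffd" in text


text, encoding, replaced = decode_with_candidates(b"caf\xc3\xa9", [None, "utf-8"])
print(ascii(text), encoding, replaced)
```

The three returned values loosely correspond to `dammit.markup`, `dammit.original_encoding`, and `dammit.contains_replacement_characters` in the example; the declared-encoding lookup that UnicodeDammit performs on the markup itself is left out.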

Example 2: prepare_markup

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def prepare_markup(self, markup, user_specified_encoding=None,
                   document_declared_encoding=None):
    """
    :return: A 4-tuple (markup, original encoding, encoding
    declared within markup, whether any characters had to be
    replaced with REPLACEMENT CHARACTER).
    """
    # Text needs no encoding detection. `unicode` is the Python 2 text
    # type; on Python 3 this check would use `str`.
    if isinstance(markup, unicode):
        yield (markup, None, None, False)
        return

    try_encodings = [user_specified_encoding, document_declared_encoding]
    dammit = UnicodeDammit(markup, try_encodings, is_html=True)
    yield (dammit.markup, dammit.original_encoding,
           dammit.declared_html_encoding,
           dammit.contains_replacement_characters)
Developer ID: MayOneUS, project: pledgeservice, lines of code: 18, source file: _htmlparser.py

Example 3: prepare_markup

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def prepare_markup(self, markup, user_specified_encoding=None,
                   document_declared_encoding=None, exclude_encodings=None):
    """
    :return: A 4-tuple (markup, original encoding, encoding
    declared within markup, whether any characters had to be
    replaced with REPLACEMENT CHARACTER).
    """
    # Python 3 variant: text (`str`) needs no encoding detection.
    if isinstance(markup, str):
        yield (markup, None, None, False)
        return

    try_encodings = [user_specified_encoding, document_declared_encoding]
    dammit = UnicodeDammit(markup, try_encodings, is_html=True,
                           exclude_encodings=exclude_encodings)
    yield (dammit.markup, dammit.original_encoding,
           dammit.declared_html_encoding,
           dammit.contains_replacement_characters)
Developer ID: the-ethan-hunt, project: B.E.N.J.I., lines of code: 19, source file: _htmlparser.py

Example 4: prepare_markup

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def prepare_markup(self, markup, user_specified_encoding=None,
                   document_declared_encoding=None):
    """
    :return: A 4-tuple (markup, original encoding, encoding
    declared within markup, whether any characters had to be
    replaced with REPLACEMENT CHARACTER).
    """
    # Text needs no encoding detection. `unicode` is the Python 2 text
    # type; on Python 3 this check would use `str`.
    if isinstance(markup, unicode):
        return markup, None, None, False

    try_encodings = [user_specified_encoding, document_declared_encoding]
    dammit = UnicodeDammit(markup, try_encodings, is_html=True)
    return (dammit.markup, dammit.original_encoding,
            dammit.declared_html_encoding,
            dammit.contains_replacement_characters)
Developer ID: einstein95, project: crunchy-xml-decoder, lines of code: 17, source file: _htmlparser.py

Example 5: prepare_markup

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def prepare_markup(self, markup, user_specified_encoding=None,
                   document_declared_encoding=None):
    """
    :return: A 4-tuple (markup, original encoding, encoding
    declared within markup, whether any characters had to be
    replaced with REPLACEMENT CHARACTER).
    """
    # Text needs no encoding detection. `unicode` is the Python 2 text
    # type; on Python 3 this check would use `str`.
    if isinstance(markup, unicode):
        return markup, None, None, False

    try_encodings = [user_specified_encoding, document_declared_encoding]
    dammit = UnicodeDammit(markup, try_encodings, is_html=True)
    return (dammit.markup, dammit.original_encoding,
            dammit.declared_html_encoding,
            dammit.contains_replacement_characters)
Developer ID: einstein95, project: crunchy-xml-decoder, lines of code: 16, source file: _lxml.py

Example 6: test_smart_quote_substitution

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def test_smart_quote_substitution(self):
    # MS smart quotes are a common source of frustration, so we
    # give them a special test.
    quotes = b"\x91\x92foo\x93\x94"
    dammit = UnicodeDammit(quotes)
    # substitute_html replaces the decoded quotes with named HTML entities.
    self.assertEqual(self.sub.substitute_html(dammit.markup),
                     "&lsquo;&rsquo;foo&ldquo;&rdquo;")
Developer ID: kuri65536, project: python-for-android, lines of code: 9, source file: test_soup.py

Example 7: test_smart_quotes_to_unicode

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def test_smart_quotes_to_unicode(self):
    # Bytes 0x91-0x94 are the Windows-1252 curly quotes.
    markup = b"<foo>\x91\x92\x93\x94</foo>"
    dammit = UnicodeDammit(markup)
    self.assertEqual(
        dammit.unicode_markup, "<foo>\u2018\u2019\u201c\u201d</foo>")
Developer ID: kuri65536, project: python-for-android, lines of code: 7, source file: test_soup.py

Example 8: test_smart_quotes_to_xml_entities

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def test_smart_quotes_to_xml_entities(self):
    markup = b"<foo>\x91\x92\x93\x94</foo>"
    # smart_quotes_to="xml" emits numeric character references.
    dammit = UnicodeDammit(markup, smart_quotes_to="xml")
    self.assertEqual(
        dammit.unicode_markup, "<foo>&#x2018;&#x2019;&#x201C;&#x201D;</foo>")
Developer ID: kuri65536, project: python-for-android, lines of code: 7, source file: test_soup.py

Example 9: test_smart_quotes_to_html_entities

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def test_smart_quotes_to_html_entities(self):
    markup = b"<foo>\x91\x92\x93\x94</foo>"
    # smart_quotes_to="html" emits named HTML entities.
    dammit = UnicodeDammit(markup, smart_quotes_to="html")
    self.assertEqual(
        dammit.unicode_markup, "<foo>&lsquo;&rsquo;&ldquo;&rdquo;</foo>")
Developer ID: kuri65536, project: python-for-android, lines of code: 7, source file: test_soup.py
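
The smart-quote tests above depend on bytes 0x91 to 0x94 being the curly quotes of Windows-1252, which can then be left as Unicode characters or rewritten as XML or HTML entities. Below is a rough standard-library sketch of those three output forms. The helper `convert_smart_quotes` is hypothetical and only imitates the behaviour the tests assert; it is not how bs4 implements `smart_quotes_to`.

```python
from html.entities import codepoint2name

# The four Windows-1252 "smart quote" characters after decoding.
SMART_QUOTES = "\u2018\u2019\u201c\u201d"


def convert_smart_quotes(data, mode=None):
    """Hypothetical helper: decode Windows-1252 bytes and rewrite curly quotes.

    mode=None keeps Unicode characters, "xml" emits numeric character
    references, and "html" emits named entities.
    """
    if mode not in (None, "xml", "html"):
        raise ValueError("mode must be None, 'xml' or 'html'")
    text = data.decode("windows-1252")
    if mode is None:
        return text
    pieces = []
    for ch in text:
        if ch not in SMART_QUOTES:
            pieces.append(ch)
        elif mode == "xml":
            pieces.append("&#x%X;" % ord(ch))  # e.g. &#x2018;
        else:
            pieces.append("&%s;" % codepoint2name[ord(ch)])  # e.g. &lsquo;
    return "".join(pieces)


markup = b"<foo>\x91\x92\x93\x94</foo>"
print(convert_smart_quotes(markup, "xml"))
print(convert_smart_quotes(markup, "html"))
```

Unlike UnicodeDammit, this sketch assumes the input is Windows-1252 throughout and performs no detection.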

Example 10: test_detect_utf8

# Required import: from bs4 import dammit [as alias]
# Or: from bs4.dammit import UnicodeDammit [as alias]
def test_detect_utf8(self):
    # b"\xc3\xa9" is the UTF-8 encoding of U+00E9 (e with acute accent).
    utf8 = b"\xc3\xa9"
    dammit = UnicodeDammit(utf8)
    self.assertEqual(dammit.unicode_markup, '\xe9')
    self.assertEqual(dammit.original_encoding, 'utf-8')
Developer ID: kuri65536, project: python-for-android, lines of code: 7, source file: test_soup.py
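
To see why the detected encoding matters in the UTF-8 test above, the short sketch below decodes the same two bytes under two encodings using only the standard library. The helper `is_valid_utf8` is a hypothetical name introduced here; a strict decode is just one simple validity check, and bs4's own detection is more involved.

```python
data = b"\xc3\xa9"  # the two bytes used in the test above

as_utf8 = data.decode("utf-8")           # one character: U+00E9
as_cp1252 = data.decode("windows-1252")  # two characters: U+00C3, U+00A9


def is_valid_utf8(raw):
    """Hypothetical helper: True if the bytes decode strictly as UTF-8."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


print(len(as_utf8), len(as_cp1252))
print(is_valid_utf8(data), is_valid_utf8(b"\xe9"))
```

A lone 0xE9 byte is not valid UTF-8 because it starts a multi-byte sequence that never completes, which is the kind of input where a fallback encoding becomes necessary.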


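Note: The bs4.dammit.UnicodeDammit examples in this article were compiled by 纯净天空 from open-source code and documentation platforms such as GitHub and MSDocs. The snippets were selected from open-source projects, and copyright in the source code belongs to the original authors. For distribution and use, refer to the License of the corresponding project. Do not reproduce without permission.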