

Python Request.meta['surgery_item'] Code Examples

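This article collects typical usage examples of scrapy.http.request.Request.meta['surgery_item'] in Python. Note that meta is a dict attribute of Request rather than a method, and 'surgery_item' is simply a key chosen by the example project: a spider stores a partially filled item under that key so that the next callback can read it back from response.meta. For more background, see the usage examples of the containing class, scrapy.http.request.Request.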


Two code examples of Request.meta['surgery_item'] are shown below, sorted by popularity by default.

Example 1: parse_surgery

# Requires: from scrapy.http.request import Request
# Also uses urljoin (presumably urllib.parse.urljoin on Python 3) and the project's own SurgeryItem class.
# Request.meta is a dict attribute; 'surgery_item' is a key chosen by this spider, not an importable name.
    def parse_surgery(self, response):
        self.logger.debug(response.url)
        surgery_item = SurgeryItem()
        surgery_item['url'] = response.url
        surgery_item['name'] = response.xpath('//div[@class="w_n"]/h3/text()').extract()[0]
        surgery_item['summary'] = response.xpath('//dd[@class="w_d3"]/text()').extract()[0]

        # Go on parsing details: take the first link after the highlighted tab.
        _next = response.xpath('//div[@class="w_n"]/div[@class="w_na clears"]/a[@class="hover"]/following-sibling::a[not(@class="w_la")][1]/@href').extract()
        if not _next:
            # No detail page follows: emit the item as-is instead of raising IndexError.
            yield surgery_item
            return
        next_detail_url = urljoin(response.url, _next[0])
        request = Request(url=next_detail_url, dont_filter=True, callback=self._parse_surgery_detail)
        # Hand the partially filled item to the next callback via request.meta.
        request.meta['surgery_item'] = surgery_item
        yield request
Author: whypro, Project: medical_crawler, Lines of code: 15, Source file: a120ask.py
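
Example 1 turns a relative href into an absolute URL with urljoin before building the next Request. The sketch below isolates that step in plain Python so it can be tried without a crawler. The helper name resolve_next and the example.com URLs are assumptions made for illustration and do not come from the original project.

```python
from urllib.parse import urljoin


def resolve_next(page_url, hrefs):
    """Return the absolute URL of the first href, or None if there is none.

    `hrefs` plays the role of the list returned by `.extract()`; an empty
    list means the page has no further detail link.
    """
    if not hrefs:
        return None
    return urljoin(page_url, hrefs[0])


# A relative href is resolved against the directory of the current page.
print(resolve_next("http://example.com/surgery/123/", ["indication.html"]))
# A root-relative href keeps only the scheme and host of the current page.
print(resolve_next("http://example.com/surgery/123.html", ["/surgery/123/care.html"]))
# An empty result list yields None instead of raising IndexError.
print(resolve_next("http://example.com/surgery/123/", []))
```

Checking for an empty list before indexing is what keeps the callback from failing on pages that have no following tab.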

Example 2: _parse_surgery_detail

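# Requires: from scrapy.http.request import Request
# Also uses urljoin (presumably urllib.parse.urljoin on Python 3) and a strip_tags helper that is not shown here.
# Request.meta is a dict attribute; 'surgery_item' is a key chosen by this spider, not an importable name.
    def _parse_surgery_detail(self, response):
        self.logger.debug(response.url)
        # Retrieve the item that the previous callback stored in request.meta.
        surgery_item = response.meta['surgery_item']
        # The file name of the URL without its extension selects the item field to fill.
        key = response.url.split('/')[-1].split('.')[0]
        field = self._surgery_detail_url_map[key]
        self.logger.debug('%s %s %s', surgery_item['name'], key, field)
        surgery_item[field] = strip_tags('\n'.join(response.xpath('//div[@class="w_contl fl"]/h3/following-sibling::*').extract())).strip()

        _next = response.xpath('//div[@class="w_n"]/div[@class="w_na clears"]/a[@class="hover"]/following-sibling::a[not(@class="w_la")][1]/@href').extract()
        if _next:
            # More detail pages remain: pass the same item along to the next one.
            next_detail_url = urljoin(response.url, _next[0])
            request = Request(url=next_detail_url, dont_filter=True, callback=self._parse_surgery_detail)
            request.meta['surgery_item'] = surgery_item
            yield request
        else:
            # Last detail page: the item is complete, so emit it.
            yield surgery_item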
Author: whypro, Project: medical_crawler, Lines of code: 18, Source file: a120ask.py
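
Taken together, the two callbacks form a chain: one item travels through several pages in request.meta and is yielded only when no further page exists. The following plain-Python sketch imitates that flow without Scrapy or network access. FakeRequest, FakeResponse, crawl, NEXT_PAGE and DETAIL_URL_MAP are hypothetical stand-ins invented for illustration, and the field names and URLs are assumptions rather than values from the original project. The one behavior borrowed from Scrapy is that response.meta exposes the meta dict of the request that produced the response.

```python
# Plain-Python stand-ins for illustration only; these are NOT Scrapy classes.
class FakeRequest:
    def __init__(self, url, callback):
        self.url = url
        self.callback = callback
        self.meta = {}


class FakeResponse:
    def __init__(self, request):
        self.url = request.url
        # Mirrors Scrapy, where response.meta is a shortcut to request.meta.
        self.meta = request.meta


# Hypothetical site layout: each page points to the next one (None = last page).
NEXT_PAGE = {
    "http://example.com/surgery/1/": "http://example.com/surgery/1/indication.html",
    "http://example.com/surgery/1/indication.html": "http://example.com/surgery/1/care.html",
    "http://example.com/surgery/1/care.html": None,
}

# Hypothetical mapping from URL key to item field.
DETAIL_URL_MAP = {"indication": "indications", "care": "aftercare"}


def parse_surgery(response):
    item = {"url": response.url}
    request = FakeRequest(NEXT_PAGE[response.url], callback=parse_detail)
    request.meta["surgery_item"] = item  # hand the item to the next callback
    yield request


def parse_detail(response):
    item = response.meta["surgery_item"]  # same dict object as before
    key = response.url.split("/")[-1].split(".")[0]
    item[DETAIL_URL_MAP[key]] = "text from " + key
    next_url = NEXT_PAGE[response.url]
    if next_url:
        request = FakeRequest(next_url, callback=parse_detail)
        request.meta["surgery_item"] = item
        yield request
    else:
        yield item  # chain finished: emit the completed item


def crawl(start_url):
    """Tiny scheduler: run callbacks until no requests remain, collecting items."""
    pending = [FakeRequest(start_url, callback=parse_surgery)]
    items = []
    while pending:
        request = pending.pop(0)
        for result in request.callback(FakeResponse(request)):
            if isinstance(result, FakeRequest):
                pending.append(result)
            else:
                items.append(result)
    return items


items = crawl("http://example.com/surgery/1/")
print(items)
```

Because every request in the chain carries the same dict, each callback adds its field to one shared item, and exactly one item comes out at the end of the chain.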


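Note: The scrapy.http.request.Request.meta['surgery_item'] examples in this article were compiled by 纯净天空 from open-source code and documentation platforms such as GitHub and MSDocs. The snippets were selected from open-source projects, and copyright of the source code belongs to the original authors. For distribution and use, refer to the License of the corresponding project. Do not reproduce without permission.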