This repository contains source code of examples from the book *Web Scraping with Python*, published by Packt Publishing. Examples have been tested with Python 2.7 and depend on: * [BeautifulSoup](http://www.crummy.com/software/BeautifulSoup/) (Ch 2) * [lxml](http://lxml.de/) (Ch 2-9) * [pymongo](http://api.mongodb.org/python/current/) (Ch 3-5, 9) * [PyQt](http://www.riverbankcomputing.co.uk/software/pyqt/intro) / [PySide](https://pypi.python.org/pypi/PySide) (Ch 5) * [ghost](http://jeanphix.me/Ghost.py/) (Ch 5) * [Selenium WebDriver](http://www.seleniumhq.org/projects/webdriver/) (Ch 5, 9) * [mechanize](http://wwwsearch.sourceforge.net/mechanize/) (Ch 6) * [PIL](http://www.pythonware.com/products/pil/) / [Pillow](https://python-pillow.github.io/) (Ch 7) * [pytesseract](https://github.com/madmaze/pytesseract) (Ch 7) * [scrapy](http://scrapy.org/) (Ch 8) * [portia](https://github.com/scrapinghub/portia) (Ch 8) * [scrapely](https://github.com/scrapy/scrapely) (Ch 8) This examples will break in future as websites change and dependencies are updated, so [bug reports and patches](https://bitbucket.org/wswp/code/issues?status=new&status=open) are welcome.
- 1
- 2
- 粉丝: 1
- 资源: 3
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- (源码)基于Django和OpenCV的智能车视频处理系统.zip
- (源码)基于ESP8266的WebDAV服务器与3D打印机管理系统.zip
- (源码)基于Nio实现的Mycat 2.0数据库代理系统.zip
- (源码)基于Java的高校学生就业管理系统.zip
- (源码)基于Spring Boot框架的博客系统.zip
- (源码)基于Spring Boot框架的博客管理系统.zip
- (源码)基于ESP8266和Blynk的IR设备控制系统.zip
- (源码)基于Java和JSP的校园论坛系统.zip
- (源码)基于ROS Kinetic框架的AGV激光雷达导航与SLAM系统.zip
- (源码)基于PythonDjango框架的资产管理系统.zip