A crawler application built on Celery as its core framework (.zip resource) - CSDN文库

93 files in total

py: 59

txt: 6

md: 6

python

crawler

data collection

5 stars (rated above 95% of resources) · 194 views · uploaded 2024-03-01 13:57:35 · 30.33MB ZIP

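This project is a crawler application that uses Celery as its core framework. Crawler tasks can be added flexibly, and crawlers for multiple sites can run at the same time. All components natively support large-scale concurrency and distributed operation.

Archive: 该项目是一个使用celery作为主体框架的爬虫应用，能够灵活的添加爬虫任务，并且同时运行多站点的爬虫工作，所有组件都能够原生支持规模并发和分布式，加上celery原生的分布式调.zip (93 files)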

SJT-code

debug.log 10KB

doc

img

main-start.jpg 3.48MB

eg-middleware.jpg 4.87MB

eg-pipeline.jpg 586KB

task-start.jpg 847KB

beat-start.png 939KB

eg-crawler.jpg 3.71MB

develop.md 0B

install.md 0B

develop.md 0B

install.md 0B

index.md 0B

data

custom

pkuseg_user_dict.txt 21B

stopwords

cn_stopwords.txt 5KB

scu_stopwords.txt 7KB

hit_stopwords.txt 5KB

baidu_stopwords.txt 9KB

utils

__init__.py 266B

loader.py 702B

network.py 798B

driver.py 1KB

bin

.gitkeep 0B

driver

chromedriver_win32.zip 4.54MB

chromedriver_linux64.zip 4.71MB

chromedriver_mac64.zip 6.68MB

Dockerfile 99B

common

__init__.py 248B

plugins

__init__.py 266B

human

__init__.py 266B

notify.py 982B

slider.py 1KB

verification.py 267B

storage

__init__.py 266B

sqlitestorage.py 0B

mongostorage.py 2KB

filestorage.py 894B

timetrans.py 2KB

sqlitedao.py 1KB

settings.py 4KB

exceptions.py 354B

singleton.py 475B

requirements.txt 1KB

test

__init__.py 266B

test_sqlitedao.py 892B

test_settings.py 943B

test_browser.py 361B

test_stopwords.py 694B

test_hanlp.py 2KB

test_segments.py 15KB

.gitignore 205B

deadpool

__init__.py 244B

celery.py 6KB

README.md 18KB

contrib

__init__.py 248B

mysql

__init__.py 249B

tables

__init__.py 314B

cookie.py 1KB

proxy.py 1KB

base.py 344B

base.py 2KB

redis

__init__.py 266B

base.py 1KB

elastic

__init__.py 248B

indices

__init__.py 248B

rlogs.py 964B

base.py 2KB

apps

__init__.py 248B

periodic

__init__.py 248B

tasks

__init__.py 248B

task_proxy

__init__.py 325B

__main__.py 4KB

validator.py 6KB

upstream.py 2KB

task_cookie

__init__.py 266B

__main__.py 269B

base.py 2KB

asynch

__init__.py 249B

tasks

__init__.py 248B

task_eastmoney

__init__.py 334B

middleware.py 4KB

crawler.py 2KB

__main__.py 7KB

pipeline.py 1KB

base.py 6KB

config

jobs.yaml 322B

jobs.d

periodic.d

task_cookie.yaml 245B

task_proxy.yaml 283B

async.d

task_eastmoney.yaml 321B

config.yaml 1KB

conf.d

deadpool 935B

result

task_eastmoney.db3 12KB

scripts

deadpool.service 480B

deadpool-beat.service 506B

2401_82471966

2024-04-10

The resource is thorough and well described. It solved my problem, and I learned a lot from it.

JJJ69

Followers: 6367
Resources: 5917
