毕业设计基于Python水利“舆情”构建知识图谱(Neo4j)应用源码+详细文档+全部数据资料（高分项目）.zip

共18个文件

py：14个

zip：1个

gitattributes：1个

版权申诉

毕业设计

课程设计

Python

知识图谱

neo4j

7 浏览量 2024-04-22 17:41:00 上传评论收藏 21KB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

毕业设计基于Python水利“舆情”构建知识图谱(Neo4j)应用源码+详细文档+全部数据资料（高分项目）.zip （18个子文件）

Irrigation-Proj-master

ltp_date_st.py 5KB

ltp_cloud.py 3KB

pdf2txt.py 4KB

.gitattributes 378B

p_loader.py 4KB

mongodb_class.py 4KB

ltp.py 5KB

put2db.py 3KB

api.py 3KB

pdfacc.py 2KB

ltp_date.py 5KB

rd.py 2KB

example.py 3KB

.gitignore 1KB

parse.py 2KB

restacc.py 757B

ir_loader.sh 227B

171265889347208773632.zip 416B

#-*-coding:UTF-8-*- # import sys from pyltp import Segmentor from pyltp import Postagger from pyltp import Parser from pyltp import SentenceSplitter from pyltp import NamedEntityRecognizer import re import json seg = Segmentor() seg.load_with_lexicon('ltp_data_v3.4.0/cws.model', './ext_word') post = Postagger() post.load_with_lexicon('ltp_data_v3.4.0/pos.model', './ext_word_pos') recognizer = NamedEntityRecognizer() recognizer.load('ltp_data_v3.4.0/ner.model') parser = Parser() parser.load('ltp_data_v3.4.0/parser.model') def segmentor(sentence): """ 断词 :param sentence: 语句 :return: 词列表 """"" global seg words = seg.segment(sentence) words_list = list(words) return words_list def postagger(words): """ 获取词性 :param words: 词 :return: 词性 """ global post pos = post.postag(words) pos_list = list(pos) return pos_list def getdate(word): """ 获取月份 :param word: 词 :return: 月 """ if ("春" in word) or ("夏" in word) or ("秋" in word) or ("冬" in word): return word if "月" in word: _sentence = word.replace('十一月', '11月').replace('十二月', '12月') _sentence = _sentence.replace('一月', '1月').replace('二月', '2月').replace('三月', '3月'). \ replace('四月', '4月').replace('五月', '5月') _sentence = _sentence.replace('六月', '6月').replace('七月', '7月').replace('八月', '8月'). \ replace('九月', '9月').replace('十月', '10月') else: _sentence = word m = re.search(r"(\d+)月", _sentence) if m is not None: _m = int(m.group(0).split("月")[0]) if (_m < 13) and (_m > 0): return _m return None def hasSBV(arcs): sbv = False vob = False hed = False for nn in arcs: if "SBV" in nn.relation: sbv = True if "VOB" in nn.relation: vob = True if "HED" in nn.relation: hed = True if (hed and sbv) or (vob and sbv): return True return False def chs_replace(sentence): """ :param sentence: :return: """ _s = sentence.replace("零", "0") _s = _s.replace("一", "1").replace("二", "2").replace("三", "3") _s = _s.replace("四", "4").replace("五", "5").replace("六", "6") _s = _s.replace("七", "7").replace("八", "8").replace("九", "9") _n_idx = 0 while True: _idx = _s.find("十", _n_idx) if _idx < 0: break if _idx == _n_idx: break if _idx > 0: if _s[_idx-1:_idx].isdigit(): if _s[_idx+3:_idx+4].isdigit(): _s = _s[:_idx] + _s[_idx+3:] else: _s = _s[:_idx] + "0" + _s[_idx+3:] else: if _s[_idx+3:_idx+4].isdigit(): _s = _s[:_idx] + "1" + _s[_idx+3:] _n_idx = _idx return _s year = None month = None events = {} line_cnt = 0 sentence_cnt = 0 """获取文本文件""" f = open(sys.argv[1]) while True: """读一段文本""" sentence = f.readline() if (sentence is "") or (len(sentence) == 0): """ Eof """ break """替代中文引号""" sentence = sentence.replace("‘", "").replace("’", "").replace("”", "").replace("“", "") line_cnt += 1 """断句""" sents = SentenceSplitter.split(sentence) for ss in sents: s = chs_replace(ss) if re.search(r"(\d+)年(\d+)月(\d+)日", s) is None: continue """语句处理""" sentence_cnt += 1 words = segmentor(s) pos = postagger(words) netags = list(recognizer.recognize(words, pos)) arcs = list(parser.parse(words, pos)) _i = 0 for _p in pos: # print _p, " : ", words[_i] if "nt" in _p: if "年" in words[_i]: _y = words[_i].split("年")[0] if str(_y).isdigit(): _y = int(_y) if (_y > 1000) and (_y < 2024): year = _y else: _m = getdate(words[_i]) if _m is not None: month = _m """Next""" _i += 1 if hasSBV(arcs) and (year is not None): if year not in events: events[year] = {} if month not in events[year]: events[year][month] = [] events[year][month].append(ss) """ _i = 0 for nn in arcs: print "%d:%s " % (nn.head, nn.relation), if nn.head > 0: print words[_i], " <---> ", words[nn.head-1] else: print words[_i] _i += 1 if "SBV" is nn.relation: if nn not in events[year][month]: events[year][month].append("%d:%s" % (nn.head, nn.relation)) """ # print(">>> %d, %d" % (line_cnt, sentence_cnt)) out_info = {} for _date in sorted(events, key=lambda x: x): print("<<< %s >>>" % str(_date)) if _date not in out_info: out_info[_date] = [] for _m in sorted(events[_date], key=lambda x: x): for _evt in events[_date][_m]: print("\t%-22s" % _evt) out_info[_date].append(_evt) parser.release() post.release() seg.release() with open("out.json", "w") as f: f.write(json.dumps(out_info)) f.close() print("Done.")

评论收藏

内容反馈

版权申诉

不走小道

粉丝: 3221
资源: 5113

毕业设计 基于Python水利“舆情”构建知识图谱(Neo4j)应用源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python汽修领域知识图谱(Neo4j)问答源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于地理格网的时空知识图谱(Neo4j)源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python中医药知识图谱(Neo4j)智能问答源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+Flask知识图谱(Neo4j)的建筑资料智能管理源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+Django医药知识图谱(Neo4j)自动问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+Flask知识图谱(Neo4j)的脑卒中人机对话系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)的文档搜索系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)的传感器情境问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于SpringBoot的电影搜索和电影知识图谱(Neo4j)源码+详细文档+全部数据资料（高分项目）

毕业设计 基于Python+Flask模板的知识图谱(Neo4j)问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)的古诗词问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)的高校信息查询系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)的教务智能问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)的心血管疾病问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)和trietree的垃圾分类源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+知识图谱(Neo4j)的出版物检索和推荐系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python知识图谱(Neo4j)的金融文本数据挖掘源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Python+Flask哈利波特知识图谱(Neo4j)的问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计 基于Pytorch毕业论文知识图谱(Neo4j)构建平台源码+详细文档+全部数据资料（高分项目）.zip

34个经典javaweb项目实例.zip

毕业设计 springBoot人力资源管理系统+毕业论文+前后端源代码

项目源码：基于Hadoop+Spark招聘推荐可视化系统 大数据项目 计算机毕业设计

基于spring boot的小区物业管理系统源码+论文+答辩ppt

毕业设计：舆情监测系统（SpringBoot+NLP）

计算机毕业设计：Flask股票数据采集分析可视化系统 python+爬虫+金融数据

毕业设计-基于JAVA的springboot超市进销存系统(源代码+论文）

人脸识别系统OpenCV+dlib+python（含数据库）Pyqt5界面设计 项目源码 毕业设计

基于深度学习的课堂行为识别和考试作弊检测系统的设计与实现（python源码）

基于51单片机的智能电子秤系统设计(含代码仿真及论文)

最新资源

毕业设计基于Python水利“舆情”构建知识图谱(Neo4j)应用源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python汽修领域知识图谱(Neo4j)问答源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于地理格网的时空知识图谱(Neo4j)源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python中医药知识图谱(Neo4j)智能问答源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+Flask知识图谱(Neo4j)的建筑资料智能管理源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+Django医药知识图谱(Neo4j)自动问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+Flask知识图谱(Neo4j)的脑卒中人机对话系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)的文档搜索系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)的传感器情境问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于SpringBoot的电影搜索和电影知识图谱(Neo4j)源码+详细文档+全部数据资料（高分项目）

毕业设计基于Python+Flask模板的知识图谱(Neo4j)问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)的古诗词问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)的高校信息查询系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)的教务智能问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)的心血管疾病问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)和trietree的垃圾分类源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+知识图谱(Neo4j)的出版物检索和推荐系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python知识图谱(Neo4j)的金融文本数据挖掘源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Python+Flask哈利波特知识图谱(Neo4j)的问答系统源码+详细文档+全部数据资料（高分项目）.zip

毕业设计基于Pytorch毕业论文知识图谱(Neo4j)构建平台源码+详细文档+全部数据资料（高分项目）.zip

项目源码：基于Hadoop+Spark招聘推荐可视化系统大数据项目计算机毕业设计

人脸识别系统OpenCV+dlib+python（含数据库）Pyqt5界面设计项目源码毕业设计