# nlp
自然语言处理:中文分词,打标签,文章匹配相似度
打标签:
extra_tags.py:
关键函数:extarct_tags ,通过help(jieba.analyse.extarct_tags) 查看
函数提示如下:
withWeight:单词权重
allowPOS:单词性质,参看https://wenku.baidu.com/view/49eab3a9ad51f01dc281f1f8.html
withFlag:
======================================================
Help on method extract_tags in module jieba.analyse.tfidf:
extract_tags(self, sentence, topK=20, withWeight=False, allowPOS=(), withFlag=False) method of jieba.analyse.tfidf.TFIDF instance
Extract keywords from sentence using TF-IDF algorithm.
Parameter:
- topK: return how many top keywords. `None` for all possible words.
- withWeight: if True, return a list of (word, weight);
if False, return a list of words.
- allowPOS: the allowed POS list eg. ['ns', 'n', 'vn', 'v','nr'].
if the POS of w is not in this list,it will be filtered.
- withFlag: only work with allowPOS is not empty.
if True, return a list of pair(word, weight) like posseg.cut
if False, return a list of words
自然语言处理:中文分词,打标签,文章匹配相似度,机器学习.zip
需积分: 5 194 浏览量
2024-05-06
11:45:55
上传
评论
收藏 46KB ZIP 举报
生瓜蛋子
- 粉丝: 3824
- 资源: 5678
最新资源
- Screenshot_2024-06-05-21-20-09-259_net.csdn.csdnplus.jpg
- miflash_unlock.zip
- stream.x64.x-none.rarstream.x64.x-none.rarstream.x64.x-none.rars
- MAVEN 教程和详细讲解
- 目标检测高空拍摄道路小车轿车检测数据集601张VOC+YOLO格式.zip
- 智慧旅游 大屏模板 静态模板
- JSP+sql网络远程作业处理系统
- 2024年全国一卷高考数学等3个文件(1).zip
- 实战自学python如何成为大佬(目录):https://blog.csdn.net/weixin-67859959/artic
- PHP新闻网站系统源代码
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈