Outstanding Python project: product short-text classification with Flask, CNN, Bi-LSTM, and more (source code + deployment docs + all data).zip - CSDN Library

43 files in total: 14 py, 5 html, 4 tsv


141 views · uploaded 2024-05-25 09:42:05 · 11.01 MB ZIP


Archive contents (43 files):

Short-Text-classification-master/
  demo/
    data/
      label2idx_dict.pkl 64KB
      word2idx_dict.pkl 1.94MB
    predict.py 3KB
    templates/
      Wait.html 2KB
      help.html 2KB
      index.html 2KB
      Single_data_classification_result.html 2KB
      Batch_data_classification_results.html 2KB
    webGUI.py 12KB
    getFinalResult.py 17KB
    testdata/
      test_GUI.tsv 794B
      test_GUI.txt 794B
      test-100example.tsv 9KB
    static/
      wait2.js 3KB
      wait.js 3KB
      img/
        timg2.jpg 1.48MB
        wait.jpg 10KB
        格式.png 19KB
        timg3.jpg 53KB
        timg.jpg 646KB
        yemi.png 13KB
      css/
        style.css 3KB
    result/
      result-test_GUI.tsv 1KB
      result-test_pre.txt 1KB
      result-test_GUI.txt 1KB
      result-test_pre.tsv 1KB
    解决方案.mp4 7.05MB
  model/
    weightnorm.py 9KB
    basicProcess.py 6KB
    dataPreprocess.py 10KB
    word2vec.py 3KB
    main.py 3KB
    ensemble_stacking.py 12KB
    TextRNNmodel.py 11KB
    MyModel.py 4KB
    LSTMAttention.py 7KB
    ensemble_weight_average.py 14KB
    TextCNNmodel.py 5KB
  README.md 3KB
  解决方案_final.pdf 1.12MB
  python系统部署文档.md 14KB
  Flask系统部署文档.md 4KB
  171265889347208773632.zip 416B

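# Short-Text-classification

First-prize solution for the 10th College Student Service Outsourcing Competition, task A01: product short-text classification. The task is solved with Keras-based Word2vec, CNN, Bi-LSTM, Attention, and adversarial methods. A visual, interactive application built on the Flask framework serves the models and supports classification of both single texts and batches of texts.

## 1. Experiment results

Model performance on the 500k-sample dataset (training set : test set = 400k : 100k)

| Model | Accuracy |
| ---- | ---- |
| TextCNN | 0.8820 |
| BiLSTM | 0.8990 |
| BiLSTM-Attention | 0.9056 |
| Adv-BiLSTM-Attention | 0.9156 |
| TextCNN(word) + BiLSTM-Attention(word) + BiLSTM-Attention(char) + Adv-BiLSTM-Attention(word) + Adv-BiLSTM-Attention(char) [weighted fusion] | 0.9201 |

## 2. Requirements

> Keras==2.0.5+ Python3.6+

> - pandas==0.20.3
> - Flask==0.12.2
> - xlrd==1.1.0
> - jieba==0.39
> - tensorflow==1.4.0
> - h5py==2.7.0
> - Keras==2.0.5
> - numpy==1.14.2

## 3. Dataset & pretrained models

- [public training dataset, 500k samples](https://pan.baidu.com/s/1aSy3fxFNvsorfdq2LuK4pA) (extraction code: ac2c)
- [Attention-wight-norm-WithPositionEmbedding(0.9088).h5](https://pan.baidu.com/s/1vharQoMO2j_6iL0SYcsfLQ) (extraction code: tf4a)
- [GRUAttention(0.9175799998474121).h5](https://pan.baidu.com/s/1O-VCIsoPzbvol58ngVV43A) (extraction code: epnq)
- [TextBiLSTM-weightnorm(0.9156999999237061).h5](https://pan.baidu.com/s/1Ub-lcLeAb_EOEqVwStNNVw) (extraction code: 1u3b)
- [word embedding matrix and the sentence length info of dataset](https://pan.baidu.com/s/1QN0e_LsjEvDU2FJ5QeLrow) (extraction code: ki3e)

## 4. Installation steps of the demo

> 1. git clone https://github.com/SaulZhang/Short-Text-classification.git
> 2. python webGUI.py
> 3. Enter http://127.0.0.1:8000/ in the browser's address bar.

## 5. Application usage guide

### 5.1 Application name

Commodity Text Classification (商品文本分类)

### 5.2 Features

#### 5.2.1 Single-item classification

Enter a product name in the text box for single-item classification, then click the "Single data classification" button and wait for the model to run. When it finishes, the page redirects and displays the classification result. To classify another item, click the "Back" button and repeat the steps above.

#### 5.2.2 Batch classification

For batch classification, first select the file to classify. The application supports only the '.txt' and '.tsv' formats; selecting any other format produces an error message. A valid file has "ITEM_NAME" alone on the first line as the header, with no other separators, and each following line holds the name of one product. If the file content is malformed, the application reports an error (the original README illustrates the expected format with a figure). After selecting a correctly formatted file, click the "Batch data classification" button and wait for the model to run. When it finishes, the page redirects and displays the classification results for the first 200 rows of the file. The complete result file is saved under the './result/' folder of the project directory.

### 5.3 Supported browsers

Microsoft Edge 41.16299.967.0+, Firefox 66.0.1+, Chrome 72.0.3626.96+

## 6. Contributors

[@Saul Zhang](https://github.com/SaulZhang), [@Caiyuan-Zheng](https://github.com/Caiyuan-Zheng), [@searcher408](https://github.com/Searcher408), [@jvyvkai](https://github.com/jvyvkai), [@Chinazzh8796](https://github.com/Chinazzh8796)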
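The batch-file rules in the usage guide ('.txt' or '.tsv' only, "ITEM_NAME" alone on the first line, one product name per following line, first 200 results shown) can be sketched as a small validator. This is a minimal illustration and not the code in `webGUI.py`: the function names `read_item_names` and `preview`, the UTF-8 encoding, and the choice to skip blank lines are all assumptions.

```python
import os

ALLOWED_EXTENSIONS = {".txt", ".tsv"}  # the two formats the README lists
HEADER = "ITEM_NAME"                   # required content of the first line
PREVIEW_LIMIT = 200                    # the result page shows the first 200 rows


def read_item_names(path):
    """Return the product names in a batch file, or raise ValueError.

    Hypothetical helper: the encoding and blank-line handling are assumptions.
    """
    ext = os.path.splitext(path)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError("unsupported file type: %r" % ext)
    # utf-8-sig also accepts files that start with a byte-order mark.
    with open(path, encoding="utf-8-sig") as f:
        lines = f.read().splitlines()
    if not lines or lines[0].strip() != HEADER:
        raise ValueError("first line must be exactly %r" % HEADER)
    # Every non-blank line after the header is treated as one product name.
    return [line.strip() for line in lines[1:] if line.strip()]


def preview(items):
    """Return the rows that would be displayed on the result page."""
    return items[:PREVIEW_LIMIT]
```

Rejecting a bad file before the model is loaded keeps the error message immediate, which matches the behaviour the README describes for wrong extensions and malformed content.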
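The best score in the results table comes from weighted fusion of several models. The README does not say how the weights are chosen or how `ensemble_weight_average.py` combines the outputs, so the following is only a generic sketch of weighted probability averaging. The function names and the example weights are assumptions.

```python
def weighted_average(prob_vectors, weights):
    """Fuse per-model class probabilities with a normalised weighted mean."""
    if not prob_vectors or len(prob_vectors) != len(weights):
        raise ValueError("need one weight per model")
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n_classes = len(prob_vectors[0])
    fused = [0.0] * n_classes
    for probs, w in zip(prob_vectors, weights):
        if len(probs) != n_classes:
            raise ValueError("all models must score the same classes")
        for i, p in enumerate(probs):
            fused[i] += (w / total) * p
    return fused


def predict_label(prob_vectors, weights):
    """Return the index of the class with the highest fused probability."""
    fused = weighted_average(prob_vectors, weights)
    return max(range(len(fused)), key=fused.__getitem__)


# Two hypothetical models scoring three classes; the second gets 3x the weight.
cnn_probs = [0.6, 0.3, 0.1]
lstm_probs = [0.2, 0.7, 0.1]
label = predict_label([cnn_probs, lstm_probs], [1.0, 3.0])
```

In practice the weights would typically be tuned on held-out data, for example in proportion to each model's validation accuracy; that tuning step is not described in the README.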
