**以基于情绪分类的四分类模型为例:**
### 单条数据
- ##### 原始bert
~~~
Device: cuda:0
Mean inference time: 18.27ms
Standard deviation: 4.72ms
accuracy:82.84566838783705
~~~
- ##### pruned_H8.0F2048n_iters8
~~~
Device: cuda:0
Mean inference time: 16.17ms
Standard deviation: 5.31ms
accuracy:81.87033849684452
~~~
- ##### pruned_H6.0F1536n_iters16
~~~
Device: cuda:0
Mean inference time: 17.08ms
Standard deviation: 2.49ms
accuracy:74.69879518072288
~~~
- ##### pruned_H6.0F1536n_iters16ffn-first
~~~
Device: cuda:0
Mean inference time: 6.89ms
Standard deviation: 0.12ms
accuracy:81.41135972461274
~~~
- ##### pruned_H6.0F1536n_iters32ffn-first
~~~
Device: cuda:0
Mean inference time: 7.01ms
Standard deviation: 0.21ms
accuracy:76.01835915088927
~~~
- ##### 蒸馏为3层
~~~
Device: cuda:0
Mean inference time: 5.94ms
Standard deviation: 5.30ms
accuracy:82.09982788296041
~~~
### 多条数据(30条)
- ##### 原始bert
~~~
Device: cuda:0
Mean inference time: 233.47ms
Standard deviation: 13.70ms
accuracy:82.84566838783705
~~~
- ##### pruned_H8.0F2048n_iters8
~~~
Device: cuda:0
Mean inference time: 189.19ms
Standard deviation: 15.93ms
accuracy:81.87033849684452
~~~
- ##### pruned_H6.0F1536n_iters16
~~~
Device: cuda:0
Mean inference time: 164.29ms
Standard deviation: 8.27ms
accuracy:74.69879518072288
~~~
- ##### pruned_H6.0F1536n_iters16ffn-first
~~~
Device: cuda:0
Mean inference time: 76.30ms
Standard deviation: 1.04ms
accuracy:81.41135972461274
~~~
- ##### pruned_H6.0F1536n_iters32ffn-first
~~~
Device: cuda:0
Mean inference time: 65.07ms
Standard deviation: 2.01ms
accuracy:76.01835915088927
~~~
- ##### 蒸馏为3层
~~~
Device: cuda:0
Mean inference time: 58.96ms
Standard deviation: 9.06ms
accuracy:82.09982788296041
~~~
### 结论:
transformer裁剪中:`target_ffn_size=1536, target_num_of_heads=6` 精度损耗较为严重,即使设置32轮迭代依旧很低,建议采用:`target_ffn_size=2048, target_num_of_heads=8`
没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
收起资源包目录
基于深度学习分类模板.zip (42个子文件)
template_classify-main
data
data_cleaning.py 2KB
.dvcignore 139B
.dvc
.gitignore 26B
config 0B
run.py 24KB
requirments.txt 133B
deploy
__init__.py 0B
interface_debug.py 295B
run_app.sh 65B
custom_logging.py 3KB
app.py 1KB
access.log 0B
nli.py 8KB
data_model.py 309B
logging_config.json 341B
vocab.txt 107KB
config.py 1KB
run.sh 524B
optimize
__init__.py 0B
evaluate
evaluate_factory.py 3KB
__init__.py 0B
onnx_performance_time.py 2KB
evaluate_pytorch.py 2KB
__pycache__
evaluate_factory.cpython-38.pyc 2KB
evaluate_onnx.py 3KB
quantify
quantize.py 1KB
distilled_quantify.py 4KB
Bert-GLUE_OnnxRuntime_quantization.ipynb 47KB
distill
utils.py 1KB
__init__.py 0B
matches.py 9KB
utils_glue.py 21KB
distill.sh 2KB
distill.py 13KB
config.py 5KB
pruner
__init__.py 0B
model_pruner.py 3KB
README.md 2KB
acceleration
onnx.pdf 1.34MB
export_pytorch2onnx.py 2KB
export_onnx2fp16.sh 119B
accuracy.py 3KB
共 42 条
- 1
资源评论
博士僧小星
- 粉丝: 1936
- 资源: 5901
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功