TurboTransformers: a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc.) on CPU and GPU

274 files in total

py: 68

cpp: 67

h: 52

nlp

gpu

decoder

machine-translation

inference

Points required: 22, 8 views, uploaded 2021-02-03 15:20:31, 2.96MB ZIP


TurboTransformers archive contents (274 files)

.clang-format 21B

FindMKL.cmake 4KB

FindGperftools.cmake 2KB

openblas.cmake 1KB

eigen.cmake 1KB

cuda.cmake 880B

transpose.cpp 26KB

multi_headed_attention_smart_batch.cpp 19KB

transpose_test.cpp 18KB

multi_headed_attention.cpp 17KB

pybind.cpp 14KB

bert_model.cpp 10KB

mat_mul.cpp 8KB

bert_model_example.cpp 8KB

layer_norm.cpp 8KB

matmul_benchmark.cpp 7KB

bert_model_test.cpp 7KB

utils.cpp 6KB

seq_pool.cpp 6KB

positionwise_ffn.cpp 5KB

utils_test.cpp 5KB

profiler.cpp 5KB

activation.cpp 5KB

bert_allocator_test.cpp 5KB

bert_config.cpp 4KB

allocator_api.cpp 4KB

softmax.cpp 4KB

mat_mul_test.cpp 4KB

model_aware_memory_scheduler.cpp 4KB

layernorm_benchmark.cpp 4KB

embedding.cpp 4KB

tensor.cpp 4KB

bert_embedding.cpp 4KB

transpose_benchmark.cpp 3KB

softmax_test.cpp 3KB

albert_layer.cpp 3KB

softmax_benchmark.cpp 3KB

common.cpp 3KB

layer_norm_test.cpp 3KB

bert_output.cpp 3KB

prepare_bert_masks.cpp 3KB

activation_benchmark.cpp 3KB

memory.cpp 3KB

bert_intermediate.cpp 3KB

activation_test.cpp 3KB

prepare_bert_masks_test.cpp 3KB

embedding_test.cpp 3KB

allocator_api_test.cpp 2KB

bert_pooler.cpp 2KB

gpu_utils_test.cpp 2KB

model_aware_memory_scheduler_test.cpp 2KB

tensor_test.cpp 2KB

blas_openblas.cpp 2KB

blas_blis.cpp 2KB

bert_attention.cpp 2KB

cuda_device_context.cpp 2KB

allocator_impl.cpp 2KB

config.cpp 1KB

enforce.cpp 1KB

addbias_layernorm.cpp 1KB

enforce_test.cpp 1KB

addbias_act.cpp 1KB

sequence_pool.cpp 1KB

device_context_test.cpp 1KB

model_aware_allocator.cpp 967B

naive_allocator.cpp 951B

base_allocator.cpp 948B

fp16_test.cpp 946B

ordered_list.cpp 748B

benchmark_helper.cpp 702B

npz_load.cpp 696B

blas_mkl.cpp 692B

catch2_test_main.cpp 54B

Dockerfile_dev.cpu 1KB

Dockerfile_release.cpu 816B

gpu_transpose_kernel.cu 16KB

gpu_utils.cu 8KB

gpu_softmax_kernel.cu 8KB

gpu_layer_norm_kernel.cu 5KB

gpu_activation_kernel.cu 3KB

gpu_embedding_kernel.cu 3KB

gpu_block_reduce.cuh 6KB

cuda_enforce.cuh 3KB

Dockerfile_ci 171B

.dockerignore 117B

.gitattributes 0B

.gitignore 168B

.gitmodules 621B

Dockerfile_dev.gpu 1KB

Dockerfile_release.gpu 1KB

tensor.h 10KB

multi_headed_attention_smart_batch.h 5KB

common.h 5KB

model_aware_memory_scheduler.h 5KB

multi_headed_attention.h 5KB

model_aware_allocator.h 5KB

enforce.h 3KB

transpose.h 3KB

ordered_list.h 3KB

gpu_transpose_kernel.h 3KB


Uploader: 国服第一奶妈
