# Awesome Pretrained Chinese NLP Models[![Awesome](https://awesome.re/badge.svg)](https://awesome.re)
![](/resources/LLMS.png)
<div align="center">
<a href="https://arxiv.org/pdf/2303.18223.pdf">论æ: A Survey of Large Language Models</a>
</div>
å¨èªç¶è¯è¨å¤çé¢åä¸ï¼é¢è®ç»è¯è¨æ¨¡åï¼Pretrained Language Modelsï¼å·²æ为é常éè¦çåºç¡ææ¯ï¼æ¬ä»åºä¸»è¦æ¶éç®åç½ä¸å
¬å¼çä¸äºé«è´¨éä¸æé¢è®ç»æ¨¡åãä¸æå¤æ¨¡æ模åãä¸æ大è¯è¨æ¨¡åçå
容(æè°¢å享èµæºç大佬)ï¼å¹¶å°æç»æ´æ°......
> å½å
ä¸è½½HuggingFaceä»åºæ¨¡åæ¨è使ç¨HuggingFaceéåå°å: https://hf-mirror.com/
# Expand Table of Contents
+ [æ´æ°æ¥å¿](#æ´æ°)
+ [éç¨åºç¡å¤§æ¨¡å](#Base-LLM)
+ [åç´åºç¡å¤§æ¨¡å](#Domain-Base-LLM)
+ [éç¨å¯¹è¯å¤§æ¨¡å](#ChatLLM)
+ [åç´å¯¹è¯å¤§æ¨¡å](#Domain-ChatLLM)
+ [å¤æ¨¡æ对è¯å¤§æ¨¡å](#MultiModal-ChatLLM)
+ [大模åè¯ä¼°åºå](#大模åè¯ä¼°åºå)
+ [å¨çº¿ä½éªå¤§æ¨¡å](#å¨çº¿ä½éªå¤§æ¨¡å)
+ [å¼æºæ¨¡ååºå¹³å°](#å¼æºæ¨¡ååºå¹³å°)
+ [å¼æºæ°æ®éåº](#å¼æºæ°æ®éåº)
+ [å¼æºä¸ææ令æ°æ®é](#ä¸ææ令æ°æ®é)
+ [Embedding](#Embedding)
+ [Other-Awesome](#other-awesome)
+ <details><summary>NLUç³»å</summary>
- [BERT](#BERT)
- [RoBERTa](#RoBERTa)
- [ALBERT](#ALBERT)
- [NEZHA](#NEZHA)
- [XLNET](#XLNET)
- [MacBERT](#MacBERT)
- [WoBERT](#WoBERT)
- [ELECTRA](#ELECTRA)
- [ZEN](#ZEN)
- [ERNIE](#ERNIE)
- [ERNIE3](#ERNIE3)
- [RoFormer](#RoFormer)
- [StructBERT](#StructBERT)
- [Lattice-BERT](#Lattice-BERT)
- [Mengzi-BERT](#Mengzi-BERT)
- [ChineseBERT](#ChineseBERT)
- [TaCL](#TaCL)
- [MC-BERT](#MC-BERT)
- [äºéç¥](#äºéç¥)
- [PERT](#PERT)
- [MobileBERT](#MobileBERT)
- [GAU-α](#GAU-α)
- [DeBERTa](#DeBERTa)
- [GlyphBERT](#GlyphBERT)
- [CKBERT](#CKBERT)
- [LERT](#LERT)
- [RoCBert](#RoCBert)
- [m3e](#M3E)
- [LEALLA](#LEALLA)
</details>
+ <details><summary>NLGç³»å</summary>
- [GPT](#GPT)
- [GPT-3](#GPT-3)
- [NEZHA-GEN](#NEZHA-GEN)
- [CPM-Generate](#CPM-Generate)
- [T5](#T5)
- [T5-PEGASUS](#T5-PEGASUS)
- [Mengzi-T5](#Mengzi-T5)
- [çå¤Î±](#PanGu-Alpha)
- [EVA](#EVA)
- [BART](#BART)
- [é»ä»²](#é»ä»²)
- [ä½å
](#ä½å
)
- [RWKV](#RWKV)
- [Bloom](#Bloom)
- [PromptCLUE](#PromptCLUE)
- [ChatYuan](#ChatYuan)
- [SkyText](#SkyText)
- [ProphetNet](#ProphetNet)
</details>
+ <details><summary>NLU-NLGç³»å</summary>
- [UniLM](#UniLM)
- [Simbert](#Simbert)
- [RoFormer-sim](#RoFormer-sim)
- [CPM-2](#CPM-2)
- [CPT](#CPT)
- [å¨æç](#å¨æç)
- [GLM](#GLM)
- [PLUG](#PLUG)
- [OPD](#OPD)
</details>
+ <details><summary>Multi-Modal</summary>
- [WenLan](#WenLan)
- [CogView](#CogView)
- [ç´«ä¸å¤ªå](#ç´«ä¸å¤ªå)
- [Mengzi-oscar](#Mengzi-oscar)
- [R2D2](#R2D2)
- [Chinese-CLIP](#Chinese-CLIP)
- [TaiYi-CLIP](#TaiYi-CLIP)
- [AltCLIP](#AltCLIP)
- [AltDiffusion](#AltDiffusion)
- [Taiyi-Stable-Diffusion](#Taiyi-Stable-Diffusion)
- [wukong](#wukong)
- [OFA](#OFA)
- [QA-CLIP](#QA-CLIP)
</details>
+ <details><summary>Table</summary>
- [SDCUP](#SDCUP)
</details>
` å¤æ³¨`
>ND: Non-Causal Decoder or Prefix LM
>CD: Causal Decoder
>ED: Encoder-Decoder
## Base-LLM
> 大è§æ¨¡åºç¡æ¨¡åï¼è¡¨æ ¼ä¸åªç½ååºåæ°é`大äº7B`以ä¸æ¨¡åã
| 模å | å¤§å° | æ¶é´ | è¯è¨ | é¢å | ä¸è½½ | 项ç®å°å | æºæ/个人 | æ¶æ | æç® | å¤æ³¨ |
| :--------: | :------: | :-------: | :----: | :----: | :-----------: | :------: | :---------------: | :--: | :--------------: | ----- |
| XVERSE-MoE | 255B/A36B | 2024-09 | ä¸è± | éç¨ | [ð¤HF](https://huggingface.co/xverse/XVERSE-MoE-A36B) | [XVERSE-MoE-A36B](https://github.com/xverse-ai/XVERSE-MoE-A36B) | [xverse-ai](https://github.com/xverse-ai) | MoE | |
| Qwen-2.5 | 0.5/1.5/3/7/14/32/72B | 2024-09 | ä¸è± | éç¨ | [ð¤HF](https://huggingface.co/collections/Qwen/qwen25-66e81a666513e518adb90d9e) | [Qwen2.5](https://github.com/QwenLM/Qwen2.5) | [QwenLM](https://github.com/QwenLM) | CD | [Blog](https://qwenlm.github.io/blog/qwen2.5/) | |
| Tele-FLM | 52B/102B/1TB | 2024-07 | å¤è¯ | éç¨ | [[ð¤HF\]](https://huggingface.co/CofeAI) | / | [CofeAI](https://huggingface.co/CofeAI) | CD | [Tele-FLM Technical Report](https://arxiv.org/pdf/2404.16645) |
| meta-llama-3.1 | 8/70/405B | 2024-07 | å¤è¯ | éç¨ | [[ð¤HF\]](https://huggingface.co/meta-llama) | [llama3](https://github.com/meta-llama/llama3) | [meta-llama](https://github.com/meta-llama) | CD | | |
| internlm2.5-Base | 7B | 2024-07 | ä¸è± | éç¨ | [[ð¤HF\]](https://huggingface.co/internlm) | [InternLM](https://github.com/InternLM/InternLM)[![Star](https://camo.githubusercontent.com/f330929a514fa88e296d3f4aa78863614ccc13d6d1903e4d7b23fd85b69cddba/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f73746172732f496e7465726e4c4d2f496e7465726e4c4d2e7376673f7374796c653d736f6369616c266c6162656c3d53746172)](https://camo.githubusercontent.com/f330929a514fa88e296d3f4aa78863614ccc13d6d1903e4d7b23fd85b69cddba/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f73746172732f496e7465726e4c4d2f496e7465726e4c4d2e7376673f7374796c653d736f6369616c266c6162656c3d53746172) | [InternLM](https://github.com/InternLM) | CD | [ðTechnical Report](https://arxiv.org/abs/2403.17297) | |
| MAP-NEO-Base | 2/7B | 2024-06 | ä¸è± | éç¨ | [ð¤HF](https://huggingface.co/collections/m-a-p/neo-models-66395a5c9662bb58d5d70f04) | [MAP-NEO](https://github.com/multimodal-art-projection/MAP-NEO) | [multimodal-art-projection](https://github.com/multimodal-art-projection) | CD | [Paper](https://arxiv.org/abs/2405.19327) | |
| Nemotron-4-Base | 340B | 2024-06 | å¤è¯ | éç¨ | [ð¤HF](https://huggingface.co/nvidia) | / | [NVIDIA](https://github.com/NVIDIA) | CD | [technical report](https://research.nvidia.com/publication/2024-06_nemotron-4-340b). | |
| Index-Base | 1.9B | 2024-06 | ä¸è± | éç¨ | [ð¤HF](https://huggingface.co/IndexTeam/Index-1.9B-Chat) | [Index-1.9B](https://github.com/bilibili/Index-1.9B) | [bilibili](https://github.com/bilibili) | CD | [Report](https://github.com/bilibili/Index-1.9B/blob/main/Index-1.9B%20%E6%8A%80%E6%9C%AF%E6%8A%A5%E5%91%8A.pdf) | |
| Qwen2-Base | 0.5/2/5/7/72B | 2024-06 | å¤è¯ | éç¨ | [ð¤HF](https://huggingface.co/Qwen) | [Qwen2](https://github.com/QwenLM/Qwen2) | [QwenLM](https://github.com/QwenLM) | CD | [Blog](https://qwenlm.github.io/) | |
| GLM-4-Base | 9B | 2024-06 | å¤è¯ | éç¨ | [ð¤HF](https://huggingface.co/THUDM) | [GLM-4](https://github.com/THUDM/GLM-4) | [THUDM](https://github.com/THUDM) | / | | |
| Yi-1.5-Base | 6/9/34B | 2024-05 | ä¸è± | éç¨ | [ð¤HF](https://huggingface.co/01-ai) | [Yi-1.5](https://github.com/01-ai/Yi-1.5) | [01-ai](https://github.com/01-ai) | CD | [Paper](https://arxiv.org/abs/2403.04652) | |
| DeepSeek-V2-Base | A21B/236B | 2024-05 | ä¸è± | éç¨ | [ð¤HF](https://huggingface.co/deepseek-ai/DeepSeek-V2) | [DeepSeek-V2](https://github.com/deepseek-ai/DeepSeek-V2) | [deepseek-ai](https://github.com/deepseek-ai) | MOE | [Paper](https://github.com/deepseek-ai/DeepSeek-V2/blob/main/deepseek-v2-tech-report.pdf) | |
| Llama-3-Base | 8/70B | 2024-04 | å¤è¯ | éç¨ | [ð¤HF](https://hf-mirror.com/meta-llama) | **[llama3](https://github.com/meta-llama/llama3)** | [Meta Llama](https://github.com/meta-llama) | CD | | |
| Zhinao-Base | 7B | 2024-04 | ä¸è± | éç¨ | [ð¤HF](https://huggingface.co/qihoo360) [ ð¤](https://www.modelscope.cn/models/qihoo360/36
普通网友
- 粉丝: 1127
- 资源: 5292
最新资源
- OA办公自动化管理系统(Struts1.2+Hibernate3.0+Spring2+DWR).rar
- OA办公自动化管理系统(Struts1.2+Hibernate3.0+Spring2+DWR)130224.rar
- shopxx_src.rar
- 聊天系统项目全套技术资料100%好用.zip
- tot-jsp-cms.rar
- s2shDemo.rar
- webdgs.rar
- vijun-1.0-release.rar
- 博客系统网站(JSP+SERVLET+MYSQL).rar
- 博客系统网站(JSP+SERVLET+MYSQL)130222.rar
- 博客系统(struts+hibernate+spring)130225.rar
- 超市综合管理信息系统.rar
- 数据爬虫项目全套技术资料100%好用.zip
- 车辆管理系统(struts+hibernate+spring+oracle)130225.rar
- 车辆管理系统(struts+hibernate+spring+oracle).rar
- 共创在线考试系统(JSP+SERVLET).rar
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈