LLama3中文大模型进行指令微调的中文聊天语言模型

共13个文件

json：6个

safetensors：4个

gitattributes：1个

版权申诉

语言模型

140 浏览量 2024-06-06 20:28:48 上传评论收藏 2.25MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

Llama3-8B-Chinese-Chat-main.zip （13个子文件）

Llama3-8B-Chinese-Chat-main

model-00003-of-00004.safetensors 135B

.gitattributes 1KB

model-00002-of-00004.safetensors 135B

LICENSE 8KB

tokenizer.json 8.66MB

generation_config.json 147B

model.safetensors.index.json 23KB

model-00001-of-00004.safetensors 135B

config.json 649B

tokenizer_config.json 50KB

special_tokens_map.json 97B

README.md 41KB

model-00004-of-00004.safetensors 135B

--- license: llama3 library_name: transformers pipeline_tag: text-generation base_model: meta-llama/Meta-Llama-3-8B-Instruct language: - en - zh tags: - llama-factory - orpo --- ð We included all instructions on how to download, use, and reproduce our various kinds of models at [this GitHub repo](https://github.com/Shenzhi-Wang/Llama3-Chinese-Chat). If you like our models, we would greatly appreciate it if you could star our Github repository. Additionally, please click "like" on our HuggingFace repositories. Thank you! âï¸âï¸âï¸NOTICE: The main branch contains the files for Llama3-8B-Chinese-Chat-**v2.1**. If you want to use our Llama3-8B-Chinese-Chat-**v1**, please refer to [the `v1` branch](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat/tree/v1); if you want to use our Llama3-8B-Chinese-Chat-**v2**, please refer to [the `v2` branch](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat/tree/v2). âï¸âï¸âï¸NOTICE: For optimal performance, we refrain from fine-tuning the model's identity. Thus, inquiries such as "Who are you" or "Who developed you" may yield random responses that are not necessarily accurate. # Updates - ððð [May 6, 2024] We now introduce Llama3-8B-Chinese-Chat-**v2.1**! Compared to v1, the training dataset of v2.1 is **5x larger** (~100K preference pairs), and it exhibits significant enhancements, especially in **roleplay**, **function calling**, and **math** capabilities! Compared to v2, v2.1 surpasses v2 in **math** and is **less prone to including English words in Chinese responses**. The training dataset of Llama3-8B-Chinese-Chat-v2.1 will be released soon. If you love our Llama3-8B-Chinese-Chat-v1 or v2, you won't want to miss out on Llama3-8B-Chinese-Chat-v2.1! - ð¥ We provide an online interactive demo for Llama3-8B-Chinese-Chat-v2 [here](https://huggingface.co/spaces/llamafactory/Llama3-8B-Chinese-Chat). Have fun with our latest model! - ð¥ We provide the official **Ollama model for the q4_0 GGUF** version of Llama3-8B-Chinese-Chat-v2.1 at [wangshenzhi/llama3-8b-chinese-chat-ollama-q4](https://ollama.com/wangshenzhi/llama3-8b-chinese-chat-ollama-q4)! Run the following command for quick use of this model: `ollama run wangshenzhi/llama3-8b-chinese-chat-ollama-q4`. - ð¥ We provide the official **Ollama model for the q8_0 GGUF** version of Llama3-8B-Chinese-Chat-v2.1 at [wangshenzhi/llama3-8b-chinese-chat-ollama-q8](https://ollama.com/wangshenzhi/llama3-8b-chinese-chat-ollama-q8)! Run the following command for quick use of this model: `ollama run wangshenzhi/llama3-8b-chinese-chat-ollama-q8`. - ð¥ We provide the official **Ollama model for the f16 GGUF** version of Llama3-8B-Chinese-Chat-v2.1 at [wangshenzhi/llama3-8b-chinese-chat-ollama-fp16](https://ollama.com/wangshenzhi/llama3-8b-chinese-chat-ollama-fp16)! Run the following command for quick use of this model: `ollama run wangshenzhi/llama3-8b-chinese-chat-ollama-fp16`. - ð¥ We provide the official **q4_0 GGUF** version of Llama3-8B-Chinese-Chat-**v2.1** at https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit! - ð¥ We provide the official **q8_0 GGUF** version of Llama3-8B-Chinese-Chat-**v2.1** at https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit! - ð¥ We provide the official **f16 GGUF** version of Llama3-8B-Chinese-Chat-**v2.1** at https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-f16! <details> <summary>Updates for Llama3-8B-Chinese-Chat-v2 [CLICK TO EXPAND]</summary> - ð¥ Llama3-8B-Chinese-v2's link: https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat/tree/v2 - ð¥ We provide the official f16 GGUF version of Llama3-8B-Chinese-Chat-**v2** at https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-f16/tree/v2! - ð¥ We provide the official 8bit-quantized GGUF version of Llama3-8B-Chinese-Chat-**v2** at https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit/tree/v2! - ð¥ We provide an online interactive demo for Llama3-8B-Chinese-Chat-v2 (https://huggingface.co/spaces/llamafactory/Llama3-8B-Chinese-Chat). Have fun with our latest model! - ððð [Apr. 29, 2024] We now introduce Llama3-8B-Chinese-Chat-**v2**! Compared to v1, the training dataset of v2 is **5x larger** (~100K preference pairs), and it exhibits significant enhancements, especially in **roleplay**, **function calling**, and **math** capabilities! If you love our Llama3-8B-Chinese-Chat-v1, you won't want to miss out on Llama3-8B-Chinese-Chat-v2! </details> <details> <summary>Updates for Llama3-8B-Chinese-Chat-v1 [CLICK TO EXPAND]</summary> - ð¥ Llama3-8B-Chinese-v1's link: https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat/tree/v1 - ð¥ We provide the official Ollama model for the f16 GGUF version of Llama3-8B-Chinese-Chat-**v1** at [wangshenzhi/llama3-8b-chinese-chat-ollama-f16](https://ollama.com/wangshenzhi/llama3-8b-chinese-chat-ollama-f16)! Run the following command for quick use of this model: `ollama run wangshenzhi/llama3-8b-chinese-chat-ollama-fp16`. - ð¥ We provide the official Ollama model for the 8bit-quantized GGUF version of Llama3-8B-Chinese-Chat-**v1** at [wangshenzhi/llama3-8b-chinese-chat-ollama-q8](https://ollama.com/wangshenzhi/llama3-8b-chinese-chat-ollama-q8)! Run the following command for quick use of this model: `ollama run wangshenzhi/llama3-8b-chinese-chat-ollama-q8`. - ð¥ We provide the official f16 GGUF version of Llama3-8B-Chinese-Chat-**v1** at [shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-f16-v1](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-f16/tree/v1)! - ð¥ We provide the official 8bit-quantized GGUF version of Llama3-8B-Chinese-Chat-**v1** at [shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit-v1](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit/tree/v1)! - ð If you are in China, you can download our **v1** model from our [Gitee AI repository](https://ai.gitee.com/hf-models/shenzhi-wang/Llama3-8B-Chinese-Chat). </details> # Model Summary Llama3-8B-Chinese-Chat is an instruction-tuned language model for Chinese & English users with various abilities such as roleplaying & tool-using built upon the Meta-Llama-3-8B-Instruct model. Developed by: [Shenzhi Wang](https://shenzhi-wang.netlify.app) (çææ§) and [Yaowei Zheng](https://github.com/hiyouga) (éèå¨) - License: [Llama-3 License](https://llama.meta.com/llama3/license/) - Base Model: Meta-Llama-3-8B-Instruct - Model Size: 8.03B - Context length: 8K # 1. Introduction This is the first model specifically fine-tuned for Chinese & English user through ORPO [1] based on the [Meta-Llama-3-8B-Instruct model](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct). **Compared to the original [Meta-Llama-3-8B-Instruct model](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct), our Llama3-8B-Chinese-Chat-v1 model significantly reduces the issues of "Chinese questions with English answers" and the mixing of Chinese and English in responses.** **Compared to [Llama3-8B-Chinese-Chat-v1](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat/tree/v1), our Llama3-8B-Chinese-Chat-v2 model significantly increases the training data size (from 20K to 100K), which introduces great performance enhancement, especially in roleplay, tool using, and math.** [1] Hong, Jiwoo, Noah Lee, and James Thorne. "Reference-free Monolithic Preference Optimization with Odds Ratio." arXiv preprint arXiv:2403.07691 (2024). Training framework: [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory). Training details: - epochs: 2 - learning rate: 3e-6 - learning rate scheduler type: cosine - Warmup ratio: 0.1 - cutoff len (i.e. context length): 8192 - orpo beta (i.e. $\lambda$ in the ORPO paper): 0.05 - global batch size: 128 - fine-tuning type: full parameters - optimizer: paged_adamw_32bit <details> <summary>To reproduce the model [CLICK TO EXPAND]</summary> To reproduce Llama3-8B-Chinese-Cha

评论收藏

内容反馈

版权申诉