# LLaMA-Adapter: Efficient Fine-tuning of LLaMA
## Released Models
| Name | Approach | Data | Modality | Visual Encoder | Language Model |
| ------------------------------------------------------------ | ------------------------------------------------------ | ------------------------------------------------------------ | ---------------------------------- | -------------- | --------------------- |
| [LLaMA-Adapter V1](./alpaca_finetuning_v1) | prefix, gate | Alpaca | Text | × | LLaMA-7B |
| [LLaMA-Adapter V2 dialog](./llama_adapter_v2_chat65b) | scale, bias, norm | ShareGPT | Text | × | LLaMA-65B |
| [LLaMA-Adapter V2 multimodal](./llama_adapter_v2_multimodal7b) | [P] prefix, projection, gate <br />[F] bias, norm | [P] Image-Text-V1<br />[F] GPT4LLM, LLaVA | Image&Text | CLIP-ViT-L/14 | LLaMA-7B |
| [LLaMA-Adapter V2.1 multimodal](./llama_adapter_v2_multimodal7b) | [P] prefix, projection, gate <br />[F] bias, norm, lora | [P] Image-Text-V1<br />[F] GPT4LLM, LLaVA, VQAv2 | Image&Text | CLIP-ViT-L/14 | LLaMA-7B |
| [ImageBind-LLM](./imagebind_LLM) | [P] prefix, projection, gate<br />[F] bias, norm, lora | [P] Image-Text-V1<br />[F] [Instruction Following](https://github.com/OpenGVLab/LLaMA-Adapter/blob/main/imagebind_LLM/docs/train.md#data-1) | ImageBind Modalities + Point Cloud | imagebind_huge | Open-Chinese-LLaMA-7B |
| ImageBind-dialog | [P] prefix, projection, gate<br />[F] bias, norm, lora | [P] Image-Text-V1<br />[F] LLaVA, ShareGPT | ImageBind Modalities + Point Cloud | imagebind_huge | Open-Chinese-LLaMA-7B |
+ [P] means **P**re-train and [F] means **F**ine-tune
+ **Image-Text-V1** is a concatenation of LAION400M, COYO, MMC4, SBU, Conceptual Captions, and COCO
+ **ImageBind Modalities** include image, video, text, audio, depth, thermal, IMU
+ **ImageBind-dialog** will be released soon
## Overview
Efficiency Comparison:
| Model | Trainable Parameters | Storage Space | Training Time |
| :-----: | :-----: | :-----: | :-----: |
| [Alpaca](https://github.com/tatsu-lab/stanford_alpaca) | 7B | 13 GB | 3 hours |
| LLaMA-Adapter | 1.2M | 4.7 MB | 1 hour |
By inserting adapters into LLaMA's transformer layers, our method introduces only **1.2M** learnable parameters and turns LLaMA into an instruction-following model within **1 hour** of training. To stabilize training in the early stages, we propose a novel **Zero-init Attention** with a zero gating mechanism that adaptively incorporates the instruction signals. After fine-tuning, LLaMA-Adapter can generate high-quality instruction-following responses, comparable to the fully fine-tuned [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca) and [Alpaca-LoRA](https://github.com/tloen/alpaca-lora).
<div align="center">
<img src="docs/pipeline.png"/ width="90%">
</div>
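For intuition, below is a minimal PyTorch sketch of the zero-init attention described above: learnable adaptation prompts are prepended to the keys and values of a frozen attention layer, and their contribution is scaled by a gate initialized to zero, so training starts exactly from the frozen model's behavior. The class and parameter names (`ZeroInitPrefixAttention`, `prompt_len`, the per-head `gate`) are illustrative rather than taken from this repository, and LLaMA's causal mask and rotary embeddings are omitted for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ZeroInitPrefixAttention(nn.Module):
    """Minimal sketch of zero-init attention with zero gating.

    Learnable adaptation prompts are prefixed to the keys/values of a frozen
    attention layer; their attention scores are scaled by a gate that starts
    at zero. Names and shapes are illustrative, not the repository's code.
    """

    def __init__(self, dim: int, n_heads: int, prompt_len: int = 10):
        super().__init__()
        self.n_heads, self.head_dim = n_heads, dim // n_heads
        # Frozen pretrained projections (weights would be loaded from LLaMA).
        self.wq = nn.Linear(dim, dim, bias=False)
        self.wk = nn.Linear(dim, dim, bias=False)
        self.wv = nn.Linear(dim, dim, bias=False)
        self.wo = nn.Linear(dim, dim, bias=False)
        for p in self.parameters():
            p.requires_grad = False
        # Only the adaptation prompt and the zero-initialized gate are trainable.
        self.prompt = nn.Parameter(torch.randn(1, prompt_len, dim) * 0.02)
        self.gate = nn.Parameter(torch.zeros(1, n_heads, 1, 1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, _ = x.shape
        split = lambda z: z.view(b, -1, self.n_heads, self.head_dim).transpose(1, 2)
        q, k, v = split(self.wq(x)), split(self.wk(x)), split(self.wv(x))
        pk = split(self.wk(self.prompt.expand(b, -1, -1)))
        pv = split(self.wv(self.prompt.expand(b, -1, -1)))

        scale = self.head_dim ** -0.5
        scores = (q @ k.transpose(-2, -1)) * scale     # tokens -> tokens
        p_scores = (q @ pk.transpose(-2, -1)) * scale  # tokens -> prompt
        # Softmax the two parts separately; the prompt part contributes
        # nothing at initialization because the gate starts at zero.
        attn = F.softmax(scores, dim=-1)
        p_attn = F.softmax(p_scores, dim=-1) * self.gate.tanh()
        out = attn @ v + p_attn @ pv
        return self.wo(out.transpose(1, 2).reshape(b, t, -1))
```

In the released configuration, such prompts are inserted only into the top transformer layers (see the `--adapter_layer 30` and `--adapter_len 10` flags in the training command below), which is what keeps the trainable parameter count at 1.2M.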
Our approach can be readily extended to **Multi-modal Input Instructions**. The reasoning framework of the image-conditioned LLaMA-Adapter for [ScienceQA](https://scienceqa.github.io/) is shown below; the same framework applies to other modalities, such as audio and video.
<div align="center">
<img src="docs/multimodal.png"/ width="90%">
</div>
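The multi-modal variant can be sketched in the same spirit: a global visual feature from the frozen encoder (CLIP-ViT-L/14 in the table above) is projected into LLaMA's embedding space and added to the adaptation prompts, so the gated attention above conditions generation on the image. The module and argument names below are illustrative, not the repository's exact code.

```python
import torch
import torch.nn as nn

class VisualPromptInjector(nn.Module):
    """Sketch: project a pooled visual feature from a frozen encoder and add
    it to the adaptation prompts before the gated attention (names illustrative)."""

    def __init__(self, visual_dim: int, llama_dim: int = 4096):
        super().__init__()
        self.proj = nn.Linear(visual_dim, llama_dim)  # learnable projection

    def forward(self, prompt: torch.Tensor, visual_feat: torch.Tensor) -> torch.Tensor:
        # prompt:      (batch, prompt_len, llama_dim) learnable adaptation prompts
        # visual_feat: (batch, visual_dim) pooled feature from the frozen encoder
        return prompt + self.proj(visual_feat).unsqueeze(1)
```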
## Setup
Here is a from-scratch setup script for **LLaMA-Adapter V1**:
```bash
conda create -n llama_adapter -y python=3.8
conda activate llama_adapter
# install pytorch
conda install pytorch cudatoolkit -c pytorch -y
# install dependency and llama-adapter
pip install -r requirements.txt
pip install -e .
```
**Note**: **To set up other models**, please refer to [llama_adapter_v2_chat65b](llama_adapter_v2_chat65b), [llama_adapter_v2_multimodal7b](llama_adapter_v2_multimodal7b) and [imagebind_LLM](imagebind_LLM) for more details.
## Inference
Please request access to the pre-trained LLaMA weights via [this form](https://forms.gle/jk851eBVbX1m5TAv5) (official) or download LLaMA-7B from [Hugging Face](https://huggingface.co/nyanko7/LLaMA-7B/tree/main) (unofficial). Then, download the LLaMA-Adapter weights from [here](https://github.com/OpenGVLab/LLaMA-Adapter/releases/download/v.1.0.0/llama_adapter_len10_layer30_release.pth). Below, `TARGET_FOLDER` denotes the directory containing the downloaded LLaMA weights, and `ADAPTER_PATH` the path to the downloaded adapter weights.
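Before running the command below, you can optionally sanity-check the file layout. This is a hedged sketch assuming the standard LLaMA release layout (`tokenizer.model` at the top level, with `params.json` and `consolidated.00.pth` inside a model directory such as `7B/`); adjust the paths if your download is organized differently.

```python
import os
import sys

# Hedged sanity check for the weight layout assumed by the inference command.
# "7B" stands in for the model directory (the `model_size` placeholder below).
target_folder = os.environ.get("TARGET_FOLDER", "./LLaMA")
adapter_path = os.environ.get("ADAPTER_PATH", "./llama_adapter_len10_layer30_release.pth")

expected = [
    os.path.join(target_folder, "tokenizer.model"),
    os.path.join(target_folder, "7B", "params.json"),
    os.path.join(target_folder, "7B", "consolidated.00.pth"),
    adapter_path,
]
missing = [p for p in expected if not os.path.exists(p)]
if missing:
    sys.exit("Missing files:\n" + "\n".join(missing))
print("All expected weight files were found.")
```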
Here is an example of generating instruction-following sentences with the LLaMA-7B model and our LLaMA-Adapter (replace `model_size` below with the name of the model directory, e.g. `7B`):
```bash
torchrun --nproc_per_node 1 example.py \
--ckpt_dir $TARGET_FOLDER/model_size \
--tokenizer_path $TARGET_FOLDER/tokenizer.model \
--adapter_path $ADAPTER_PATH
```
## Training
We release the simple fine-tuning code of LLaMA-Adapter on the LLaMA-7B model [here](alpaca_finetuning_v1), which allows effortless reproduction with minimal dependencies. We will soon release the fine-tuning code for LLaMA-65B and the multi-modal LLaMA-Adapter.
Please download the 52K instruction-following training [data](https://github.com/tatsu-lab/stanford_alpaca/blob/main/alpaca_data.json) from Stanford Alpaca and put it under `DATA_PATH`. Then run:
```bash
cd alpaca_finetuning_v1
torchrun --nproc_per_node 8 finetuning.py \
--model Llama7B_adapter \
--llama_model_path $TARGET_FOLDER/ \
--data_path $DATA_PATH/alpaca_data.json \
--adapter_layer 30 \
--adapter_len 10 \
--max_seq_len 512 \
--batch_size 4 \
--epochs 5 \
--warmup_epochs 2 \
--blr 9e-3 \
--weight_decay 0.02 \
--output_dir ./checkpoint/
```
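Before launching the run, you may want to peek at the training data. The snippet below is a small sketch assuming `DATA_PATH` is set as above and that the file follows the standard Alpaca schema (a JSON list of `instruction`/`input`/`output` records).

```python
import json
import os

# Hedged sketch: inspect the downloaded Alpaca data before training.
data_path = os.path.join(os.environ.get("DATA_PATH", "."), "alpaca_data.json")
with open(data_path) as f:
    data = json.load(f)

print(f"{len(data)} training examples")  # ~52K in the Stanford Alpaca release
print(json.dumps(data[0], indent=2))     # one full instruction record
```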
## Comparison with Other Methods
### Instruction 1:
How can I develop my critical thinking skills? Give me three suggestions.
* **LLaMA-Adapter V2:**
1. Read widely: Reading books, articles, and other sources of information can help you develop your critical thinking skills. By exposing yourself to different perspectives and ideas, you can learn to analyze and evaluate information more effectively.
2. Practice mindfulness: Mindfulness is the practice of being present in the moment and paying attention to your thoughts and feelings. By being more aware of your thoughts and emotions, you can better understand and evaluate them. This can help you develop a more objective and rational approach to thinking.
3. Engage in open-minded discussions: Join a discussion group or debate club where you can engage in open-minded discussions with others. This can help you develop your critical thinking skills by challenging your assumptions and encouraging you to consider different perspectives.
* **LLaMA-Adapter V1:**
1. Read widely and critically.
2. Engage in discussions with people who have different perspectives.
3. Ask questions and seek out answers.
### Instruction 2:
As a superhero, how would you explain your origin story to a curious child?
* **LLaMA-Adapter V2:**
As a superhero, I have a very interesting and exciting origin story to share with you. Once upon a time, there was a young boy named John who lived in a small town. John was always curious and wanted to help others, so he spent his days exploring the world around him and learning about different cultures and people. One day, while he was out exploring, he stumbled upon a mysterious cave. Inside the cave, John found a strange device that had the power to give him incredible abilities. With these abilities, John could fly, shoot lasers from his eyes, and even turn invisible.
John knew that he had to use the