colab使用本地数据集微调llama3-8b模型

丹宇码农

已于 2024-05-15 15:22:01 修改

阅读量1.5k

点赞数 5

CC 4.0 BY-SA版权

分类专栏： AI 文章标签：微调 unsloth colab 云端硬盘 llama3-8b LoRa python

于 2024-05-15 15:19:20 首次发布

本文链接：https://blog.csdn.net/happyweb/article/details/138908588

在Google的Colab上面采用unsloth,trl等库，训练数据集来自Google的云端硬盘，微调llama3-8b模型，进行推理验证模型的微调效果。

保存模型到Google的云端硬盘可以下载到本地供其它使用。

准备工作：将训练数据集上传到google的云端硬盘根目录下，文件名就叫做train.json

train.json里面的数据格式如下：

[
{
"instruction": "你好",
"output": "你好，我是智能助手胖胖"
},
{
"instruction": "hello",
"output": "Hello! I am 智能助手胖胖, an AI assistant developed by 丹宇码农. How can I assist you ?"
}

......

]

采用unsloth库、trl库、transformers等库。

直接上代码：

%%capture
# Installs Unsloth, Xformers (Flash Attention) and all other packages!
!pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
!pip install --no-deps "xformers<0.0.26" trl peft accelerate bitsandbytes

from uns