[**🇨🇳中文**](https://github.com/shibing624/MedicalGPT/blob/main/README.md) | [**🌐English**](https://github.com/shibing624/MedicalGPT/blob/main/README_EN.md) | [**📖文档/Docs**](https://github.com/shibing624/MedicalGPT/wiki) | [**🤖模型/Models**](https://huggingface.co/shibing624)
<div align="center">
<a href="https://github.com/shibing624/MedicalGPT">
<img src="https://github.com/shibing624/MedicalGPT/blob/main/docs/logo.png" height="100" alt="Logo">
</a>
</div>
-----------------
# MedicalGPT: Training Medical GPT Model
[HuggingFace Models](https://huggingface.co/shibing624) | [Star History](https://star-history.com/#shibing624/MedicalGPT&Timeline) | [Contributing](CONTRIBUTING.md) | [License](LICENSE) | [Requirements](requirements.txt) | [Issues](https://github.com/shibing624/MedicalGPT/issues) | [Contact](#Contact)
## 📖 Introduction
**MedicalGPT** trains a medical GPT model with the ChatGPT training pipeline, implementing Continued Pretraining, Supervised Fine-tuning, RLHF (Reward Modeling and Reinforcement Learning), and DPO (Direct Preference Optimization).
<img src="https://github.com/shibing624/MedicalGPT/blob/main/docs/dpo.jpg" width="860" />
- The RLHF training pipeline follows Andrej Karpathy's talk [State of GPT](https://karpathy.ai/stateofgpt.pdf) ([video](https://build.microsoft.com/en-US/sessions/db3f4859-cd30-4445-a0cd-553c3304f8e2))
- The DPO method comes from the paper [Direct Preference Optimization: Your Language Model is Secretly a Reward Model](https://arxiv.org/pdf/2305.18290.pdf)
- The ORPO method comes from the paper [ORPO: Monolithic Preference Optimization without Reference Model](https://arxiv.org/abs/2403.07691)
## 🔥 News
[2024/09/21] v2.3: Added support for the **[Qwen-2.5](https://qwenlm.github.io/zh/blog/qwen2.5/)** model series. See [Release-v2.3](https://github.com/shibing624/MedicalGPT/releases/tag/2.3.0)
[2024/08/02] v2.2: Added role-play model training and a script for generating doctor-patient dialogue SFT data, [role_play_data](https://github.com/shibing624/MedicalGPT/blob/main/role_play_data/README.md). See [Release-v2.2](https://github.com/shibing624/MedicalGPT/releases/tag/2.2.0)
[2024/06/11] v2.1: Added support for the **[Qwen-2](https://qwenlm.github.io/blog/qwen2/)** model series. See [Release-v2.1](https://github.com/shibing624/MedicalGPT/releases/tag/2.1.0)
[2024/04/24] v2.0: Added support for the **[Llama-3](https://huggingface.co/meta-llama)** model series. See [Release-v2.0](https://github.com/shibing624/MedicalGPT/releases/tag/2.0.0)
[2024/04/17] v1.9: Added support for **[ORPO](https://arxiv.org/abs/2403.07691)**; see `run_orpo.sh` for usage. See [Release-v1.9](https://github.com/shibing624/MedicalGPT/releases/tag/1.9.0)
[2024/01/26] v1.8: Added support for fine-tuning the Mixtral mixture-of-experts (MoE) model **[Mixtral 8x7B](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1)**. See [Release-v1.8](https://github.com/shibing624/MedicalGPT/releases/tag/1.8.0)
[2024/01/14] v1.7: Added retrieval-augmented generation (RAG) file-based question answering via [ChatPDF](https://github.com/shibing624/ChatPDF), implemented in `chatpdf.py`; combining a fine-tuned LLM with knowledge-base files improves domain QA accuracy. See [Release-v1.7](https://github.com/shibing624/MedicalGPT/releases/tag/1.7.0)
[2023/10/23] v1.6: Added RoPE interpolation to extend the context length of GPT models; added support for [FlashAttention-2](https://github.com/Dao-AILab/flash-attention) and the **$S^2$-Attn** proposed by [LongLoRA](https://github.com/dvlab-research/LongLoRA) for LLaMA models; added support for [NEFTune](https://github.com/neelsjain/NEFTune), which trains with noise added to embeddings. See [Release-v1.6](https://github.com/shibing624/MedicalGPT/releases/tag/1.6.0)
[2023/08/28] v1.5: Added the [DPO (Direct Preference Optimization)](https://arxiv.org/pdf/2305.18290.pdf) method, which controls model behavior by directly optimizing the language model and effectively learns human preferences. See [Release-v1.5](https://github.com/shibing624/MedicalGPT/releases/tag/1.5.0)
[2023/08/08] v1.4: Released a Chinese-English Vicuna-13B model fine-tuned on the ShareGPT4 dataset, [shibing624/vicuna-baichuan-13b-chat](https://huggingface.co/shibing624/vicuna-baichuan-13b-chat), and the corresponding LoRA model [shibing624/vicuna-baichuan-13b-chat-lora](https://huggingface.co/shibing624/vicuna-baichuan-13b-chat-lora). See [Release-v1.4](https://github.com/shibing624/MedicalGPT/releases/tag/1.4.0)
[2023/08/02] v1.3: Added multi-turn dialogue fine-tuning for LLaMA, LLaMA2, Bloom, ChatGLM, ChatGLM2, and Baichuan models; added domain vocabulary expansion; added a Chinese pretraining dataset and a Chinese ShareGPT fine-tuning dataset. See [Release-v1.3](https://github.com/shibing624/MedicalGPT/releases/tag/1.3.0)
[2023/07/13] v1.1: Released the Chinese medical LLaMA-13B model [shibing624/ziya-llama-13b-medical-merged](https://huggingface.co/shibing624/ziya-llama-13b-medical-merged), a medical model SFT-finetuned from Ziya-LLaMA-13B-v1 with improved medical QA; the full fine-tuned model weights are released. See [Release-v1.1](https://github.com/shibing624/MedicalGPT/releases/tag/1.1)
[2023/06/15] v1.0: Released the Chinese medical LoRA model [shibing624/ziya-llama-13b-medical-lora](https://huggingface.co/shibing624/ziya-llama-13b-medical-lora), a medical model SFT-finetuned from Ziya-LLaMA-13B-v1 with improved medical QA; the fine-tuned LoRA weights are released. See [Release-v1.0](https://github.com/shibing624/MedicalGPT/releases/tag/1.0.0)
[2023/06/05] v0.2: Trained a domain model (medicine as the example) with a four-stage pipeline: continued pretraining, supervised fine-tuning, reward modeling, and reinforcement learning. See [Release-v0.2](https://github.com/shibing624/MedicalGPT/releases/tag/0.2.0)
## 😊 Features
Based on the ChatGPT training pipeline, this project implements training of a domain model: a large language model for the medical domain:
- Stage 1: PT (Continue PreTraining), incremental pretraining of the GPT model on large volumes of domain documents to adapt it to the domain data distribution (optional)
- Stage 2: SFT (Supervised Fine-tuning), build an instruction-tuning dataset and fine-tune the pretrained model on it to align with instruction intent and inject domain knowledge (a sample-format sketch follows this list)
- Stage 3
  - RLHF (Reinforcement Learning from Human Feedback), reinforcement learning of the language model from human feedback, in two steps:
    - RM (Reward Model), build a human-preference ranking dataset and train a reward model to capture human preferences, chiefly the "HHH" principles: "helpful, honest, harmless" (a loss sketch follows this list)
    - RL (Reinforcement Learning), use the reward model to train the SFT model; the generation model updates its policy from rewards or penalties so that it generates higher-quality text that better matches human preferences
  - [DPO (Direct Preference Optimization)](https://arxiv.org/pdf/2305.18290.pdf), which controls model behavior precisely by optimizing the language model directly, without complex reinforcement learning; it learns human preferences effectively and, compared with RLHF, is easier to implement and train, with better results (a loss sketch follows this list)
  - [ORPO](https://arxiv.org/abs/2403.07691), an optimization method that needs no reference model; with ORPO, the LLM learns instruction following and human-preference alignment at the same time (a loss sketch follows this list)
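To make the SFT stage concrete, here is a minimal sketch of one multi-turn training sample in a ShareGPT-style layout; the field names (`conversations`, `from`, `value`) and the output file name are illustrative assumptions, not necessarily this project's exact schema.

```python
# Hypothetical ShareGPT-style SFT sample; field names and file name are assumptions.
import json

sample = {
    "conversations": [
        {"from": "human", "value": "I have had a dry cough for two weeks. What should I do?"},
        {"from": "gpt", "value": "A cough lasting more than two weeks warrants evaluation; consider a chest X-ray and a blood test, and see a doctor if symptoms worsen."},
    ]
}

# Append one JSON object per line (jsonl), a common layout for instruction-tuning data.
with open("sft_train.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(sample, ensure_ascii=False) + "\n")
```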
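The reward-modeling step typically uses a pairwise ranking loss over (chosen, rejected) response pairs. Below is a minimal PyTorch sketch, assuming the reward model already produces one scalar score per response; it is not this project's exact implementation.

```python
import torch
import torch.nn.functional as F

def reward_ranking_loss(chosen_scores: torch.Tensor, rejected_scores: torch.Tensor) -> torch.Tensor:
    """Pairwise ranking loss: push the chosen response's score above the rejected one's."""
    return -F.logsigmoid(chosen_scores - rejected_scores).mean()

# Example: scalar reward scores for a batch of 3 preference pairs.
loss = reward_ranking_loss(torch.tensor([1.2, 0.3, 0.8]), torch.tensor([0.4, 0.5, -0.1]))
```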
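For the DPO stage, the paper's objective reduces to a few lines of PyTorch once per-sequence log-probabilities are available from the policy and a frozen reference model. This is a minimal sketch of that loss, not this project's training script.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta: float = 0.1):
    """DPO loss over a batch of preference pairs (log-probs summed over response tokens)."""
    # Implicit rewards: log-probability ratios of the policy against the reference model.
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Logistic loss on the reward margin; no reward model or RL loop is needed.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```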
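The ORPO objective adds an odds-ratio penalty to the ordinary SFT loss and needs no reference model. A minimal sketch, assuming length-normalized (average per-token) log-likelihoods of the chosen and rejected responses have already been computed:

```python
import torch
import torch.nn.functional as F

def orpo_loss(chosen_avg_logps, rejected_avg_logps, sft_nll, lam: float = 0.1):
    """ORPO = SFT loss on chosen responses + lambda * odds-ratio loss (no reference model)."""
    def log_odds(avg_logps):
        # log odds(y|x) = log P - log(1 - P), where P = exp(average token log-prob).
        return avg_logps - torch.log1p(-torch.exp(avg_logps))
    # Encourage higher odds for chosen responses than for rejected ones.
    odds_ratio_loss = -F.logsigmoid(log_odds(chosen_avg_logps) - log_odds(rejected_avg_logps)).mean()
    return sft_nll + lam * odds_ratio_loss
```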
### Release Models
| Model | Base Model