This is the official repository for Auto-RAG.

Python 193 17 Updated Jan 10, 2025
Python 116 10 Updated Oct 21, 2023

Reproduction of the ACL 2019 paper: Improving Multi-turn Dialogue Modelling with Utterance ReWriter

Python 128 23 Updated Jan 23, 2020

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 325 28 Updated Apr 20, 2024

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 351 32 Updated Apr 23, 2024

Test code of Inverse cloze task for information retrieval

Python 33 5 Updated Jan 10, 2021

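MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.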

3,683 256 Updated Feb 6, 2025

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 451 31 Updated Mar 19, 2024

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 637 61 Updated Jun 1, 2024

Question and Answer based on Anything.

Python 12,448 1,203 Updated Nov 19, 2024

Collaborative Training of Large Language Models in an Efficient Way

Python 411 58 Updated Aug 28, 2024

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

Python 648 59 Updated Apr 9, 2024

Chinese and English multimodal conversational language model

Python 4,129 422 Updated Aug 23, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 947 55 Updated Jan 30, 2024

Naive Bayes-based Context Extension

Python 320 22 Updated Dec 9, 2024

Tuning Llama Models > 7b

Python 2 Updated Jun 20, 2023

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

HTML 8,042 768 Updated Oct 16, 2024
Python 302 21 Updated Apr 6, 2023

Fast and memory-efficient exact attention

Python 15,421 1,454 Updated Feb 11, 2025

LLM inference in C/C++

C++ 73,984 10,674 Updated Feb 12, 2025

ChatYuan: Large Language Model for Dialogue in Chinese and English

Python 1,897 183 Updated Jun 16, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,803 2,223 Updated Jul 29, 2024

EMNLP 2021 - Pre-training architectures for dense retrieval

Python 244 23 Updated Mar 18, 2022

Sampled Softmax Implementation for PyTorch

Python 43 8 Updated Mar 7, 2018

Codes for NeurIPS 2020 paper "Adversarial Weight Perturbation Helps Robust Generalization"

Python 176 19 Updated Feb 18, 2021

Classic algorithm papers the author has read.

1 Updated Feb 18, 2022

Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, yo…

Python 1 Updated Mar 10, 2022

Implementations of several loss functions for imbalanced data, such as focal_loss, dice_loss, DSC Loss, and GHM Loss, among others.

Python 1 Updated Mar 20, 2022