Stars
Reproduction of the ACL 2019 paper "Improving Multi-turn Dialogue Modelling with Utterance ReWriter"
Code for our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Unofficial PyTorch/🤗 Transformers (Gemma/Llama3) implementation of "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention"
Test code for the Inverse Cloze Task (ICT) in information retrieval
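The Inverse Cloze Task builds retriever pre-training pairs from unlabeled text: one sentence of a passage is treated as a pseudo-query and the surrounding sentences as its positive context. A minimal sketch of that pair construction (the function name is hypothetical; the original ICT recipe also sometimes keeps the query sentence in the context, which is omitted here for brevity):

```python
import random

def make_ict_example(passage_sentences, rng):
    """Build one Inverse Cloze Task training pair: a randomly chosen
    sentence becomes the pseudo-query, and the remaining sentences of the
    passage form the positive context a retriever should match it to."""
    i = rng.randrange(len(passage_sentences))
    query = passage_sentences[i]
    context = passage_sentences[:i] + passage_sentences[i + 1:]
    return query, context

sentences = ["Paris is in France.", "It is a capital city.", "The Seine runs through it."]
query, context = make_ict_example(sentences, random.Random(0))
```

In-batch negatives (contexts of other examples in the same batch) then turn these pairs into a contrastive training signal.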
MNBVC (Massive Never-ending BT Vast Chinese corpus): a super-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. MNBVC covers not only mainstream culture but also niche subcultures and even "Martian" internet slang. It includes plain-text Chinese data of every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, lyrics, product descriptions, jokes, embarrassing stories, chat logs, and more.
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
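Self-Extend's tuning-free window extension rests on remapping relative positions: nearby tokens keep their exact distances, while distant tokens share coarser, floor-divided "grouped" positions so that no position exceeds what the model saw in training. A minimal sketch of that remapping idea under my reading of the paper (function and parameter names are hypothetical, not the repo's API):

```python
def self_extend_position(rel_dist, neighbor_window, group_size):
    """Map a relative distance to the position id used in attention:
    exact within the neighbor window, grouped (floor-divided) beyond it,
    continuing contiguously past the window edge."""
    if rel_dist < neighbor_window:
        return rel_dist
    # distant tokens fall into groups of `group_size`, compressing the
    # position range so it stays inside the pretrained window
    return neighbor_window + (rel_dist - neighbor_window) // group_size

# distances 8..11 all collapse to grouped position 8 with group_size=4
positions = [self_extend_position(d, 8, 4) for d in range(14)]
```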
Question and Answer based on Anything.
Collaborative Training of Large Language Models in an Efficient Way
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
Chinese and English multimodal conversational language model
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue LLM)
Fast and memory-efficient exact attention
ChatYuan: Large Language Model for Dialogue in Chinese and English
Instruct-tune LLaMA on consumer hardware
EMNLP 2021 - Pre-training architectures for dense retrieval
Sampled Softmax Implementation for PyTorch
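Sampled softmax avoids normalizing over the full vocabulary by computing the softmax over the true class plus a small set of sampled negatives, with each logit corrected by the log of its sampling probability so the estimator stays approximately unbiased. A self-contained sketch of the loss computation (plain Python rather than the repo's PyTorch code; names are illustrative):

```python
import math

def sampled_softmax_loss(logits, true_idx, sampled_idx, q):
    """Sampled-softmax cross-entropy: restrict the partition function to
    the true class and sampled negatives, subtracting log q(class) from
    each logit to correct for the sampling distribution `q`."""
    classes = [true_idx] + list(sampled_idx)
    adjusted = [logits[c] - math.log(q[c]) for c in classes]
    # numerically stable log-sum-exp over the restricted class set
    m = max(adjusted)
    log_z = m + math.log(sum(math.exp(a - m) for a in adjusted))
    return log_z - adjusted[0]  # -log p(true class | restricted set)

# with uniform logits and uniform q, the loss is log(1 + #negatives)
loss = sampled_softmax_loss([0.0] * 10, 0, [1, 2, 3], [0.1] * 10)
```

In training, the negatives would be redrawn every step (e.g. from a log-uniform distribution over the vocabulary), and the full softmax is still used at evaluation time.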
Codes for NeurIPS 2020 paper "Adversarial Weight Perturbation Helps Robust Generalization"
zlh1992 / qlib
Forked from microsoft/qlib. Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, yo…
Implementation of several losses for imbalanced data, such as focal loss, dice loss, DSC loss, and GHM loss
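Focal loss, the first of the losses listed above, down-weights well-classified examples so training focuses on hard ones. A minimal binary sketch following Lin et al.'s formulation FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t) (pure Python for illustration; the repo's versions operate on tensors):

```python
import math

def focal_loss(p, target, alpha=0.25, gamma=2.0):
    """Binary focal loss for one predicted probability `p` of the positive
    class and a 0/1 `target`. The (1 - p_t)**gamma factor shrinks the
    contribution of easy, confidently-correct examples."""
    p_t = p if target == 1 else 1.0 - p
    alpha_t = alpha if target == 1 else 1.0 - alpha
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)

# a confident correct prediction contributes far less than a hard one
easy = focal_loss(0.9, 1)
hard = focal_loss(0.1, 1)
```

With gamma = 0 and alpha = 1 this reduces to ordinary cross-entropy, which is a quick sanity check for any implementation.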