LongChat-13B: An Open-Source Chatbot With 16k Tokens Memory
LongChat-13B: An Open-Source Chatbot With 16k Tokens Memory
com/
Introduction
What is LongChat-13B?
LongChat-13B has several features that make it stand out from other
conversational models. Some of these features are:
Performance Evaluation
During the finer-grained line retrieval test, it was observed that the
Mpt-7b-storywriter model faced a substantial decrease in its regular
performance, plummeting to less than 50% of its usual output. Similarly,
the Chatglm2-6B model did not fare well either. Nonetheless, the
LongChat-13B-16K model showcased remarkable reliability, achieving a
performance level almost on par with GPT-3.5 or Anthropoic-claude
when operating within a context length of 12K.
source - https://lmsys.org/blog/2023-06-29-longchat/
For a more detailed look at the benchmarks and their results, please see
their blog post. The blog post includes information about the model's
training process, its performance on various benchmarks, and more.
If you are interested to learn more about the LongChat-13B model, all
relevant links are provided under the 'source' section at the end of this
article.
Limitation
Conclusion
source
blog post - https://lmsys.org/blog/2023-06-29-longchat/
github repo - https://github.com/DachengLi1/LongChat
Model details - https://huggingface.co/lmsys/longchat-13b-16k
GPTQ Model - https://huggingface.co/TheBloke/LongChat-13B-GPTQ
GGML Model- https://huggingface.co/TheBloke/LongChat-13B-GGML