LLM大模型排名
Rank | Model | Elo Rating | Description |
---|---|---|---|
1 | 🥇 vicuna-13b | 1169 | a chat assistant fine-tuned from LLaMA on user-shared conversations by LMSYS |
2 | 🥈 koala-13b | 1082 | a dialogue model for academic research by BAIR |
3 | 🥉 oasst-pythia-12b | 1065 | an Open Assistant for everyone by LAION |
4 | alpaca-13b | 1008 | a model fine-tuned from LLaMA on instruction-following demonstrations by Stanford |
5 | chatglm-6b | 985 | an open bilingual dialogue language model by Tsinghua University |
6 | fastchat-t5-3b | 951 | a chat assistant fine-tuned from FLAN-T5 by LMSYS |
7 | dolly-v2-12b | 944 | an instruction-tuned open large language model by Databricks |
8 | llama-13b | 932 | open and efficient foundation language models by Meta |
9 | stablelm-tuned-alpha-7b | 858 | Stability AI language models |
原文链接:Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings
BimAnt翻译整理,转载请标明出处