LLM大模型排名
| Rank | Model | Elo Rating | Description |
|---|---|---|---|
| 1 | 🥇 vicuna-13b | 1169 | a chat assistant fine-tuned from LLaMA on user-shared conversations by LMSYS |
| 2 | 🥈 koala-13b | 1082 | a dialogue model for academic research by BAIR |
| 3 | 🥉 oasst-pythia-12b | 1065 | an Open Assistant for everyone by LAION |
| 4 | alpaca-13b | 1008 | a model fine-tuned from LLaMA on instruction-following demonstrations by Stanford |
| 5 | chatglm-6b | 985 | an open bilingual dialogue language model by Tsinghua University |
| 6 | fastchat-t5-3b | 951 | a chat assistant fine-tuned from FLAN-T5 by LMSYS |
| 7 | dolly-v2-12b | 944 | an instruction-tuned open large language model by Databricks |
| 8 | llama-13b | 932 | open and efficient foundation language models by Meta |
| 9 | stablelm-tuned-alpha-7b | 858 | Stability AI language models |
原文链接:Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings
BimAnt翻译整理,转载请标明出处