| Rank | Model | Elo Rating | Description |
|---|---|---|---|
| 1 | 🥇 vicuna-13b | 1169 | a chat assistant fine-tuned from LLaMA on user-shared conversations by LMSYS |
| 2 | 🥈 koala-13b | 1082 | a dialogue model for academic research by BAIR |
| 3 | 🥉 oasst-pythia-12b | 1065 | an Open Assistant for everyone by LAION |
| 4 | alpaca-13b | 1008 | a model fine-tuned from LLaMA on instruction-following demonstrations by Stanford |
| 5 | chatglm-6b | 985 | an open bilingual dialogue language model by Tsinghua University |
| 6 | fastchat-t5-3b | 951 | a chat assistant fine-tuned from FLAN-T5 by LMSYS |
| 7 | dolly-v2-12b | 944 | an instruction-tuned open large language model by Databricks |
| 8 | llama-13b | 932 | open and efficient foundation language models by Meta |
| 9 | stablelm-tuned-alpha-7b | 858 | Stability AI language models |
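
To make the rating gaps in the table easier to read, here is a minimal sketch that converts two Elo ratings into an expected head-to-head win probability. It assumes the standard logistic Elo formula with base 10 and a scale of 400; the function name `expected_score` and the `ratings` dictionary are illustrative and not taken from the original article.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected probability that A beats B under the standard Elo model.

    Assumption: base-10 logistic curve with a 400-point scale.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Ratings copied from the leaderboard table above.
ratings = {
    "vicuna-13b": 1169,
    "koala-13b": 1082,
    "stablelm-tuned-alpha-7b": 858,
}

p_top_two = expected_score(ratings["vicuna-13b"], ratings["koala-13b"])
p_top_bottom = expected_score(ratings["vicuna-13b"], ratings["stablelm-tuned-alpha-7b"])

print(f"vicuna-13b vs koala-13b: {p_top_two:.1%}")
print(f"vicuna-13b vs stablelm-tuned-alpha-7b: {p_top_bottom:.1%}")
```

Under these assumptions, the 87-point gap between the top two models corresponds to roughly a 62% expected win rate for vicuna-13b, while the 311-point gap between first and last place corresponds to roughly 86%.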
Original article: Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings
Translated and edited by BimAnt. Please credit the source when republishing.