Qwen-14B
Open Sourceby Alibaba 路 Released 2024-01-01
61.1
avg score
N/A
Input Price
N/A
Output Price
N/A
Context Window
text
Type
Tested on 6 benchmarks with 61.1% average. Top scores: ARC AI2 (79.2%), LAMBADA (71.1%), GSM8K (61.3%).
Benchmark Scores
| Benchmark | Category | Score | Bar |
|---|---|---|---|
| ARC AI2 | knowledge | 79.2 | |
| LAMBADA | knowledge | 71.1 | |
| GSM8K | math | 61.3 | |
| PIQA | knowledge | 59.8 | |
| MMLU | knowledge | 55.1 | |
| BBH | reasoning | 40.0 |