Hacker Newsnew | past | comments | ask | show | jobs | submit | gracegreg's commentslogin

Qwen2-72B claims better than LLama3-70B, I just found there is an another LLama3 model has comparable performace:

| | Qwen2-72B | Higgs-Llama-3-70B | Llama3-70B-Instruct |

| ---------- | --------- | ----------------- | ------------------- |

| MMLU | 82.3 | 80.8 | 80.2 |

| MMLU-Pro | 64.4 | 63.2 | 56.2 |

| Arena-Hard | 48.1 | 49.6 | 41.1 |

| GPQA | 42.4 | 42.1 | 41.9 |

- https://huggingface.co/bosonai/Higgs-Llama-3-70B


all new models claim to be better than the top SOTA model. Since llama3 dropped, every new model released has claimed to be better than it.


[DISREGARD]


Do you mean 400B? I thought that 70B was released some time ago:

https://huggingface.co/meta-llama/Meta-Llama-3-70B


Jesus, I'm losing it, thank you, you saved me from looking foolish (at least, continuing to :) )


It would be easier to follow if you don’t delete your comments. As it is, the thread does not make much sense.


And? And it's pretty easy to surmise from the comments.


You're thinking of 400B. 70B is out


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: