Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
littlestymaar
28 days ago
|
parent
|
context
|
favorite
| on:
DeepSeek v4
How can a medium-sized model like Deepseek-V4-Flash be cheaper than a much smaller models like Qwen3.5-35B-A3B.
It's five times bigger in both total and active parameters!
Ancapistani
28 days ago
[–]
I don’t know for sure, but I believe those larger models must be run on nVidia hardware (CUDA), while Deepseek-V4-* can be run on Huawei chips. My assumption is that there is less demand pressure on non-nVidia chips.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
It's five times bigger in both total and active parameters!