How can a medium-sized model like Deepseek-V4-Flash be cheaper than a much small... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		littlestymaar 28 days ago \| parent \| context \| favorite \| on: DeepSeek v4 How can a medium-sized model like Deepseek-V4-Flash be cheaper than a much smaller models like Qwen3.5-35B-A3B. It's five times bigger in both total and active parameters!

Ancapistani 28 days ago [–]

I don’t know for sure, but I believe those larger models must be run on nVidia hardware (CUDA), while Deepseek-V4-* can be run on Huawei chips. My assumption is that there is less demand pressure on non-nVidia chips.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact