The DeepSeek R1 model people are freaking out about, runs better with more compute because it's a chain of thoughts model.