Try doing a head to head comparison using all LLM tricks available including pro...

mountainriver · 2025-10-19T20:19:45 1760905185

There are tons of problems this simply doesn’t apply to. In the limited API world this may be true but agents are far from reliable

yunwal · 2025-10-19T15:02:02 1760886122

I mean, at the point where you’re writing tools to assist it, we are no longer comparing the performance of 2 LLMs. You’re taking a solution that requires a small amount of expertise, and replacing it with another solution that requires more expertise, and costs more. The question is not “can fine tuning alone do better than every other trick in the book plus a SOTA LLM plus infinite time and money?” The question is: “is fine tuning useful?”

gdiamos · 2025-10-19T15:28:29 1760887709

Fair didn’t seem to matter to users who just wanted to build solutions with reasonable time and budget

echelon · 2025-10-19T14:59:53 1760885993

If your customers can't fine tune, do it for them instead.

gdiamos · 2025-10-19T15:13:01 1760886781

How can you hire enough people to scale that while making the economics work?

Why would they join you rather than founding their own company?

CaptainOfCoit · 2025-10-19T16:28:20 1760891300

> How can you hire enough people to scale that while making the economics work?

Once you (as in you the person) have the expertise, what you need all the people for exactly? To fine-tuning you need to figure out the architecture, how to train, how to infer, pick together the dataset and then run the training (optionally setup a pipeline so the customer can run the "add more data -> train" process themselves). What in this process you need to hire so many people for?

> Why would they join you rather than founding their own company?

Same as always, in any industry, not everyone wants to lead and not everyone wants to follow.

gdiamos · 2025-10-19T17:59:32 1760896772

llm.finetune(data) is a leaky abstraction

Read Andrej’s blog that I linked earlier in the thread if you want to understand why.

CaptainOfCoit · 2025-10-19T18:52:53 1760899973

If it works it works? :shrug:

gdiamos · 2025-10-19T19:44:24 1760903064

The problem is that it doesn’t always work and when it does fail it fails silently.

Debugging requires knowing some small detail about your data distribution or how you did gradient clipping which take time and painstakingly detailed experiments to uncover.

CaptainOfCoit · 2025-10-20T09:54:57 1760954097

> The problem is that it doesn’t always work and when it does fail it fails silently.

Right, but why does that mean you need more employees? You need to figure out how to surface failures, rather than just adding more meat to the problem.

echelon · 2025-10-19T15:24:36 1760887476

> How can you hire enough people to scale that while making the economics work?

Pick the right customers.

> Why would they join you rather than founding their own company?

The network effects of having enough resources in one place. For having other teams deal with the training data, infrastructure, deployment, etc.

gdiamos · 2025-10-19T15:30:39 1760887839

I think you are saying to go after the very high end of the market.

That’s fair, one market segment of this is sometimes called sovereign compute.

Another common model that I have seen is to become the deepmind for one very large and important customer.

I think this works.