I have built large scale distributed gpu (96gpus per job) dnn systems and worked on very advanced code bases.
GPT4 massively sped up my ability to create this.
It is a tool and it takes a lot of time to master it. Took me around 3-6 months of every day use to actually figure out how. You need to go back and try to learn it properly, it's easily 3-5x my work output.
It's absolutely worth the money when you look at the whole picture. Also lambda labs never has availability. I actually can schedule a distributed cluster on AWS.
> It's absolutely worth the money when you look at the whole picture.
That highly depends on many things. If you run a business with a relatively steady load that doesn't need to scale quickly multiple times per day, AWS is definitely not for you. Take Let's Encrypt[1] as an example. Just because cloud is the hype doesn't mean it's always worth it.
Edit: Or a personal experience: I had a customer that insisted on building their website on AWS. They weren't expecting high traffic loads and didn't need high availability, so I suggested to just use a VPS for $50 a month. They wanted to go the AWS route. Now their website is super scalable with all the cool buzzwords and it costs them $400 a month to run. Great! And in addition, the whole setup is way more complex to maintain since it's built on AWS instead of just a simple website with a database and some cache.
We built a tool for lambda labs and other clouds that launches a specific instance whenever it becomes available and notifies you. We poll Lambda Labs for availability every 3 seconds. Would this be something that would be useful for you?
Hi I'm Charles. Good at whatever I am currently interested in. I'm looking for consulting work or just a chat if you're looking for advice.
I'm currently interested in full stack deep learning systems for customized LLM work.
Before that I fully designed large scale distributed AI systems for AI researchers.
Before that I led a team for image analysis in a cancer research company.
Before that, I redesigned their instrument and firmware.
Did a lot of electronics before that.