In this post, we explore problems involved in LLM deployment, from GPU shortages to bottlenecks in model performance. These problems have inspired recent developments in distributed training frameworks for LLMs, notably ZeRO-Offload. Here we give an overview of ZeRO-Offload, and in future posts we describe its benefits in depth.
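For readers who want a concrete starting point: ZeRO-Offload ships as part of DeepSpeed and is enabled through its JSON config. Below is a minimal sketch; the model, batch size, and learning rate are placeholders, not recommendations.

```python
import deepspeed
import torch

# Minimal DeepSpeed config enabling ZeRO-Offload: ZeRO stage 2 with
# optimizer states and their updates offloaded to CPU memory.
ds_config = {
    "train_micro_batch_size_per_gpu": 4,
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {
            "device": "cpu",     # keep optimizer states in host RAM
            "pin_memory": True,  # pinned buffers speed up CPU<->GPU copies
        },
    },
    "optimizer": {"type": "Adam", "params": {"lr": 1e-5}},
}

model = torch.nn.Linear(1024, 1024)  # stand-in for an actual LLM

# deepspeed.initialize wraps the model in an engine that handles the CPU
# offloading transparently during engine.backward() / engine.step().
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```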
RAG is great for pulling in additional knowledge, but if you combine it with fine-tuning (i.e., the LLM 'understands' the domain-specific terminology better), it becomes a lot more effective.
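As a rough sketch of what that combination looks like: retrieval supplies the facts, and the fine-tuned model consumes them. Everything here is a toy stand-in; the corpus, the keyword-overlap retriever, and `finetuned_generate` are hypothetical, not a real API.

```python
# Toy sketch: a retriever feeding a fine-tuned model's prompt.

CORPUS = [
    "EBITDA: earnings before interest, taxes, depreciation, and amortization.",
    "A covenant is a condition a borrower must comply with under a loan.",
]

def retrieve(query: str, k: int = 1) -> list[str]:
    """Rank corpus passages by naive keyword overlap with the query."""
    def score(passage: str) -> int:
        return len(set(query.lower().split()) & set(passage.lower().split()))
    return sorted(CORPUS, key=score, reverse=True)[:k]

def finetuned_generate(prompt: str) -> str:
    """Placeholder for the fine-tuned LLM's inference call."""
    return f"<model output for: {prompt[:40]}...>"

def answer(question: str) -> str:
    # RAG supplies the facts; fine-tuning means the model already "speaks"
    # the domain jargon, so it uses the retrieved context more effectively.
    context = "\n".join(retrieve(question))
    return finetuned_generate(f"Context:\n{context}\n\nQuestion: {question}")

print(answer("What does EBITDA stand for?"))
```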
Looks really promising. I wonder if pricing similar to OpenAI's means that Gradient is also(?) bleeding money even if they build a good customer base. Or are these prices sustainable over time?
Yeah, it's even cheaper. Although it looks like it's about the same in proportion to approximate model size/expected quality? They haven't launched any >13B models yet, although they plan to.
This guy used gradient.ai, and he has a Google Colab to try it.