We wanted to make concept for an app using all local models for chat (llama 3.1 8B) and voice (whisper). Deployed using kubernetes and easily scalable not to mention fully open source!
We wanted to make concept for an app using all local models for chat (llama 3.1 8B) and voice (whisper). Deployed using kubernetes and easily scalable not to mention fully open source!