Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I see! Do you know what's causing the slowdown for ollama? They should be using the same backend..


Dude, ggerganov is the creator of llama.cpp. Kind of a legend. And of course he is right, you should've used llama.cpp.

Or you can just ask the ollama people about the ollama problems. Ollama is (or was) just a Go wrapper around llama.cpp.


Was. They've been diverging.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: