yvbbrjdr | 4 months ago

I see! Do you know what's causing the slowdown for ollama? They should be using the same backend.

alecco | 4 months ago

Dude, ggerganov is the creator of llama.cpp. Kind of a legend. And of course he is right, you should've used llama.cpp.

Or you can just ask the ollama people about the ollama problems. Ollama is (or was) just a Go wrapper around llama.cpp.

ilc | 4 months ago

Was. They've been diverging.