top | item 46681903 (no title) dumbmrblah | 1 month ago One thing to consider is that this version is a new architecture, so it’ll take time for Llama CPP to get updated. Similar to how it was with Qwen Next. discuss order hn newest cristoperb|1 month ago Apparently it is the same as the DeepseekV3 architecture and already supported by llama.cpp once the new name is added. Here's the PR: https://github.com/ggml-org/llama.cpp/pull/18936 khimaros|1 month ago has been merged
cristoperb|1 month ago Apparently it is the same as the DeepseekV3 architecture and already supported by llama.cpp once the new name is added. Here's the PR: https://github.com/ggml-org/llama.cpp/pull/18936 khimaros|1 month ago has been merged
cristoperb|1 month ago
khimaros|1 month ago