top | item 41999951

(no title)

jyap | 1 year ago

This 236B model came out around September 6th.

DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.

From: https://huggingface.co/deepseek-ai/DeepSeek-V2.5

discuss

order

genpfault|1 year ago

> To utilize DeepSeek-V2.5 in BF16 format for inference, 80GB*8 GPUs are required.

risho|1 year ago

I wonder if the new mbp can run it at q4.