top | item 44986453

(no title)

phi-go | 6 months ago

Does this have a compute benefit or could one use different specialized LLM architectures / models for the subnetworks?

discuss

order

No comments yet.