(no title)
patelajay285 | 1 year ago
Here's an (now slightly outdated) example of OpenAI GPT-4 => OpenAI GPT-3.5: https://datadreamer.dev/docs/latest/pages/get_started/quick_...
But you can also do GPT-4 to any model on HuggingFace. Or something like Llama-70B to Llama-1B.
For some tasks, this kind of distillation works extremely well given even a few hundred examples of the larger model performing the task.
bangaladore|1 year ago
I'm confused why you are mentioning 3.5 here. The weights aren't public, so you aren't actually running any derivative of GPT-3.5
Or am I mistaken. Can you clarify?
Tiberium|1 year ago
They don't limit it only to open-source models. And you can finetune 3.5 Turbo on OpenAI API.