top | item 42341995

(no title)

ulam2 | 1 year ago

No base model? Disappointed.

paxys|1 year ago

The base model is Llama 3.1 70B

eldenring|1 year ago

It is probably the same base model as Llama 3.0.

They mention post-training improvements.

monkmartinez|1 year ago

Interesting comment... What are you doing with base models? Are you a "finetuner"? I have been trying my hand at finetunes on instruct models, and the results have been OK but not awesome. I have a base model downloading now to give that a proper shot.

superkuh|1 year ago

I'm not them, but I still prefer a text-completion style of prompting to a baked-in pre-prompt structure that assumes only a 'chat' metaphor of interaction.

benob|1 year ago

Base models are useful in research for seeing the effect of instruction tuning.