top | item 38691237

(no title)

TarqDirtyToMe | 2 years ago

I think alignment in this context refers to distribution alignment: “aligning” it so that its outputs steer the target model more efficiently, with fewer tokens.
