jaan | 2 years ago

Does it support constrained generation during training?

This is what we need for the large language models I am training for health care use cases.

For example, constraining LLM output is currently done by masking disallowed tokens, and having this Rust-based library would enable novel ways to train LLMs.
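To make the masking idea concrete, here is a minimal sketch in plain Python. It assumes a toy four-token vocabulary and that some constraint engine has already produced the set of allowed token ids for the current step. The names `masked_softmax` and `constrained_nll` are hypothetical and are not part of this library or of the linked projects. Computing the loss on the masked distribution is just one possible way to bring the constraint into training.

```python
import math

def masked_softmax(logits, allowed):
    """Softmax over logits, with tokens outside `allowed` forced to probability 0."""
    if not allowed:
        raise ValueError("the constraint must allow at least one token")
    # Disallowed tokens get -inf, so exp() maps them to exactly 0.
    masked = [x if i in allowed else -math.inf for i, x in enumerate(logits)]
    peak = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in masked]
    total = sum(exps)
    return [e / total for e in exps]

def constrained_nll(logits, allowed, target):
    """Negative log-likelihood of `target` under the masked distribution."""
    if target not in allowed:
        raise ValueError("target token violates the constraint")
    return -math.log(masked_softmax(logits, allowed)[target])

# Toy example: 4-token vocabulary, the (assumed) grammar allows only tokens 0 and 2.
logits = [2.0, 1.0, 0.5, -1.0]
allowed = {0, 2}
probs = masked_softmax(logits, allowed)
loss = constrained_nll(logits, allowed, target=2)
```

At inference time only `masked_softmax` would be needed. During training, the same mask renormalizes the distribution before the loss is taken, so no probability mass is spent on tokens the constraint forbids.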

Relevant work:

https://github.com/epfl-dlab/transformers-CFG

https://neurips.cc/virtual/2023/poster/70782

mmoskal|2 years ago

It's definitely a very exciting direction, which we have not explored at all!