WingNews logo WingNews
top | new | best | ask | show | jobs
top | item 46699784

(no title)

prats226 | 1 month ago

Instead of markdown -> LLM to get JSON, you can just train a slightly bigger model which you can constrain decode to give JSON rightaway. https://huggingface.co/nanonets/Nanonets-OCR2-3B

We recently published a cookbook for constrained decoding here: https://nanonets.com/cookbooks/structured-llm-outputs/

discuss

order

No comments yet.

powered by hn/api // news.ycombinator.com