top | item 46199275

(no title)

maxrumpf | 2 months ago

This is the tech report for a model I helped work on. I'm biased, but it turned out very well.

We essentially let the model learn to retrieve like a human would: Make a first search, read the results, and then make another. This lets the model be vastly better than pre-programmed pipelines. We test this extensively and compare against implementing this with API models (like Sonnet 4.5 and GPT-5.1). SID-1 compares favorably.

Happy to answer any questions or get feedback. First and foremost: Enjoy the read. It's much more detailed than most tech reports.

discuss

order

No comments yet.