top | item 47182194

dbreunig | 3 days ago

Check out “Recursive Language Models”, or RLMs.

I believe this method works well because it turns a long-context problem (hard for LLMs) into a coding and reasoning problem (much better!). You’re leveraging the last 18 months of coding RL by changing your scaffold.
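A rough sketch of the idea, under loud assumptions: the long document stays in a variable outside the prompt, and the model only gets small tools to inspect it or to ask a sub-question about one slice. The names `ContextTools`, `peek`, `grep`, `sub_query`, and `fake_llm` are made up for illustration and are not taken from the RLM work or from any library; `fake_llm` is a stub standing in for a real model call.

```python
import re
from typing import Callable, List


class ContextTools:
    """Holds a long document outside the prompt and exposes small tools over it."""

    def __init__(self, context: str, llm: Callable[[str], str]):
        self.context = context
        self.llm = llm  # hypothetical model call: prompt in, text out

    def length(self) -> int:
        # Lets the model plan its reads without seeing the text itself.
        return len(self.context)

    def peek(self, start: int, end: int) -> str:
        # Return one character slice of the document.
        return self.context[start:end]

    def grep(self, pattern: str, max_hits: int = 5) -> List[str]:
        # Return up to max_hits lines matching a regular expression.
        hits = [line for line in self.context.splitlines() if re.search(pattern, line)]
        return hits[:max_hits]

    def sub_query(self, question: str, start: int, end: int) -> str:
        # Recursive step: ask the model about one slice only.
        return self.llm(f"Snippet:\n{self.peek(start, end)}\n\nQuestion: {question}")


def fake_llm(prompt: str) -> str:
    # Stub so the sketch runs without any model; reports how much text it was given.
    return f"(model saw {len(prompt)} chars)"


doc = "\n".join(f"line {i}: filler" for i in range(10_000)) + "\nline X: the secret is 42"
tools = ContextTools(doc, fake_llm)
print(tools.grep(r"secret"))
print(tools.sub_query("What is this about?", 0, 200))
```

The point of the design is that no single prompt ever carries the whole document: searching and slicing happen in code, and the model is only called on pieces small enough to handle well.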

koakuma-chan|3 days ago

This seems really weird to me. Isn't that just using LLMs in a specific way? Why come up with a new name "RLM" instead of saying "LLM"? Nothing changes about the model.

vimda|3 days ago

RLMs are a new architecture, but yes, you can mimic an RLM by providing the context through a tool.