top | item 44880143 (no title) mamp | 6 months ago Unfortunately, I think the context rot paper [1] found that the performance degradation when context increased still occurred in models using attention sinks.1. https://research.trychroma.com/context-rot discuss order hn newest giancarlostoro|6 months ago Saw that paper have not had a chance to read it yet, are there other techniques that help then? I assume theres a few different ones used.
giancarlostoro|6 months ago Saw that paper have not had a chance to read it yet, are there other techniques that help then? I assume theres a few different ones used.
giancarlostoro|6 months ago