spmurrayzzz | 7 months ago

Might end up being some confusion with the RULER benchmark from NVIDIA given the (somewhat shared) domain: https://github.com/NVIDIA/RULER

EDIT: by "shared" I only mean the adjacency to LLMs/AI/ML. RL is a pretty big differentiator though, and the project looks great.

kcorbitt|7 months ago

Dang, hadn't seen that. Namespace collision strikes again.

swyx|7 months ago

yeah, unfortunately for you this is one of the well-known long-context benchmarks. too late tho, soldier on.