top | item 46905687

(no title)

weisser | 25 days ago

What sort of context do you give the APIs when you are starting the game? Does it need to learn the rules as it goes?

discuss

mbh159|25 days ago

We have a standard harness for each of the model's that we test. Each prompt includes the rules, access to memory, and a lookup of the complete ruleset. The prompt adapts adding legal actions per turn and guidance depending on the stage of the game (updated based on the technological progress of the player).

Unlike RL algorithms these LLMs wouldn't learn quick enough without the prior knowledge the harness provides

weisser|25 days ago

what do you use for memory?