top | item 44504478

(no title)

DelightOne | 7 months ago

How does an e2e test for less capable LLMs look like, you call each LLM one by one? Aren't these tests flaky by the nature of LLMs, how do you deal with that?

discuss

order

No comments yet.