top | item 30921149

(no title)

Inufu | 3 years ago

On many natural language tasks there can be significant overlap, making it difficult to judge performance. That's why I like more complex code generation tasks such the dataset we used for AlphaCode.

discuss

No comments yet.