top | item 30921149 (no title) Inufu | 3 years ago On many natural language tasks there can be significant overlap, making it difficult to judge performance. That's why I like more complex code generation tasks such the dataset we used for AlphaCode. discuss order hn newest No comments yet.
No comments yet.