(no title)
calebkaiser | 4 months ago
As the author points out, many of the patterns are fundamentally about in-context learning, and this in particular has been subject to a ton of research from the mechanistic interpretability crew. If you're curious, I think this line of research is fascinating: https://transformer-circuits.pub/2022/in-context-learning-an...
sgt101|3 months ago
unknown|3 months ago
[deleted]