ClaireGz | 5 days ago
Curious from your experiments: at 1M+ context, does communication start dominating vs compute?
I keep seeing cases where bigger context windows are technically possible but don’t translate into better results unless the context is very structured, so I wonder where the real scaling limit ends up being in practice.
DARSHANFOFADIYA | 5 days ago
The quality degradation as context length increases is a whole other science problem.