top | item 43500456

FFN Fusion: Rethinking Sequential Computation in Large Language Models

5 points| Dezash | 11 months ago |arxiv.org

discuss

order

No comments yet.