CAD: Disaggregating Core Attention for Efficient Long-Context LLM Training (hao-ai-lab.github.io) 6 pts| 2 months ago | discuss
Reasoning Without Hesitating: Efficient Cot Through Certainty Probing (hao-ai-lab.github.io) 20 pts| 1 year ago | 5 comments