SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation
基本信息 📝 原文链接: https://arxiv.org/abs/2412.13649👥 作者: Jialong Wu, Zhenglin Wang, Linhai Zhang, Yilong Lai, Yulan He, Deyu Zhou🏷️ 关键词: large language models, Key-Value Cache📚 分类: 机器学习 摘要 …
2025-10-25