add-sparse-method

Category: Development & Programming | Uploader: CURRENTF | Downloads: 0 | Version: v1.0 (latest)

Add or refactor a first-class Sparse-vLLM sparse method alongside vanilla, SnapKV, OmniKV, QuEST, and DeltaKV. Use when Codex needs to introduce a new `vllm_sparse_method`, move method logic out of `attention.py` or `utils/`, add method-specific cache metadata or decode-time view building, wire config and registration, and preserve the repo's cache-manager-first architecture.
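As a rough illustration of what "wire config and registration" could look like, the sketch below registers method classes under a name and selects one from a `vllm_sparse_method` config field. Only the field name `vllm_sparse_method` comes from the description above; `SparseMethod`, `SPARSE_METHODS`, `register_sparse_method`, `build_decode_view`, `Config`, `create_sparse_method`, and the toy `recent_window` method are hypothetical names invented for this example and are not taken from the Sparse-vLLM repository.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Type


class SparseMethod:
    """Hypothetical base class: picks which cached token positions decode attends to."""

    name = "vanilla"

    def build_decode_view(self, num_cached_tokens: int) -> List[int]:
        # Vanilla attention keeps every cached position.
        return list(range(num_cached_tokens))


# Hypothetical registry mapping a method name to its class.
SPARSE_METHODS: Dict[str, Type[SparseMethod]] = {"vanilla": SparseMethod}


def register_sparse_method(name: str) -> Callable[[Type[SparseMethod]], Type[SparseMethod]]:
    """Class decorator that adds a method to the registry under `name`."""

    def decorator(cls: Type[SparseMethod]) -> Type[SparseMethod]:
        if name in SPARSE_METHODS:
            raise ValueError(f"sparse method already registered: {name!r}")
        cls.name = name
        SPARSE_METHODS[name] = cls
        return cls

    return decorator


@register_sparse_method("recent_window")
class RecentWindow(SparseMethod):
    """Toy method (an assumption for illustration): keep only the newest `window` positions."""

    window = 4

    def build_decode_view(self, num_cached_tokens: int) -> List[int]:
        start = max(0, num_cached_tokens - self.window)
        return list(range(start, num_cached_tokens))


@dataclass
class Config:
    # The field name mirrors the `vllm_sparse_method` option named in the description.
    vllm_sparse_method: str = "vanilla"


def create_sparse_method(config: Config) -> SparseMethod:
    """Look up the configured method and instantiate it, failing loudly on unknown names."""
    try:
        cls = SPARSE_METHODS[config.vllm_sparse_method]
    except KeyError:
        raise ValueError(f"unknown vllm_sparse_method: {config.vllm_sparse_method!r}") from None
    return cls()
```

With this shape, a new method lives in its own class and the rest of the code only sees the registry lookup, which is one way to keep method logic out of shared modules such as `attention.py` or `utils/`.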

Changelog: Source: GitHub https://github.com/CURRENTF/Sparse-vLLM

Directory structure

Current level: skills/add-sparse-method/

SKILL.md
