Skip to content

RC4ML/Bi-KV

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 

Repository files navigation

Bi-KV

Bipartite KVCache

Requirements

pip install transformers
pip install sentencepiece
pip install flashinfer -i https://flashinfer.ai/whl/cu121/torch2.4/
pip install vllm

Run

cd Bi-KV
KVCACHE_NUM=5 WORKER_NUM=5 python init.py  # set number of kvcache and worker 

Distributed Run

cd Bi-KV
bash run.sh

About

Bipartite KVCache

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •