We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
[TMLR 2024] Efficient Large Language Models: A Survey
1.1k 95
[TOSN 2024] Artificial Intelligence of Things: A Survey
56 2
Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"
Python 135 10
[NeurIPS 2020] "Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?" by Shen Yan, Yu Zheng, Wei Ao, Xiao Zeng, Mi Zhang
Python 50 11
[NeurIPS 2022] "FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction" by Samiul Alam, Luyang Liu, Ming Yan, and Mi Zhang
Python 62 16
[DMLR 2024] FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things
Python 53 10
[NAACL 2025🔥] Official implementation of "MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference"
[ICLR 2025🔥] Official implementation of "D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models"
[2024 ECCV Workshop] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
[ICLR 2022] "Deep AutoAugment" by Yu Zheng, Zhi Zhang, Shen Yan, Mi Zhang
Loading…