
pruning-distillation guidance or docs for Multimodal model #12975


Open · xduzhangjiayu opened this issue Apr 11, 2025 · 2 comments

@xduzhangjiayu

Hi,
Does this project have any guidance or docs on how to apply pruning-distillation to a multimodal model such as Qwen2.5-VL-7B or Qwen2.5-Omni-7B? Currently I can only find the docs for Llama.
Looking forward to the reply, thanks!

@sharathts
Collaborator

Hi @xduzhangjiayu - we currently support pruning and distillation of LLMs only. The same can be applied to compress the LLM part of VLMs, since it tends to be the bottleneck.
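
The core of that recipe is ordinary logit distillation on the language backbone: pruning produces the student, and the original model serves as the teacher, the same prune-then-distill pattern the existing Llama guidance follows. Below is a minimal sketch in plain PyTorch/Transformers (not the official NeMo recipe); the model IDs are placeholders, and the teacher and student must share the same tokenizer/vocabulary.

```python
# Minimal sketch of logit distillation for the LLM part of a VLM.
# The model IDs are placeholders; in practice the teacher would be the language
# backbone extracted from the VLM and the student a pruned/smaller copy of it.
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

TEACHER_ID = "org/teacher-llm"   # placeholder: the original language backbone
STUDENT_ID = "org/student-llm"   # placeholder: the pruned student

tokenizer = AutoTokenizer.from_pretrained(TEACHER_ID)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

teacher = AutoModelForCausalLM.from_pretrained(TEACHER_ID).eval()
student = AutoModelForCausalLM.from_pretrained(STUDENT_ID)

optimizer = torch.optim.AdamW(student.parameters(), lr=1e-5)
T = 2.0  # softmax temperature for distillation

def distill_step(batch_texts):
    inputs = tokenizer(batch_texts, return_tensors="pt", padding=True, truncation=True)
    with torch.no_grad():
        teacher_logits = teacher(**inputs).logits   # [batch, seq, vocab]
    student_logits = student(**inputs).logits       # [batch, seq, vocab]

    # KL divergence between softened teacher and student token distributions
    # (padding positions would normally be masked out as well).
    loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1).flatten(0, 1),
        F.softmax(teacher_logits / T, dim=-1).flatten(0, 1),
        reduction="batchmean",
    ) * (T ** 2)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```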

@xduzhangjiayu
Author

xduzhangjiayu commented Apr 16, 2025

@sharathts
Hi,
Thanks for the reply. One more question: how can I convert a multimodal model (like Qwen2-VL) from the Hugging Face format to the .nemo format? Is there any script for this in the repo? Thanks!
