Hi everyone,
I was wondering if llama.cpp currently supports multimodal models (like LLaVA) on Android or other mobile devices. I know that llama.cpp has added support for LLaVA, but has anyone successfully run it on mobile?
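For reference, on desktop I can already run LLaVA through llama.cpp's multimodal CLI with something like the command below. (The binary name has changed across versions; recent builds ship llama-mtmd-cli, older ones used llama-llava-cli, and the model/projector file names here are just placeholders.)

```sh
# Run a quantized LLaVA model together with its vision projector (mmproj).
# Model and image file names are placeholders.
./llama-mtmd-cli \
  -m llava-v1.6-mistral-7b.Q4_K_M.gguf \
  --mmproj mmproj-model-f16.gguf \
  --image ./test.jpg \
  -p "Describe this image in detail."
```

What I can't tell is whether the same workflow is expected to work on-device.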
Some questions I have:
Is there official support or any experimental builds for multimodal inference on Android? (My current build attempt is sketched after this list.)
What are the main challenges in running LLaVA on mobile? (Performance, quantization, etc.; my current quantization step is also sketched below.)
Are there any optimized versions or alternative solutions to make it work efficiently on mobile CPUs/NPUs?
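For context on the first question, here is roughly what I have been trying for an Android build, using the generic CMake + NDK cross-compilation workflow (the API level, CPU ABI, and output paths below are my own assumptions, not an official recipe from the repo):

```sh
# Cross-compile llama.cpp for 64-bit ARM Android using the NDK toolchain file.
# ANDROID_NDK must point at an installed NDK; adjust the ABI/API level to your device.
cmake -B build-android \
  -DCMAKE_TOOLCHAIN_FILE=$ANDROID_NDK/build/cmake/android.toolchain.cmake \
  -DANDROID_ABI=arm64-v8a \
  -DANDROID_PLATFORM=android-28 \
  -DCMAKE_BUILD_TYPE=Release
cmake --build build-android --config Release -j

# Push binaries and GGUF files to the device, then run them from an adb shell.
adb push build-android/bin /data/local/tmp/llama
```

This compiles for me, but I don't know whether the multimodal targets are actually expected to work (or to be usable speed-wise) on a phone.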
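On the quantization side, what I do on desktop is quantize the language-model half to 4-bit before anything else, along these lines (the tool is called llama-quantize in recent builds, plain quantize in older ones; file names are placeholders):

```sh
# Quantize the f16 GGUF language model down to Q4_K_M to reduce memory use.
# The multimodal projector (mmproj) file is typically left at f16.
./llama-quantize llava-v1.6-7b-f16.gguf llava-v1.6-7b-Q4_K_M.gguf Q4_K_M
```

Even at Q4_K_M, a 7B LLaVA model plus its vision encoder is still several GB of weights, which is part of why I'm unsure about phones.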
If anyone has tested it or has insights, I’d love to hear your thoughts!
Thanks in advance!