common : refactor downloading system, handle mmproj with -hf option (#12694)

* (wip) refactor downloading system [no ci] * fix all examples * fix mmproj with -hf * gemma3: update readme * only handle mmproj in llava example * fix multi-shard download * windows: fix problem with std::min and std::max * fix 2
2025-04-01 23:44:05 +02:00 · 2025-04-01 23:44:05 +02:00 · 267c1399f1
commit 267c1399f1
parent f423981ac8
19 changed files with 673 additions and 635 deletions
--- a/examples/llava/README-gemma3.md
+++ b/examples/llava/README-gemma3.md
@ -4,6 +4,26 @@
 >
 > This is very experimental, only used for demo purpose.

+## Quick started
+
+You can use pre-quantized model from [ggml-org](https://huggingface.co/ggml-org)'s Hugging Face account
+
+```bash
+# build
+cmake -B build
+cmake --build build --target llama-gemma3-cli
+
+# alternatively, install from brew (MacOS)
+brew install llama.cpp
+
+# run it
+llama-gemma3-cli -hf ggml-org/gemma-3-4b-it-GGUF
+llama-gemma3-cli -hf ggml-org/gemma-3-12b-it-GGUF
+llama-gemma3-cli -hf ggml-org/gemma-3-27b-it-GGUF
+
+# note: 1B model does not support vision
+```
+
 ## How to get mmproj.gguf?

 ```bash