llama.cpp/tools
Xuan-Son Nguyen 32916a4907
clip : refactor graph builder (#13321)
* mtmd : refactor graph builder

* fix qwen2vl

* clean up siglip cgraph

* pixtral migrated

* move minicpmv to a dedicated build function

* move max_feature_layer to build_llava

* use build_attn for minicpm resampler

* fix windows build

* add comment for batch_size

* also support tinygemma3 test model

* qwen2vl does not use RMS norm

* fix qwen2vl norm (2)
2025-05-06 22:40:24 +02:00
..
batched-bench llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
cvector-generator llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
export-lora llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
gguf-split llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
imatrix imatrix: fix oob writes if src1 is not contiguous (#13286) 2025-05-04 00:50:37 +02:00
llama-bench llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
main llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
mtmd clip : refactor graph builder (#13321) 2025-05-06 22:40:24 +02:00
perplexity llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
quantize llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
rpc rpc : use backend registry, support dl backends (#13304) 2025-05-04 21:25:43 +02:00
run llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
server sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (#13264) 2025-05-05 22:12:19 +02:00
tokenize llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
tts llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
CMakeLists.txt mtmd : rename llava directory to mtmd (#13311) 2025-05-05 16:02:55 +02:00