llama.cpp/docs
Xuan-Son Nguyen 92ecdcc06a
mtmd : add vision support for llama 4 (#13282)
* wip llama 4 conversion

* rm redundant __init__

* fix conversion

* fix conversion

* test impl

* try this

* reshape patch_embeddings_0

* fix view

* rm ffn_post_norm

* cgraph ok

* f32 for pos embd

* add image marker tokens

* Llama4UnfoldConvolution

* correct pixel shuffle

* fix merge conflicts

* correct

* add debug_graph

* logits matched, but it still perceives the image incorrectly

* fix style

* add image_grid_pinpoints

* handle llama 4 preprocessing

* rm load_image_size

* rm unused line

* fix

* small fix 2

* add test & docs

* fix llava-1.6 test

* test: add notion of huge models

* add comment

* add warn about degraded quality
2025-05-19 13:04:14 +02:00
backend              sycl: use oneDNN for matrices multiplication (#12972)  2025-05-15 16:53:41 +02:00
development          llama : move end-user examples to tools directory (#13249)  2025-05-02 20:27:13 +02:00
multimodal           mtmd : rename llava directory to mtmd (#13311)  2025-05-05 16:02:55 +02:00
android.md           repo : update links to new url (#11886)  2025-02-15 16:40:57 +02:00
build.md             CUDA/HIP: Share the same unified memory allocation logic. (#12934)  2025-04-15 11:20:38 +02:00
docker.md            repo : update links to new url (#11886)  2025-02-15 16:40:57 +02:00
function-calling.md  update function-calling.md w/ template override for functionary-small-v3.2 (#12214)  2025-03-06 09:03:31 +00:00
install.md           install : add macports (#12518)  2025-03-23 10:21:48 +02:00
llguidance.md        llguidance build fixes for Windows (#11664)  2025-02-14 12:46:08 -08:00
multimodal.md        mtmd : add vision support for llama 4 (#13282)  2025-05-19 13:04:14 +02:00