llama.cpp

Xuan-Son Nguyen 92ecdcc06a mtmd : add vision support for llama 4 (#13282)	2025-05-19 13:04:14 +02:00
* wip llama 4 conversion
* rm redundant __init__
* fix conversion
* fix conversion
* test impl
* try this
* reshape patch_embeddings_0
* fix view
* rm ffn_post_norm
* cgraph ok
* f32 for pos embd
* add image marker tokens
* Llama4UnfoldConvolution
* correct pixel shuffle
* fix merge conflicts
* correct
* add debug_graph
* logits matched, but it still preceives the image incorrectly
* fix style
* add image_grid_pinpoints
* handle llama 4 preprocessing
* rm load_image_size
* rm unused line
* fix
* small fix 2
* add test & docs
* fix llava-1.6 test
* test: add notion of huge models
* add comment
* add warn about degraded quality
backend	sycl: use oneDNN for matrices multiplication (#12972)	2025-05-15 16:53:41 +02:00
development	llama : move end-user examples to tools directory (#13249)	2025-05-02 20:27:13 +02:00
multimodal	mtmd : rename llava directory to mtmd (#13311)	2025-05-05 16:02:55 +02:00
android.md	repo : update links to new url (#11886)	2025-02-15 16:40:57 +02:00
build.md	CUDA/HIP: Share the same unified memory allocation logic. (#12934)	2025-04-15 11:20:38 +02:00
docker.md	repo : update links to new url (#11886)	2025-02-15 16:40:57 +02:00
function-calling.md	update function-calling.md w/ template override for functionary-small-v3.2 (#12214)	2025-03-06 09:03:31 +00:00
install.md	install : add macports (#12518)	2025-03-23 10:21:48 +02:00
llguidance.md	llguidance build fixes for Windows (#11664)	2025-02-14 12:46:08 -08:00
multimodal.md	mtmd : add vision support for llama 4 (#13282)	2025-05-19 13:04:14 +02:00