* sync : ggml (part 1)
* sync : ggml (part 2, CUDA)
* sync : ggml (part 3, Metal)
* ggml : build fixes ggml-ci
* cuda : restore lost changes
* cuda : restore lost changes (StableLM rope)
* cmake : enable separable compilation for CUDA ggml-ci
* ggml-cuda : remove device side dequantize
* Revert "cmake : enable separable compilation for CUDA"
  This reverts commit 09e35d04b1c4ca67f9685690160b35bc885a89ac.
* cuda : remove assert for rope
* tests : add test-backend-ops
* ggml : fix bug in ggml_concat
* ggml : restore `ggml_get_n_tasks()` logic in `ggml_graph_plan()`
* ci : try to fix macOS
* ggml-backend : remove backend self-registration
* ci : disable Metal for macOS cmake build ggml-ci
* metal : fix "supports family" call
* metal : fix assert
* metal : print resource path ggml-ci

---------

Co-authored-by: slaren <slarengh@gmail.com>

| Name | | |
|---|---|---|
| build-info.cmake | ||
| build-info.sh | ||
| convert-gg.sh | ||
| gen-build-info-cpp.cmake | ||
| get-wikitext-2.sh | ||
| LlamaConfig.cmake.in | ||
| qnt-all.sh | ||
| run-all-perf.sh | ||
| run-all-ppl.sh | ||
| server-llm.sh | ||
| sync-ggml.sh | ||
| verify-checksum-models.py | ||