llama.cpp / ggml
Latest commit e562eece7c by Johannes Gäßler: CUDA: fix typo in FlashAttention code (#13926), 2025-05-30 21:22:03 +02:00
cmake            cmake: Factor out CPU architecture detection (#13883)         2025-05-29 12:50:25 +02:00
include          ggml : add ggml_repeat_4d (#13824)                             2025-05-27 15:53:55 +02:00
src              CUDA: fix typo in FlashAttention code (#13926)                 2025-05-30 21:22:03 +02:00
.gitignore       vulkan : cmake integration (#8119)                             2024-07-13 18:12:39 +02:00
CMakeLists.txt   vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817)    2025-05-27 18:39:07 +02:00