This website requires JavaScript.
Explore
Help
Sign in
ver4a
/
llama.cpp
Watch
1
Star
0
Fork
You've already forked llama.cpp
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
e1e8e0991f
llama.cpp
/
ggml
History
Download ZIP
Download TAR.GZ
Johannes Gäßler
e1e8e0991f
CUDA: batched+noncont MMQ, refactor bs>1 MoE code (
#13199
)
2025-04-30 23:12:59 +02:00
..
cmake
scripts : update sync + fix cmake merge
2025-03-27 10:09:29 +02:00
include
CUDA: fix q_nope_absorbed prec for DS 2 Lite f16 (
#13137
)
2025-04-28 09:29:26 +02:00
src
CUDA: batched+noncont MMQ, refactor bs>1 MoE code (
#13199
)
2025-04-30 23:12:59 +02:00
.gitignore
vulkan : cmake integration (
#8119
)
2024-07-13 18:12:39 +02:00
CMakeLists.txt
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (
#12871
)
2025-04-21 18:13:51 +02:00