Georgi Gerganov
|
b2838049cc
|
bench : handle decode errors (#13548)
ggml-ci
|
2025-05-15 05:57:02 +03:00 |
|
Diego Devesa
|
cf0a43bb64
|
llama-bench : add defrag-thold, check for invalid ranges (#13487)
|
2025-05-13 00:31:37 +02:00 |
|
Diego Devesa
|
22cdab343b
|
llama-bench : accept ranges for integer parameters (#13410)
|
2025-05-12 13:08:22 +02:00 |
|
David Huang
|
7f323a589f
|
Add --no-op-offload to improve -ot pp perf in MoE models like llama4 400B (#13386)
|
2025-05-11 14:18:39 +02:00 |
|
Diego Devesa
|
1d36b3670b
|
llama : move end-user examples to tools directory (#13249)
* llama : move end-user examples to tools directory
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
|
2025-05-02 20:27:13 +02:00 |
|