![]() * llama : use n_swa + n_ubatch cells for SWA cache ggml-ci * llama : add warning about multi-sqeuence SWA contexts |
||
---|---|---|
.. | ||
batched-bench | ||
cvector-generator | ||
export-lora | ||
gguf-split | ||
imatrix | ||
llama-bench | ||
main | ||
mtmd | ||
perplexity | ||
quantize | ||
rpc | ||
run | ||
server | ||
tokenize | ||
tts | ||
CMakeLists.txt |