llama.cpp

History

Eric Curtin f777a73e18 Some llama-run cleanups (#11973 ) Use consolidated open function call from File class. Change read_all to to_string(). Remove exclusive locking, the intent for that lock is to avoid multiple processes writing to the same file, it's not an issue for readers, although we may want to consider adding a shared lock. Remove passing nullptr as reference, references are never supposed to be null. clang-format the code for consistent styling. Signed-off-by: Eric Curtin <ecurtin@redhat.com>		2025-02-23 13:14:32 +00:00
..
batched	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
batched-bench	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
batched.swift	swift : fix llama-vocab api usage (#11645 )	2025-02-04 13:15:24 +02:00
convert-llama2c-to-ggml	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
cvector-generator	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
deprecation-warning	Update deprecation-warning.cpp (#10619 )	2024-12-04 23:19:20 +01:00
embedding	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
eval-callback	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
export-lora	export-lora : fix tok_embd tensor (#11330 )	2025-01-21 14:07:12 +01:00
gbnf-validator	Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639 )	2025-01-30 19:13:58 +00:00
gen-docs	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
gguf	GGUF: C++ refactor, backend support, misc fixes (#11030 )	2025-01-07 18:01:58 +01:00
gguf-hash	GGUF: C++ refactor, backend support, misc fixes (#11030 )	2025-01-07 18:01:58 +01:00
gguf-split	ci : use -no-cnv in gguf-split tests (#11254 )	2025-01-15 18:28:35 +02:00
gritlm	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
imatrix	examples: fix typo in imatrix/README.md (#11884 )	2025-02-15 21:03:30 +02:00
infill	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
jeopardy	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
llama-bench	llama-bench : fix unexpected global variable initialize sequence issue (#11832 )	2025-02-14 02:13:43 +01:00
llama.android	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
llama.swiftui	llama.swiftui : add "Done" dismiss button to help view (#11998 )	2025-02-22 06:33:29 +01:00
llava	llava: build clip image from pixels (#11999 )	2025-02-22 15:28:28 +01:00
lookahead	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
lookup	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
main	tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900 )	2025-02-18 18:03:23 +00:00
parallel	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
passkey	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
perplexity	Fix: Compile failure due to Microsoft STL breaking change (#11836 )	2025-02-12 21:36:11 +01:00
quantize	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
quantize-stats	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
retrieval	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
rpc	rpc-server : add support for the SYCL backend (#10934 )	2024-12-23 10:39:30 +02:00
run	Some llama-run cleanups (#11973 )	2025-02-23 13:14:32 +00:00
save-load-state	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
server	server : disable Nagle's algorithm (#12020 )	2025-02-22 11:46:31 +01:00
simple	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
simple-chat	Add Jinja template support (#11016 )	2025-01-21 13:18:51 +00:00
simple-cmake-pkg	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
speculative	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
speculative-simple	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
sycl	[SYCL]set context default value to avoid memory issue, update guide (#9476 )	2024-09-18 08:30:31 +08:00
tokenize	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
tts	tts : add guide tokens support (#11186 )	2025-01-18 12:20:57 +02:00
chat-13B.bat	Create chat-13B.bat (#592 )	2023-03-29 20:21:09 +03:00
chat-13B.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
chat-persistent.sh	scripts : fix pattern and get n_tokens in one go (#10221 )	2024-11-09 09:06:54 +02:00
chat-vicuna.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
chat.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
CMakeLists.txt	tts : add OuteTTS support (#10784 )	2024-12-18 19:27:21 +02:00
convert_legacy_llama.py	metadata: Detailed Dataset Authorship Metadata (#8875 )	2024-11-13 21:10:38 +11:00
json_schema_pydantic_example.py	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00
json_schema_to_grammar.py	grammar : fix JSON Schema for string regex with top-level alt. (#9903 )	2024-10-16 19:03:24 +03:00
llama.vim	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
llm.vim	llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879 )	2023-08-30 09:50:55 +03:00
Miku.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
pydantic_models_to_grammar.py	pydantic : replace uses of __annotations__ with get_type_hints (#8474 )	2024-07-14 19:51:21 -04:00
pydantic_models_to_grammar_examples.py	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
reason-act.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
regex_to_grammar.py	py : switch to snake_case (#8305 )	2024-07-05 07:53:33 +03:00
server-llama2-13B.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
server_embd.py	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00
ts-type-to-grammar.sh	JSON schema conversion: ⚡️ faster repetitions, min/maxLength for strings, cap number length (#6555 )	2024-04-12 19:43:38 +01:00