Commit graph

  • e6c9e0986c Fix bin dir for win ci anzz1 2023-03-21 23:49:24 +02:00
  • 01a297b099
    specify build type for ctest on windows (#371) Erik Scholz 2023-03-21 22:34:25 +01:00
  • 3366853e41
    Add notice about pending change Georgi Gerganov 2023-03-21 22:57:35 +02:00
  • 3f9c6135e4
    fix typo in chatLLaMa (#368) Mathieu Nayrolles 2023-03-21 16:52:27 -04:00
  • 0f61352708
    Update issue templates Georgi Gerganov 2023-03-21 19:47:27 +02:00
  • 353ec251a4
    We could use std::unordered_map over std::map (#305) Fabio R. Sluzala 2023-03-21 14:21:50 -03:00
  • 89d5d90f3b
    Fix color codes emitting mid-UTF8 code. (#312) Matvey Soloviev 2023-03-21 18:11:01 +01:00
  • 16ffc013c6
    Importer for GPTQ quantized LLaMA models (#301) comex 2023-03-21 09:42:25 -07:00
  • 486ae645fd
    Compute perplexity over prompt (#270) Gary Linscott 2023-03-21 09:27:42 -07:00
  • 3ab3e6582f
    Add chatLLaMa script (#198) Jean-Christophe Hoelt 2023-03-21 18:23:15 +02:00
  • f157088cb7
    makefile: Fix CPU feature detection on Haiku (#218) Alex von Gluck IV 2023-03-21 11:21:06 -05:00
  • c86ba036e6
    Enable ANSI colors on Windows 10+ (#311) anzz1 2023-03-21 18:14:46 +02:00
  • 1daf4dd712
    Minor style changes Georgi Gerganov 2023-03-21 18:10:32 +02:00
  • dc6a845b85
    Add chat.sh script Georgi Gerganov 2023-03-21 18:09:37 +02:00
  • 6a612959e1
    Check for reverse prompt by characters instead of tokens (#292) (#330) tjohnman 2023-03-21 17:05:06 +01:00
  • d5f56a5e5a
    Check for reverse prompt by characters instead of tokens (#292) (#330) tjohnman 2023-03-21 17:04:43 +01:00
  • 3bfa3b43b7
    Fix convert script, warnings alpaca instructions, default params Georgi Gerganov 2023-03-21 17:59:16 +02:00
  • 715d292ee0
    Add OpenBSD support (#314) Kevin Lo 2023-03-21 09:50:09 -06:00
  • c98ae02668
    fix typo in comment (#318) Mack Straight 2023-03-21 08:49:43 -07:00
  • c3b2306b18
    Makefile: slightly cleanup for Mac Intel; echo instead of run ./main -h (#335) Qingyou Meng 2023-03-21 23:44:11 +08:00
  • 975d2cebf9
    cmdline option for custom amount of model parts (--n_parts N) (#348) anzz1 2023-03-21 17:42:43 +02:00
  • e0ffc861fa
    Update IPFS links to quantized alpaca with new tokenizer format (#352) Kevin Kwok 2023-03-21 08:34:49 -07:00
  • 8f644a0a85
    Change default repeat_penalty to 1.0 Georgi Gerganov 2023-03-21 17:32:14 +02:00
  • eb34620aec
    Add tokenizer test + revert to C++11 (#355) Georgi Gerganov 2023-03-21 17:29:41 +02:00
  • 2e664f1ff4
    Add initial AVX512 support for dot product on Linux (#320) Casey Primozic 2023-03-21 07:35:42 -07:00
  • 8cf9f34edd
    Adding missing features of CMakeLists.txt & Refactoring (#131) nusu-github 2023-03-21 09:37:16 +09:00
  • bd4b46d6ba Nix flake: set meta.mainProgram to llama Ben Siraphob 2023-03-20 16:44:30 -05:00
  • 6b6d5b5024
    Fixed tokenizer.model not found error when model dir is symlink (#325) Qingyou Meng 2023-03-21 03:33:10 +08:00
  • a791a68b61
    move file magic/version to header, print expected version (#319) Mack Straight 2023-03-20 12:26:01 -07:00
  • 0f1b21cb90
    Docker - Fix publish docker image in GitHub Registry (#235) Bernat Vadell 2023-03-20 18:05:20 +01:00
  • 074bea2eb1
    sentencepiece bpe compatible tokenizer (#252) Mack Straight 2023-03-20 03:17:23 -07:00
  • 5cb63e2493
    Add tqdm to Python requirements (#293) Stephan Walter 2023-03-20 08:24:11 +00:00
  • da5303c1ea
    bugfix: default should not be interactive (#304) cocktailpeanut 2023-03-19 17:44:20 -04:00
  • 4545539d71
    Rename script Georgi Gerganov 2023-03-19 21:58:51 +02:00
  • edeba28366
    Add temporary helper script for Alpaca chat Georgi Gerganov 2023-03-19 21:57:28 +02:00
  • 5c19c70ba6
    fix coloring of last n_batch of prompt, and refactor line input (#221) Rickey Bowers Jr 2023-03-19 13:44:30 -06:00
  • 24568371ae
    Support for multiple reverse prompts. (#299) tjohnman 2023-03-19 20:33:06 +01:00
  • 7392f1cd2c
    Improved quantize script (#222) Suaj Carrot 2023-03-19 12:38:44 -06:00
  • ad5fd5b60c
    Make prompt randomization optional. (#300) tjohnman 2023-03-19 19:36:19 +01:00
  • 368d0c8a9e
    Respect the maximum number of tokens in interactive. (#298) tjohnman 2023-03-19 19:31:17 +01:00
  • 50fae10d03
    Add --ignore-eos parameter (#181) slaren 2023-03-19 19:22:48 +01:00
  • 084e2f0ec0
    interactive mode: print '\n' in sigint_handler, this flush stdout thus ensure color reset. (#283) Qingyou Meng 2023-03-20 02:10:00 +08:00
  • 0b366e7357
    Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294) Erik Scholz 2023-03-19 18:57:00 +01:00
  • 160bfb217d
    Update hot topics to mention Alpaca support Georgi Gerganov 2023-03-19 19:51:55 +02:00
  • c494ed5b94
    Fix off-by-one bug (#115) Georgi Gerganov 2023-03-19 19:46:32 +02:00
  • c1c7026b47
    Fix python stuff (#109) Georgi Gerganov 2023-03-19 19:33:18 +02:00
  • 467b149761
    Refactoring convert-pth-to-ggml.py: more concise and readable (#109) qunash 2023-03-19 20:17:39 +03:00
  • 70f01cb863
    Drop trailing new line from file prompts (#80) Georgi Gerganov 2023-03-19 19:04:44 +02:00
  • a4e63b73df
    Add instruction for using Alpaca (#240) Georgi Gerganov 2023-03-19 18:49:50 +02:00
  • 9e1707218a
    Add "--instruct" argument for usage with Alpaca (#240) Georgi Gerganov 2023-03-19 18:37:02 +02:00
  • 22213a17b5
    Change RMSNorm eps to 1e-6 (#173) Georgi Gerganov 2023-03-19 17:30:00 +02:00
  • d7def1a752
    Warn user if a context size greater than 2048 tokens is specified (#274) Ronsor 2023-03-18 17:10:47 -07:00
  • 6f61c18ec9 Fix typo in readme Pavol Rusnak 2023-03-18 22:39:46 +01:00
  • 1e5a6d088d Add note about Python 3.11 to readme Pavol Rusnak 2023-03-18 22:20:04 +01:00
  • 554b541521 Add memory/disk requirements to readme Pavol Rusnak 2023-03-18 21:58:46 +01:00
  • d3f202d57b
    Remove unused code since n_vocab is model.hparams.n_vocab (#262) Alex Nguyen 2023-03-18 20:51:49 +07:00
  • e03e359730
    fixed warning with std::ignore about unused function result (#151) Justin Suess 2023-03-18 07:44:09 -04:00
  • a81d0c2a17
    Fix n^2 loop in tokenization (#254) Gary Linscott 2023-03-18 04:17:19 -07:00
  • b2de7f18df
    CI Improvements (#230) anzz1 2023-03-18 09:27:12 +02:00
  • a292747893
    Nix flake (#40) Niklas Korz 2023-03-17 23:03:48 +01:00
  • c9f670a177
    Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +01:00
  • 4f54609110
    Default to 4 threads (#243) Georgi Gerganov 2023-03-17 21:46:46 +02:00
  • e81b9c81c1
    Update Contributing section Georgi Gerganov 2023-03-17 20:30:04 +02:00
  • 367946c668
    Don't tell users to use a bad number of threads (#243) Stephan Walter 2023-03-17 17:47:35 +00:00
  • 6b0df5ccf3
    add ptread link to fix cmake build under linux (#114) mmyjona 2023-03-18 00:38:24 +08:00
  • 2af23d3043
    🚀 Dockerize llamacpp (#132) Bernat Vadell 2023-03-17 10:47:06 +01:00
  • 904d2a8d6a
    Q4_1 quantization (#193) Matvey Soloviev 2023-03-17 05:48:39 +01:00
  • 721311070e
    Update README.md Georgi Gerganov 2023-03-16 15:00:09 +02:00
  • ac15de7895
    Expand "Contributing" section Georgi Gerganov 2023-03-16 08:55:13 +02:00
  • 273abc47ff
    Update hot topics - RMSnorm Georgi Gerganov 2023-03-16 07:12:12 +02:00
  • 9b4a15b17d
    Fix RMS norm in GGML (#191) Nebula 2023-03-15 19:29:25 -04:00
  • 6eac39ba95
    Add RMS norm and use it (#187) hoangmit 2023-03-15 18:41:38 -04:00
  • 27944c4206
    fixed typo (#178) moritzbrantner 2023-03-15 21:35:25 +01:00
  • 2d15d6c9a9
    add SIGINT support for _WIN32 environments (#120) Rickey Bowers Jr 2023-03-15 13:56:24 -06:00
  • 2d64715ad4
    added ctx_size parameter (#148) Justin Suess 2023-03-15 15:42:40 -04:00
  • 16b2c61a22
    fixed color reset on exit (#149) Justin Suess 2023-03-15 15:39:38 -04:00
  • 977295c700
    Fix potential licensing issue (#126) Musab Gultekin 2023-03-15 22:39:06 +03:00
  • 956dfda8ad
    Use tokenizer.vocab_size() instead of hardcoding 32000 in convert-pth-to-ggml.py (#142) Ronsor 2023-03-15 12:37:50 -07:00
  • 113e685d18
    inline -> static inline for "bytesFromNibbles" (#161) hoangmit 2023-03-15 15:05:14 -04:00
  • 47857e564c
    Don't use vdotq_s32 if it's not available (#139) Ronsor 2023-03-14 12:34:37 -07:00
  • 60f819a2b1
    Add section to README on how to run the project on Android (#130) Radoslav Gerganov 2023-03-14 15:30:08 +02:00
  • 97ab2b2578
    Add Misc section + update hot topics + minor fixes Georgi Gerganov 2023-03-14 09:43:52 +02:00
  • 2f700a2738
    Add windows to the CI (#98) Sebastián A 2023-03-13 17:29:10 -03:00
  • c09a9cfb06
    CMake build in Release by default (#75) Georgi Gerganov 2023-03-13 21:22:15 +02:00
  • 7ec903d3c1
    Update contribution section, hot topics, limitations, etc. Georgi Gerganov 2023-03-13 19:21:51 +02:00
  • 4497ad819c
    Print system information Georgi Gerganov 2023-03-13 19:15:08 +02:00
  • ed6849cc07
    Initial support for CMake (#75) Sebastián A 2023-03-13 14:12:33 -03:00
  • 41be0a3b3d
    Add NetBSD support. (#90) Thomas Klausner 2023-03-13 17:40:54 +01:00
  • 671d5cac15
    Use fprintf for diagnostic output (#48) Pavol Rusnak 2023-03-13 17:39:56 +01:00
  • 84d9015c4a
    Use vdotq_s32 to improve performance (#67) Georgi Gerganov 2023-03-13 18:36:44 +02:00
  • 63fd76fbb0
    Reduce model loading time (#43) uint256_t 2023-03-14 01:33:43 +09:00
  • 2a20f48efa
    Fix UTF-8 handling (including colors) (#79) Val Kharitonov 2023-03-13 12:24:18 -04:00
  • d1f224712d
    Add quantize script for batch quantization (#92) Pavol Rusnak 2023-03-13 17:15:20 +01:00
  • 1808ee0500
    Add initial contribution guidelines Georgi Gerganov 2023-03-13 09:42:26 +02:00
  • a169bb889c Gate signal support on being on a unixoid system. (#74) Matvey Soloviev 2023-03-13 04:08:01 +01:00
  • 460c482540 Fix token count accounting Matvey Soloviev 2023-03-13 00:35:51 +01:00
  • c80e2a8f2a
    Revert "10% performance boost on ARM" Georgi Gerganov 2023-03-13 01:28:08 +02:00
  • 54a0e66ea0
    Check for vdotq_s32 availability Georgi Gerganov 2023-03-13 01:21:03 +02:00
  • 543c57e991
    Ammend to previous commit - forgot to update non-QRDMX branch Georgi Gerganov 2023-03-13 01:05:24 +02:00
  • 113a9e83eb
    10% performance boost on ARM Georgi Gerganov 2023-03-13 00:56:10 +02:00