Historique des commits

Auteur SHA1 Message Date
  oobabooga 113f94b61e Bump transformers (16-bit llama must be reconverted/redownloaded) il y a 2 ans
  oobabooga 03cb44fc8c Add new llama.cpp library (2048 context, temperature, etc now work) il y a 2 ans
  catalpaaa 4ab679480e allow quantized model to be loaded from model dir (#760) il y a 2 ans
  oobabooga 3a47a602a3 Detect ggml*.bin files automatically il y a 2 ans
  oobabooga 4c27562157 Minor changes il y a 2 ans
  Thomas Antony 79fa2b6d7e Add support for alpaca il y a 2 ans
  Thomas Antony 7745faa7bb Add llamacpp to models.py il y a 2 ans
  oobabooga 1cb9246160 Adapt to the new model names il y a 2 ans
  oobabooga 53da672315 Fix FlexGen il y a 2 ans
  oobabooga ee95e55df6 Fix RWKV tokenizer il y a 2 ans
  oobabooga fde92048af Merge branch 'main' into catalpaaa-lora-and-model-dir il y a 2 ans
  oobabooga 49c10c5570 Add support for the latest GPTQ models with group-size (#530) il y a 2 ans
  catalpaaa b37c54edcf lora-dir, model-dir and login auth il y a 2 ans
  oobabooga a6bf54739c Revert models.py (accident) il y a 2 ans
  oobabooga a80aa65986 Update models.py il y a 2 ans
  oobabooga ddb62470e9 --no-cache and --gpu-memory in MiB for fine VRAM control il y a 2 ans
  oobabooga e26763a510 Minor changes il y a 2 ans
  Wojtek Kowaluk 7994b580d5 clean up duplicated code il y a 2 ans
  Wojtek Kowaluk 30939e2aee add mps support on apple silicon il y a 2 ans
  oobabooga ee164d1821 Don't split the layers in 8-bit mode by default il y a 2 ans
  oobabooga e085cb4333 Small changes il y a 2 ans
  awoo 83cb20aad8 Add support for --gpu-memory witn --load-in-8bit il y a 2 ans
  oobabooga 1c378965e1 Remove unused imports il y a 2 ans
  oobabooga 66256ac1dd Make the "no GPU has been detected" message more descriptive il y a 2 ans
  oobabooga 265ba384b7 Rename a file, add deprecation warning for --load-in-4bit il y a 2 ans
  Ayanami Rei 8778b756e6 use updated load_quantized il y a 2 ans
  Ayanami Rei e1c952c41c make argument non case-sensitive il y a 2 ans
  Ayanami Rei 3c9afd5ca3 rename method il y a 2 ans
  Ayanami Rei edbc61139f use new quant loader il y a 2 ans
  oobabooga 65dda28c9d Rename --llama-bits to --gptq-bits il y a 2 ans