Commit History

Autor SHA1 Mensaxe Data
  oobabooga c33715ad5b Move towards HF LLaMA implementation %!s(int64=2) %!d(string=hai) anos
  oobabooga bd8aac8fa4 Add LLaMA 8-bit support %!s(int64=2) %!d(string=hai) anos
  oobabooga ed8b35efd2 Add --pin-weight parameter for FlexGen %!s(int64=2) %!d(string=hai) anos
  oobabooga ea5c5eb3da Add LLaMA support %!s(int64=2) %!d(string=hai) anos
  oobabooga 659bb76722 Add RWKVModel class %!s(int64=2) %!d(string=hai) anos
  oobabooga 6837d4d72a Load the model by name %!s(int64=2) %!d(string=hai) anos
  oobabooga 70e522732c Move RWKV loader into a separate file %!s(int64=2) %!d(string=hai) anos
  oobabooga ebc64a408c RWKV support prototype %!s(int64=2) %!d(string=hai) anos
  oobabooga 8e3e8a070f Make FlexGen work with the newest API %!s(int64=2) %!d(string=hai) anos
  oobabooga 65326b545a Move all gradio elements to shared (so that extensions can use them) %!s(int64=2) %!d(string=hai) anos
  oobabooga f6f792363b Separate command-line params by spaces instead of commas %!s(int64=2) %!d(string=hai) anos
  luis 5abdc99a7c gpu-memory arg change %!s(int64=2) %!d(string=hai) anos
  oobabooga 7224343a70 Improve the imports %!s(int64=2) %!d(string=hai) anos
  oobabooga e46c43afa6 Move some stuff from server.py to modules %!s(int64=2) %!d(string=hai) anos
  oobabooga 1dacd34165 Further refactor %!s(int64=2) %!d(string=hai) anos