Commit History

Autor SHA1 Mensaxe Data
  oobabooga 8e89bc596b Fix encode() for RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga 19a34941ed Add proper streaming to RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga 8660227e1b Add top_k to RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga 20bd645f6a Fix bug in multigpu setups (attempt 3) %!s(int64=2) %!d(string=hai) anos
  oobabooga 09a7c36e1b Minor improvement while running custom models %!s(int64=2) %!d(string=hai) anos
  oobabooga 24c4c20391 Fix bug in multigpu setups (attempt #2) %!s(int64=2) %!d(string=hai) anos
  oobabooga d88b7836c6 Fix bug in multigpu setups %!s(int64=2) %!d(string=hai) anos
  oobabooga e91f4bc25a Add RWKV tokenizer %!s(int64=2) %!d(string=hai) anos
  oobabooga a54b91af77 Improve readability %!s(int64=2) %!d(string=hai) anos
  oobabooga 8e706df20e Fix a memory leak when text streaming is on %!s(int64=2) %!d(string=hai) anos
  oobabooga c33715ad5b Move towards HF LLaMA implementation %!s(int64=2) %!d(string=hai) anos
  oobabooga c93f1fa99b Count the tokens more conservatively %!s(int64=2) %!d(string=hai) anos
  oobabooga 05e703b4a4 Print the performance information more reliably %!s(int64=2) %!d(string=hai) anos
  oobabooga a345a2acd2 Add a tokenizer placeholder %!s(int64=2) %!d(string=hai) anos
  oobabooga 5b354817f6 Make chat minimally work with LLaMA %!s(int64=2) %!d(string=hai) anos
  oobabooga ea5c5eb3da Add LLaMA support %!s(int64=2) %!d(string=hai) anos
  oobabooga 7bbe32f618 Don't return a value in an iterator function %!s(int64=2) %!d(string=hai) anos
  oobabooga ff9f649c0c Remove some unused imports %!s(int64=2) %!d(string=hai) anos
  oobabooga 955cf431e8 Minor consistency fix %!s(int64=2) %!d(string=hai) anos
  oobabooga 831ac7ed3f Add top_p %!s(int64=2) %!d(string=hai) anos
  oobabooga 7c4d5ca8cc Improve the text generation call a bit %!s(int64=2) %!d(string=hai) anos
  oobabooga 0f6708c471 Sort the imports %!s(int64=2) %!d(string=hai) anos
  oobabooga e735806c51 Add a generate() function for RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga f871971de1 Trying to get the chat to work %!s(int64=2) %!d(string=hai) anos
  oobabooga ebd698905c Add streaming to RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga 70e522732c Move RWKV loader into a separate file %!s(int64=2) %!d(string=hai) anos
  oobabooga ebc64a408c RWKV support prototype %!s(int64=2) %!d(string=hai) anos
  oobabooga 6e843a11d6 Fix FlexGen in chat mode %!s(int64=2) %!d(string=hai) anos
  oobabooga fa58fd5559 Proper way to free the cuda cache %!s(int64=2) %!d(string=hai) anos
  oobabooga 700311ce40 Empty the cuda cache at model.generate() %!s(int64=2) %!d(string=hai) anos