Commit History

Autor SHA1 Mensaxe Data
  oobabooga 341e135036 Various fixes in chat mode %!s(int64=2) %!d(string=hai) anos
  oobabooga b0e8cb8c88 Various fixes in chat mode %!s(int64=2) %!d(string=hai) anos
  oobabooga 0bd5430988 Use 'with' statement to better handle streaming memory %!s(int64=2) %!d(string=hai) anos
  oobabooga 37f0166b2d Fix memory leak in new streaming (second attempt) %!s(int64=2) %!d(string=hai) anos
  oobabooga 59b5f7a4b7 Improve usage of stopping_criteria %!s(int64=2) %!d(string=hai) anos
  oobabooga add9330e5e Bug fixes %!s(int64=2) %!d(string=hai) anos
  oobabooga 33fb6aed74 Minor bug fix %!s(int64=2) %!d(string=hai) anos
  oobabooga ad2970374a Readability improvements %!s(int64=2) %!d(string=hai) anos
  oobabooga 72d539dbff Better separate the FlexGen case %!s(int64=2) %!d(string=hai) anos
  oobabooga ab50f80542 New text streaming method (much faster) %!s(int64=2) %!d(string=hai) anos
  oobabooga 8e89bc596b Fix encode() for RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga 19a34941ed Add proper streaming to RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga 8660227e1b Add top_k to RWKV %!s(int64=2) %!d(string=hai) anos
  oobabooga 20bd645f6a Fix bug in multigpu setups (attempt 3) %!s(int64=2) %!d(string=hai) anos
  oobabooga 09a7c36e1b Minor improvement while running custom models %!s(int64=2) %!d(string=hai) anos
  oobabooga 24c4c20391 Fix bug in multigpu setups (attempt #2) %!s(int64=2) %!d(string=hai) anos
  oobabooga d88b7836c6 Fix bug in multigpu setups %!s(int64=2) %!d(string=hai) anos
  oobabooga e91f4bc25a Add RWKV tokenizer %!s(int64=2) %!d(string=hai) anos
  oobabooga a54b91af77 Improve readability %!s(int64=2) %!d(string=hai) anos
  oobabooga 8e706df20e Fix a memory leak when text streaming is on %!s(int64=2) %!d(string=hai) anos
  oobabooga c33715ad5b Move towards HF LLaMA implementation %!s(int64=2) %!d(string=hai) anos
  oobabooga c93f1fa99b Count the tokens more conservatively %!s(int64=2) %!d(string=hai) anos
  oobabooga 05e703b4a4 Print the performance information more reliably %!s(int64=2) %!d(string=hai) anos
  oobabooga a345a2acd2 Add a tokenizer placeholder %!s(int64=2) %!d(string=hai) anos
  oobabooga 5b354817f6 Make chat minimally work with LLaMA %!s(int64=2) %!d(string=hai) anos
  oobabooga ea5c5eb3da Add LLaMA support %!s(int64=2) %!d(string=hai) anos
  oobabooga 7bbe32f618 Don't return a value in an iterator function %!s(int64=2) %!d(string=hai) anos
  oobabooga ff9f649c0c Remove some unused imports %!s(int64=2) %!d(string=hai) anos
  oobabooga 955cf431e8 Minor consistency fix %!s(int64=2) %!d(string=hai) anos
  oobabooga 831ac7ed3f Add top_p %!s(int64=2) %!d(string=hai) anos