oobabooga bd8aac8fa4 Add LLaMA 8-bit support 2 years ago
..
LLaMA.py 5a79863df3 Increase the sequence length, decrease batch size 2 years ago
LLaMA_8bit.py bd8aac8fa4 Add LLaMA 8-bit support 2 years ago
RWKV.py ff9f649c0c Remove some unused imports 2 years ago
chat.py 1a05860ca3 Ensure proper no-streaming with generation_attempts > 1 2 years ago
deepspeed_parameters.py f38c9bf428 Fix deepspeed (oops) 3 years ago
extensions.py 91f5852245 Move bot_picture.py inside the extension 2 years ago
html_generator.py 43b6ab8673 Store thumbnails as files instead of base64 strings 2 years ago
models.py bd8aac8fa4 Add LLaMA 8-bit support 2 years ago
shared.py ed8b35efd2 Add --pin-weight parameter for FlexGen 2 years ago
stopping_criteria.py 7224343a70 Improve the imports 2 years ago
text_generation.py c93f1fa99b Count the tokens more conservatively 2 years ago
ui.py 2bff646130 Stop chat from flashing dark when processing 2 years ago