oobabooga bd8aac8fa4 Add LLaMA 8-bit support 2 years ago
LLaMA.py 5a79863df3 Increase the sequence length, decrease batch size 2 years ago
LLaMA_8bit.py bd8aac8fa4 Add LLaMA 8-bit support 2 years ago
RWKV.py ff9f649c0c Remove some unused imports 2 years ago
chat.py 1a05860ca3 Ensure proper no-streaming with generation_attempts > 1 2 years ago
deepspeed_parameters.py f38c9bf428 Fix deepspeed (oops) 3 years ago
extensions.py 91f5852245 Move bot_picture.py inside the extension 2 years ago
html_generator.py 43b6ab8673 Store thumbnails as files instead of base64 strings 2 years ago
models.py bd8aac8fa4 Add LLaMA 8-bit support 2 years ago
shared.py ed8b35efd2 Add --pin-weight parameter for FlexGen 2 years ago
stopping_criteria.py 7224343a70 Improve the imports 2 years ago
text_generation.py c93f1fa99b Count the tokens more conservatively 2 years ago
ui.py 2bff646130 Stop chat from flashing dark when processing 2 years ago