oobabooga
|
c7aa51faa6
Use a list of eos_tokens instead of just a number
|
2 years ago |
Xan
|
b3e10e47c0
Fix merge conflict in text_generation
|
2 years ago |
oobabooga
|
341e135036
Various fixes in chat mode
|
2 years ago |
oobabooga
|
b0e8cb8c88
Various fixes in chat mode
|
2 years ago |
oobabooga
|
0bd5430988
Use 'with' statement to better handle streaming memory
|
2 years ago |
oobabooga
|
37f0166b2d
Fix memory leak in new streaming (second attempt)
|
2 years ago |
oobabooga
|
59b5f7a4b7
Improve usage of stopping_criteria
|
2 years ago |
oobabooga
|
add9330e5e
Bug fixes
|
2 years ago |
Xan
|
5648a41a27
Merge branch 'main' of https://github.com/xanthousm/text-generation-webui
|
2 years ago |
Xan
|
ad6b699503
Better TTS with autoplay
|
2 years ago |
oobabooga
|
33fb6aed74
Minor bug fix
|
2 years ago |
oobabooga
|
ad2970374a
Readability improvements
|
2 years ago |
oobabooga
|
72d539dbff
Better separate the FlexGen case
|
2 years ago |
oobabooga
|
ab50f80542
New text streaming method (much faster)
|
2 years ago |
oobabooga
|
8e89bc596b
Fix encode() for RWKV
|
2 years ago |
oobabooga
|
19a34941ed
Add proper streaming to RWKV
|
2 years ago |
oobabooga
|
8660227e1b
Add top_k to RWKV
|
2 years ago |
oobabooga
|
20bd645f6a
Fix bug in multigpu setups (attempt 3)
|
2 years ago |
oobabooga
|
09a7c36e1b
Minor improvement while running custom models
|
2 years ago |
oobabooga
|
24c4c20391
Fix bug in multigpu setups (attempt #2)
|
2 years ago |
oobabooga
|
d88b7836c6
Fix bug in multigpu setups
|
2 years ago |
oobabooga
|
e91f4bc25a
Add RWKV tokenizer
|
2 years ago |
oobabooga
|
a54b91af77
Improve readability
|
2 years ago |
oobabooga
|
8e706df20e
Fix a memory leak when text streaming is on
|
2 years ago |
oobabooga
|
c33715ad5b
Move towards HF LLaMA implementation
|
2 years ago |
oobabooga
|
c93f1fa99b
Count the tokens more conservatively
|
2 years ago |
oobabooga
|
05e703b4a4
Print the performance information more reliably
|
2 years ago |
oobabooga
|
a345a2acd2
Add a tokenizer placeholder
|
2 years ago |
oobabooga
|
5b354817f6
Make chat minimally work with LLaMA
|
2 years ago |
oobabooga
|
ea5c5eb3da
Add LLaMA support
|
2 years ago |