oobabooga
|
955cf431e8
Minor consistency fix
|
2 years ago |
oobabooga
|
831ac7ed3f
Add top_p
|
2 years ago |
oobabooga
|
7c4d5ca8cc
Improve the text generation call a bit
|
2 years ago |
oobabooga
|
0f6708c471
Sort the imports
|
2 years ago |
oobabooga
|
e735806c51
Add a generate() function for RWKV
|
2 years ago |
oobabooga
|
f871971de1
Trying to get the chat to work
|
2 years ago |
oobabooga
|
ebd698905c
Add streaming to RWKV
|
2 years ago |
oobabooga
|
70e522732c
Move RWKV loader into a separate file
|
2 years ago |
oobabooga
|
ebc64a408c
RWKV support prototype
|
2 years ago |
oobabooga
|
6e843a11d6
Fix FlexGen in chat mode
|
2 years ago |
oobabooga
|
fa58fd5559
Proper way to free the cuda cache
|
2 years ago |
oobabooga
|
700311ce40
Empty the cuda cache at model.generate()
|
2 years ago |
oobabooga
|
78ad55641b
Remove duplicate max_new_tokens parameter
|
2 years ago |
oobabooga
|
65326b545a
Move all gradio elements to shared (so that extensions can use them)
|
2 years ago |
oobabooga
|
9ae063e42b
Fix softprompts when deepspeed is active (#112)
|
2 years ago |
oobabooga
|
7224343a70
Improve the imports
|
2 years ago |
oobabooga
|
1dacd34165
Further refactor
|
2 years ago |
oobabooga
|
ce7feb3641
Further refactor
|
2 years ago |