Sem descrição

9 Ramos

oobabooga 4058b33fc9 Improve the chat experience		há 3 anos atrás
models	dd70f7edd5 Add the default folders	há 3 anos atrás
presets	89fb0a13c5 Add a new preset	há 3 anos atrás
torch-dumps	ec2973f596 Add folder	há 3 anos atrás
LICENSE	ad71774a24 Initial commit	há 3 anos atrás
README.md	a0b1b1beb2 Mention gpt4chan's config.json	há 3 anos atrás
convert-to-torch.py	45168e9e7a Update the description	há 3 anos atrás
download-model.py	dd1bed2d8b Fix the download script	há 3 anos atrás
html_generator.py	538998b43b Fix a bug with the greentexts	há 3 anos atrás
requirements.txt	fed7233ff4 Add script to download models	há 3 anos atrás
server.py	4058b33fc9 Improve the chat experience	há 3 anos atrás
webui.png	29ce69eea7 Update the screenshot	há 3 anos atrás

text-generation-webui

A gradio webui for running large language models locally. Supports gpt-j-6B, gpt-neox-20b, opt, galactica, and many others.

Its goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation.

Features

Switch between different models using a dropdown menu.
Generate nice HTML output for gpt4chan.
Generate Markdown output for GALACTICA, including LaTeX support.
Notebook mode that resembles OpenAI's playground.
Chat mode for conversation and role playing.
Load 13b/20b models in 8-bit mode.
Load parameter presets from text files.

Installation

Create a conda environment:

conda create -n textgen
conda activate textgen

Install the appropriate pytorch for your GPU. For NVIDIA GPUs, this should work:

conda install pytorch torchvision torchaudio pytorch-cuda=11.7 -c pytorch -c nvidia

Install the requirements:

pip install -r requirements.txt

Downloading models

Models should be placed under models/model-name. For instance, models/gpt-j-6B for gpt-j-6B.

Hugging Face

Hugging Face is the main place to download models. These are some of my favorite:

The files that you need to download are the json, txt, and pytorch*.bin files. The remaining files are not necessary.

For your convenience, you can automatically download a model from HF using the script download-model.py. Its usage is very simple:

python download-model.py organization/model

For instance:

python download-model.py facebook/opt-1.3b

gpt4chan

gpt4chan has been shut down from Hugging Face, so you need to download it elsewhere. You have two options:

Torrent: 16-bit / 32-bit
Direct download: 16-bit / 32-bit

You also need to put GPT-J-6B's config.json file in the same folder: config.json

Converting to pytorch

The script convert-to-torch.py allows you to convert models to .pt format, which is about 10x faster to load:

python convert-to-torch.py models/model-name/

The output model will be saved to torch-dumps/model-name.pt. When you load a new model, the webui first looks for this .pt file; if it is not found, it loads the model as usual from models/model-name/.

Starting the webui

conda activate textgen
python server.py

Then browse to

http://localhost:7860/?__theme=dark

Optionally, you can use the following command-line flags:

--model model-name: Load this model by default.

--notebook: Launch the webui in notebook mode, where the output is written to the same text box as the input.

--chat: Launch the webui in chat mode.

Presets

Inference settings presets can be created under presets/ as text files. These files are detected automatically at startup.

Contributing

Pull requests are welcome.

README.md