This repository provides a patch to support Unigram-based tokenizers with ExLlamaV2
.
By default, using a Unigram tokenizer may result in an error during inference. This patch resolves that issue.
hf.py.patch
A patch file that modifies hf.py
in the ExLlamaV2 repository to support Unigram tokenizers.
apply_patch.sh
A shell script to apply the patch.
You can apply the patch with the following command:
$ ./apply_patch.sh EXLLAMAV2_DIR
Replace EXLLAMAV2_DIR with the path to your local clone of ExLlamaV2.
Example:
$ ./apply_patch.sh ~/repos/exllamav2
The patch is intended to: