This is a collection of tokenization implementations focused on transparency and readability.
You can install Tokenizers directly from GitHub.
pip install git+https://github.com/dakofler/simple_tokenizers.git

Example notebooks that show how to use the package are included; see Examples.
Daniel Kofler - AI Research Associate (dkofler@outlook.com)
Cheers,
Daniel