GPT-2-style architecture trained with document-level character depth and multi-token prediction to learn deeper patterns. TODO: fp8 casting for faster training.
nicowaltz/painter
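
The description mentions multi-token prediction, but the repository's training code is not shown here. As a rough illustration of the general idea only, the sketch below assumes one output head per future offset and averages the cross-entropy over all heads and positions. The function names (`multi_token_targets`, `multi_token_loss`) and the data layout are hypothetical and are not taken from this repository.

```python
import math


def multi_token_targets(tokens, n_future):
    """For each position t, collect the next n_future tokens as targets.

    Positions that do not have n_future following tokens are dropped.
    """
    last = len(tokens) - n_future
    return [tokens[t + 1 : t + 1 + n_future] for t in range(last)]


def cross_entropy(logits, target):
    """Negative log-probability of `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def multi_token_loss(head_logits, targets):
    """Mean cross-entropy over all positions and all prediction heads.

    Assumed layout (hypothetical): head_logits[t][k] is the logit vector
    from head k at position t, and targets[t][k] is the token k + 1 steps
    ahead of position t.
    """
    total, count = 0.0, 0
    for pos_logits, pos_targets in zip(head_logits, targets):
        for logits, target in zip(pos_logits, pos_targets):
            total += cross_entropy(logits, target)
            count += 1
    return total / count
```

With `n_future=1` this reduces to ordinary next-token prediction; larger values ask the model to commit to several upcoming tokens at once.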