This repository contains a clean and modular implementation of the original Transformer model (as proposed in the paper "Attention is All You Need") using PyTorch.
It includes:
- A custom Transformer class with Encoder and Decoder modules
- Training pipeline using tqdm progress bars
- Tokenization with Hugging Face datasets and tokenizers
- Teacher forcing via target shifting (as done in standard implementations)
- Padding masks and look-ahead masks to correctly handle variable-length sequences (see the sketch below)
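A minimal sketch of how these two masks can be built in PyTorch (the helper name make_masks and the pad token id of 0 are illustrative assumptions, not necessarily this repo's exact API):

```python
import torch

def make_masks(src, tgt, pad_id=0):
    # Padding mask: True at real tokens, False at padding positions.
    # Shaped (batch, 1, 1, src_len) so it broadcasts across heads and query positions.
    src_mask = (src != pad_id).unsqueeze(1).unsqueeze(2)

    # Look-ahead (causal) mask: lower-triangular, so position i can only attend
    # to positions <= i. It is combined with the target padding mask.
    tgt_len = tgt.size(1)
    look_ahead = torch.tril(torch.ones(tgt_len, tgt_len, device=tgt.device)).bool()
    tgt_pad = (tgt != pad_id).unsqueeze(1).unsqueeze(2)  # (batch, 1, 1, tgt_len)
    tgt_mask = tgt_pad & look_ahead                      # (batch, 1, tgt_len, tgt_len)

    return src_mask, tgt_mask
```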
We use Hugging Face's translation datasets (e.g., English → Czech) for training. Tokenization is handled using a pretrained tokenizer with max sequence length support.
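As a rough sketch of that pipeline (the wmt16 "cs-en" config, the tokenizer checkpoint, and the max length of 128 are placeholder assumptions):

```python
from datasets import load_dataset
from tokenizers import Tokenizer

# Hypothetical choices: wmt16 "cs-en" stands in for the English → Czech data,
# and a multilingual pretrained tokenizer is pulled from the Hub.
dataset = load_dataset("wmt16", "cs-en", split="train")
tokenizer = Tokenizer.from_pretrained("bert-base-multilingual-cased")
tokenizer.enable_truncation(max_length=128)  # max sequence length support
tokenizer.enable_padding(length=128)         # pad every sequence to a fixed length

pair = dataset[0]["translation"]
src_ids = tokenizer.encode(pair["en"]).ids
tgt_ids = tokenizer.encode(pair["cs"]).ids
```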
The model can be trained with the provided train_model function, which displays a real-time progress bar and reports the average loss per epoch.
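A rough sketch of the shape such a training loop typically takes (the optimizer, loss function, pad id, and batch format below are assumptions, not necessarily what this repo does):

```python
import torch
import torch.nn as nn
from tqdm import tqdm

def train_model(model, train_loader, num_epochs, lr, device):
    model.to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)  # assumed optimizer
    criterion = nn.CrossEntropyLoss(ignore_index=0)          # assumes pad id 0

    for epoch in range(num_epochs):
        total_loss = 0.0
        progress = tqdm(train_loader, desc=f"Epoch {epoch + 1}/{num_epochs}")
        for src, tgt in progress:                            # assumed batch format
            src, tgt = src.to(device), tgt.to(device)

            # Teacher forcing via target shifting: the decoder consumes
            # tgt[:, :-1] and is trained to predict tgt[:, 1:].
            decoder_input, labels = tgt[:, :-1], tgt[:, 1:]

            logits = model(src, decoder_input)               # (batch, seq, vocab)
            loss = criterion(logits.reshape(-1, logits.size(-1)), labels.reshape(-1))

            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

            total_loss += loss.item()
            progress.set_postfix(loss=f"{loss.item():.4f}")

        print(f"Epoch {epoch + 1}: average loss {total_loss / len(train_loader):.4f}")

    return model
```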
Example:
```python
trained_model = train_model(model, train_loader, num_epochs=10, lr=3e-4, device='cuda')
```

Requirements:

- Python 3.8+
- PyTorch
- Hugging Face datasets
- tqdm
Install requirements:
```
pip install torch datasets tqdm
```

During this project, I used ChatGPT (GPT-4) as a mentor to help correct bugs, clarify ambiguous points from the paper, and guide me through complex parts of the implementation. It acted as an instant code reviewer and a paper simplifier 💪.
- Vaswani et al., 2017: Attention is All You Need