Skip to content

feat: deepspeed example#100

Merged
adam-peaston-SC merged 28 commits into
mainfrom
feat/deepspeed-example
Jan 19, 2026
Merged

feat: deepspeed example#100
adam-peaston-SC merged 28 commits into
mainfrom
feat/deepspeed-example

Conversation

@adam-peaston-SC
Copy link
Copy Markdown
Contributor

Description

General description of the changes.

Model details

What specific models did you add or modify?

Steps to reproduce any training

What settings did you use, batch size, lr, number of nodes, etc

Training results

A gist of logs, a link to tensorboard etc if relevant.

Things done

General:

  • Linting (ruff, black, isort)

If adding a demo model training script:

  • Atomic saving - saving is done atomically to avoid corruption
  • Checkpointing - model can successfully save a checkpoint and resume.
  • Completed a full run

@adam-peaston-SC adam-peaston-SC self-assigned this Jan 18, 2026
@adam-peaston-SC adam-peaston-SC marked this pull request as ready for review January 18, 2026 05:27
Comment thread deepspeed/train.py Outdated
Comment thread deepspeed/train.py Outdated
@adam-peaston-SC adam-peaston-SC merged commit 12f4f7f into main Jan 19, 2026
1 check passed
@adam-peaston-SC adam-peaston-SC deleted the feat/deepspeed-example branch January 19, 2026 01:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants