This repository was archived by the owner on Feb 14, 2025. It is now read-only.

AICrossSim/NewComputeBench-archived


NewComputeBench is a project to develop a benchmark suite for new compute paradigms (spiking neural networks, optical computation, in-memory computation, etc.). The project is divided into three main components:

  • Model Training
  • Model Behavior-Level Simulation
  • Hardware-Performance Simulation

Model Training

ARIA-LLaMA (LLaMA with Grouped-Query Attention)

We aim to support the following features:

ARIA-LLM-135M:

ARIA-LLM-1.1B:

  • 🚧 TODO: ARIA-LLM-1B (We aim to scale the ARIA-LLM-135M model to 1B parameters and pretrain it on 3T tokens)
    • Model config
    • Pretraining data: SlimPajama
    • Pretraining scripts
    • Supervised Fine-tuning data: SmolTalk
    • Supervised Fine-tuning scripts
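
For readers unfamiliar with the grouped-query attention that ARIA-LLaMA is described as using, the following is a minimal NumPy sketch of the general technique, not the project's implementation. The function name `grouped_query_attention`, the tensor layout, and the causal mask are assumptions made for illustration; projections, rotary embeddings, and batching are omitted.

```python
import numpy as np

def grouped_query_attention(q, k, v):
    """Causal grouped-query attention (illustrative sketch).

    q:    (n_q_heads,  seq_len, head_dim)
    k, v: (n_kv_heads, seq_len, head_dim), with n_q_heads % n_kv_heads == 0
    """
    n_q_heads, seq_len, head_dim = q.shape
    n_kv_heads = k.shape[0]
    if n_q_heads % n_kv_heads != 0:
        raise ValueError("n_q_heads must be a multiple of n_kv_heads")
    group_size = n_q_heads // n_kv_heads

    # Each key/value head is shared by `group_size` consecutive query heads.
    keys = np.repeat(k, group_size, axis=0)
    values = np.repeat(v, group_size, axis=0)

    # Scaled dot-product scores, shape (n_q_heads, seq_len, seq_len).
    scores = q @ keys.transpose(0, 2, 1) / np.sqrt(head_dim)

    # Causal mask: position i may not attend to positions j > i.
    future = np.triu(np.ones((seq_len, seq_len), dtype=bool), 1)
    scores = np.where(future, -np.inf, scores)

    # Numerically stable softmax over the key axis.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ values

rng = np.random.default_rng(0)
q = rng.normal(size=(8, 5, 16))   # 8 query heads
k = rng.normal(size=(2, 5, 16))   # 2 shared key heads
v = rng.normal(size=(2, 5, 16))   # 2 shared value heads
out = grouped_query_attention(q, k, v)
print(out.shape)  # (8, 5, 16)
```

With `n_kv_heads == n_q_heads` this reduces to standard multi-head attention, and with `n_kv_heads == 1` to multi-query attention; the intermediate setting shrinks the key/value cache by a factor of `group_size`.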

TBD-8B

  • 🚧 TODO: 8B (We aim to fine-tune an LLM of around 7B parameters using LoRA)
    • LoRA fine-tuning data
    • LoRA fine-tuning scripts
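
As background for the planned LoRA fine-tuning, here is a minimal NumPy sketch of a LoRA-adapted linear layer. It illustrates the general low-rank update idea only; the class name `LoRALinear` and the defaults for `rank` and `alpha` are hypothetical and do not reflect any configuration chosen by this project.

```python
import numpy as np

class LoRALinear:
    """Frozen linear layer plus a trainable low-rank update (illustrative)."""

    def __init__(self, weight, rank=8, alpha=16.0, seed=0):
        out_features, in_features = weight.shape
        rng = np.random.default_rng(seed)
        self.weight = weight                      # frozen pretrained weight
        # A starts small and random, B starts at zero, so the adapter
        # initially leaves the layer's output unchanged.
        self.A = rng.normal(0.0, 0.01, size=(rank, in_features))
        self.B = np.zeros((out_features, rank))
        self.scale = alpha / rank

    def __call__(self, x):
        # y = x W^T + scale * x A^T B^T
        return x @ self.weight.T + self.scale * (x @ self.A.T) @ self.B.T

    def merged_weight(self):
        # Fold the adapter into a single dense matrix for inference.
        return self.weight + self.scale * self.B @ self.A

    def num_trainable(self):
        return self.A.size + self.B.size

W = np.random.default_rng(1).normal(size=(64, 32))
layer = LoRALinear(W, rank=4)
print(layer.num_trainable(), W.size)  # 384 2048
```

Only `A` and `B` would be updated during fine-tuning, which is why the trainable parameter count grows with `rank * (in_features + out_features)` rather than with `in_features * out_features`.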

Model Behavior-Level Simulation

🚧 TODO

Hardware-Performance Simulation

🚧 TODO
