Add the NVIDIA A100 on the list by RemiLehe · Pull Request #8 · karlrupp/cpu-gpu-mic-comparison

RemiLehe · 2020-12-08T01:00:03Z

This PR adds data for the NVIDIA A100:
https://developer.nvidia.com/blog/nvidia-ampere-architecture-in-depth/

PS: Thanks for maintaining this repo and making it openly accessible!

jedbrown

I suppose this repo should state a position on whether tensor cores count for double precision flops. FMA with arbitrarily long vector width counts, but not matrix-matrix? Or should matrix-matrix instructions be included?

Would you mind adding the single precision case as well?

Add the NVIDIA A100 on the list

9efc53b

jedbrown approved these changes Apr 28, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add the NVIDIA A100 on the list#8

Add the NVIDIA A100 on the list#8
RemiLehe wants to merge 1 commit into
karlrupp:masterfrom
RemiLehe:master

RemiLehe commented Dec 8, 2020

Uh oh!

jedbrown left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

RemiLehe commented Dec 8, 2020

Uh oh!

jedbrown left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants