Add fp16 support by santhnm2 · Pull Request #38 · microsoft/dist-ir

santhnm2 · 2021-09-20T07:33:42Z

No description provided.

siddharth-krishna · 2021-09-22T08:38:46Z

            assert isinstance(output, tuple)
            for i, v in enumerate(op.outputs):
                value_map[v] = output[i]
+                if torch.any(torch.isnan(output[i])):


I'd put these under a debug flag to avoid slowing down executions

siddharth-krishna · 2021-09-22T09:02:26Z

-        args.dram_bandwidth = simulation_parameters["dram_bandwidth"]
-        args.kernel_launch_overhead = simulation_parameters["kernel_launch_overhead"]
+        args.device_throughput = 1.0 / simulation_parameters["device_parameters"][0]
+        args.dram_bandwidth = 1.0 / simulation_parameters["device_parameters"][1]


Won't this become infinity if one of the regression coefficients is 0?

Yea but I think that's ok for now since we don't see much utility from the dram bandwidth at sufficiently large data sizes right?

When I ran it before Tue's meeting, it threw a RuntimeError for dividing by zero (not sure if it was from this line though), so maybe safer to store the parameters if you run into that error again.

siddharth-krishna · 2021-09-22T09:13:26Z

            "--use_gpu",
            action="store_true",
-            default=torch.cuda.is_available(),
+            default=False,


Why did you change the default?

There's no way to only use CPU otherwise, but maybe we should just make this --use_cpu instead

siddharth-krishna reviewed Sep 20, 2021

View reviewed changes

Comment thread examples/parser.py

Comment thread test/test_grid_search.py

santhnm2 added 2 commits September 20, 2021 14:27

Add fp16 support

099f0d4

Add fp16 support for GPT2

b5d1da8

santhnm2 force-pushed the fp16 branch from a5d54aa to b5d1da8 Compare September 20, 2021 21:29

santhnm2 and others added 6 commits September 20, 2021 14:31

Fix formatting

4bce097

Fix tests with no GPU available

7a3e573

Add fp16 support to simulator calibration

b621cb1

Create DataFrame of calibration data and dump to csv

eb777a6

Directly store device parameters

60abe92

GPT fp16 fixes

2b12914

siddharth-krishna reviewed Sep 22, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fp16 support#38

Add fp16 support#38
santhnm2 wants to merge 8 commits into
mainfrom
fp16

santhnm2 commented Sep 20, 2021

Uh oh!

Uh oh!

Uh oh!

siddharth-krishna Sep 22, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

siddharth-krishna Sep 22, 2021

Uh oh!

santhnm2 Sep 22, 2021

Uh oh!

siddharth-krishna Sep 22, 2021

Uh oh!

siddharth-krishna Sep 22, 2021

Uh oh!

santhnm2 Sep 22, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

santhnm2 commented Sep 20, 2021

Uh oh!

Uh oh!

Uh oh!

siddharth-krishna Sep 22, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

siddharth-krishna Sep 22, 2021

Choose a reason for hiding this comment

Uh oh!

santhnm2 Sep 22, 2021

Choose a reason for hiding this comment

Uh oh!

siddharth-krishna Sep 22, 2021

Choose a reason for hiding this comment

Uh oh!

siddharth-krishna Sep 22, 2021

Choose a reason for hiding this comment

Uh oh!

santhnm2 Sep 22, 2021

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants