Open
Labels
bug (Something isn't working)
Description
Some build artefacts for the GEMM operator are reused when they should not be. When two GEMM tests are run back to back, the first on 1 column and the second on 8 columns, the latencies are nearly identical (48605.1 us vs. 48537.7 us, see logs below). If I delete the build folder and then run the 8-column test, the latency drops as expected (6994.5 us).
(iron) 🐍 iron jungvi@grays /scratch/jungvi/IRON devel ± pytest iron/operators/gemm/ -v --iterations=1 -s
==================================================================================================================== test session starts =====================================================================================================================
platform linux -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0 -- /scratch/jungvi/micromamba/envs/iron/bin/python3.12
cachedir: .pytest_cache
rootdir: /scratch/jungvi/IRON
configfile: pytest.ini
plugins: xdist-3.8.0
collected 1 item
iron/operators/gemm/test.py::test_gemm[gemm_2048x2048x2048_64x64x64_1cols]
Latency (us): 48605.1
Effective Bandwidth: 5.177610e-01 GB/s
Throughput: 3.534582e+02 GFLOP/s
PASSED
===================================================================================================================== 1 passed in 4.14s ======================================================================================================================
(iron) 🐍 iron jungvi@grays /scratch/jungvi/IRON devel ± pytest iron/operators/gemm/ -v --iterations=1 -s
==================================================================================================================== test session starts =====================================================================================================================
platform linux -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0 -- /scratch/jungvi/micromamba/envs/iron/bin/python3.12
cachedir: .pytest_cache
rootdir: /scratch/jungvi/IRON
configfile: pytest.ini
plugins: xdist-3.8.0
collected 1 item
iron/operators/gemm/test.py::test_gemm[gemm_2048x2048x2048_64x64x64_8cols]
Latency (us): 48537.7
Effective Bandwidth: 5.184799e-01 GB/s
Throughput: 3.539489e+02 GFLOP/s
PASSED
(iron) 🐍 iron jungvi@grays /scratch/jungvi/IRON devel ± rm -rf build
(iron) ✘ 🐍 iron jungvi@grays /scratch/jungvi/IRON devel ± pytest iron/operators/gemm/ -v --iterations=5 -s
==================================================================================================================== test session starts =====================================================================================================================
platform linux -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0 -- /scratch/jungvi/micromamba/envs/iron/bin/python3.12
cachedir: .pytest_cache
rootdir: /scratch/jungvi/IRON
configfile: pytest.ini
plugins: xdist-3.8.0
collected 5 items
iron/operators/gemm/test.py::test_gemm[iter0-gemm_2048x2048x2048_64x64x64_8cols]
Latency (us): 6994.5
Effective Bandwidth: 3.597969e+00 GB/s
Throughput: 2.456213e+03 GFLOP/s
PASSED
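One possible way to avoid this kind of stale reuse is to key each build directory on every parameter that affects the generated artefacts, including the column count. The sketch below only illustrates that idea. It is not how IRON currently names its build outputs, and `artefact_dir`, the `num_cols` key, and the other parameter names are hypothetical.

```python
import hashlib
import json
from pathlib import Path


def artefact_dir(build_root, operator, params):
    """Return a build directory unique to one operator configuration.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    # Serialise every build-affecting parameter in a stable order so that
    # identical configurations always map to the same directory.
    canonical = json.dumps(params, sort_keys=True)
    digest = hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:16]
    return Path(build_root) / f"{operator}_{digest}"


# Assumed parameter names, mirroring the test ID
# gemm_2048x2048x2048_64x64x64_<N>cols from the logs above.
base = {"M": 2048, "K": 2048, "N": 2048,
        "tile_m": 64, "tile_k": 64, "tile_n": 64}

one_col = artefact_dir("build", "gemm", {**base, "num_cols": 1})
eight_cols = artefact_dir("build", "gemm", {**base, "num_cols": 8})

# The two configurations must not share a directory.
print(one_col != eight_cols)
```

With a scheme like this, the 1-column and 8-column runs would write to different directories, so the second run could not pick up the first run's artefacts. A parameter left out of the key would reproduce the problem described here.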