@5000user5000 (Owner) commented Nov 26, 2025

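Fix the k=100 problem.
The GPU has a clear advantage at k=1 and k=10.
At k=100 there is little difference. My guess is that shared memory is already at its limit, so data has to be fetched from the slower global memory, which becomes the speed bottleneck.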
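
A rough way to check the shared-memory guess is a back-of-envelope budget. The sketch below is only an illustration under assumptions that are not taken from this PR: each thread keeps its own k candidates in shared memory, an entry is a float32 score plus an int32 index, and the block has 128 threads. The 48 KiB figure is CUDA's default per-block limit for statically allocated shared memory. The names `topk_shared_bytes` and `fits_in_shared` are hypothetical helpers.

```python
# Back-of-envelope shared-memory budget for a per-thread top-k buffer.
# All layout choices below are assumptions for illustration, not the
# kernel in this PR.

SHARED_LIMIT_BYTES = 48 * 1024  # CUDA default static shared memory per block
THREADS_PER_BLOCK = 128         # hypothetical block size
ENTRY_BYTES = 4 + 4             # hypothetical entry: float32 score + int32 index


def topk_shared_bytes(k, threads=THREADS_PER_BLOCK, entry_bytes=ENTRY_BYTES):
    """Bytes needed if every thread in the block keeps k candidates."""
    return threads * k * entry_bytes


def fits_in_shared(k, limit=SHARED_LIMIT_BYTES):
    """True if the per-block top-k buffers fit under the shared-memory limit."""
    return topk_shared_bytes(k) <= limit


if __name__ == "__main__":
    for k in (1, 10, 100):
        print(k, topk_shared_bytes(k), fits_in_shared(k))
```

Under these assumptions, k=1 and k=10 need 1,024 and 10,240 bytes per block, while k=100 needs 102,400 bytes, which exceeds 48 KiB. That would be consistent with the candidates spilling to global memory at k=100, but the real numbers depend on the actual kernel layout.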

@5000user5000 merged commit 5dc33d5 into main Nov 26, 2025
1 check passed
@5000user5000 mentioned this pull request Nov 26, 2025