Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 235 Bytes

File metadata and controls

2 lines (2 loc) · 235 Bytes

Developed a GPU-accelerated 2D Convolution Engine in C++/CUDA, implementing core operations used in Computer Vision and Deep Learning. Optimized with shared memory and tiling, benchmarked against CPU for significant performance gains.