Skip to content

Incorrect output from grouped Conv GPU path #23

Description

@harz05

The grouped convolution path in Generate_GPU_ALPAKA of ROperator_Conv.hxx produces incorrect numerical output.

gemm_n is set to total output channels at the top of the function and used directly in per-group matmul calls, g_out_offset strides, groupFOffset, and groupBiasElements but it is never divided by fAttrGroup.

PR #22 catches this issue

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions