Two consecutive zero-modules will cause a consistent 0 gradient #3

Description

@myendless1

flow_spatial and flow_temporal cannot both be set to zero modules at the same time. I think the paper is misleading on this point: otherwise, the zero-valued second conv (flow_temporal) will cause the gradient of the first conv (flow_spatial) to be constantly zero. I think this is why the code reinitializes transformer3d.transformer_blocks[idx].flow_temporal and transformer3d.transformer_blocks[idx].flow_spatial.
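To illustrate the argument, here is a minimal sketch that assumes the two modules are applied back to back with no bias and nothing nonlinear in between. It is a scalar toy model, not the repository's code: `w_spatial`, `w_temporal`, and `train` are hypothetical names standing in for the two convs. By the chain rule, the gradient of the first weight is scaled by the second weight, and the gradient of the second weight is scaled by the first module's output, so when both start at zero, both gradients are zero at every step.

```python
# Toy model (assumption): y = w_temporal * (w_spatial * x), with scalar
# weights standing in for the flow_spatial and flow_temporal convs.
def train(w_spatial, w_temporal, steps=5, lr=0.1, x=1.5, target=2.0):
    """Run plain SGD on L = (y - target)^2 and return the final weights."""
    for _ in range(steps):
        h = w_spatial * x                   # output of the first module
        y = w_temporal * h                  # output of the second module
        dy = 2.0 * (y - target)             # dL/dy
        grad_temporal = dy * h              # zero whenever the first module outputs 0
        grad_spatial = dy * w_temporal * x  # zero whenever the second weight is 0
        w_spatial -= lr * grad_spatial
        w_temporal -= lr * grad_temporal
    return w_spatial, w_temporal


both_zero = train(0.0, 0.0)           # both modules zero-initialized
only_temporal_zero = train(0.5, 0.0)  # only the second module zero-initialized
```

In this sketch `both_zero` never leaves `(0.0, 0.0)`, while `only_temporal_zero` moves away from its starting point: the second weight receives a nonzero gradient on the first step, and the first weight starts receiving one as soon as the second weight is nonzero. This is consistent with zero-initializing only one of the two modules.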
