Remove attentions from TransformerOutput and stop computing attention weights in FLAVA#535

Open
stashuk-olek wants to merge 1 commit into facebookresearch:main from stashuk-olek:export-D92927086

Conversation

@stashuk-olek

Summary:
The attentions field on TransformerOutput and return_attn_weights/head_mask parameters in the FLAVA encoder stack were never used by any consumer.

This diff cleans them up. The intent is to later simplify attention usage and move to a common API for it.

Differential Revision: D92927086
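
To make the shape of the change concrete, here is a minimal, dependency-free sketch of what removing `attentions` and the `return_attn_weights`/`head_mask` parameters could look like from a caller's side. It is not the code in this PR: the classes `TransformerOutputBefore` and `TransformerOutputAfter`, the functions `encode_before` and `encode_after`, the `toy_layer` helper, the other field names, and the list-of-floats stand-in for tensors are all assumptions made for illustration.

```python
from typing import List, NamedTuple, Optional

# Stand-in for a tensor type so the sketch runs without any dependencies.
Tensor = List[float]


class TransformerOutputBefore(NamedTuple):
    """Hypothetical pre-change output: carries an optional attentions field."""
    last_hidden_state: Optional[Tensor] = None
    hidden_states: Optional[List[Tensor]] = None
    attentions: Optional[List[Tensor]] = None  # the field being removed


class TransformerOutputAfter(NamedTuple):
    """Hypothetical post-change output: no attentions field."""
    last_hidden_state: Optional[Tensor] = None
    hidden_states: Optional[List[Tensor]] = None


def toy_layer(x: Tensor) -> Tensor:
    # Placeholder for an encoder layer; it only halves each value.
    return [0.5 * v for v in x]


def encode_before(
    x: Tensor,
    head_mask: Optional[Tensor] = None,  # accepted but ignored in this toy
    return_attn_weights: bool = False,
) -> TransformerOutputBefore:
    hidden = toy_layer(x)
    # Fake uniform "attention weights", produced only on request.
    attn = [[1.0 / len(x)] * len(x)] if return_attn_weights else None
    return TransformerOutputBefore(hidden, [x, hidden], attn)


def encode_after(x: Tensor) -> TransformerOutputAfter:
    # Same hidden-state computation, with the unused knobs and field gone.
    hidden = toy_layer(x)
    return TransformerOutputAfter(hidden, [x, hidden])


if __name__ == "__main__":
    before = encode_before([2.0, 4.0])
    after = encode_after([2.0, 4.0])
    print(before.last_hidden_state == after.last_hidden_state)
    print("attentions" in TransformerOutputAfter._fields)
```

In a sketch like this, callers that read only the hidden states are unaffected, while any caller that accessed `.attentions`, passed the removed keyword arguments, or unpacked the tuple positionally would need updating. Whether any such callers exist outside the repository is not something the summary can confirm.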

@meta-cla meta-cla Bot added the CLA Signed label Feb 11, 2026
@meta-codesync

meta-codesync Bot commented Feb 11, 2026

@stashuk-olek has exported this pull request. If you are a Meta employee, you can view the originating Diff in D92927086.

@stashuk-olek stashuk-olek force-pushed the export-D92927086 branch 2 times, most recently from 7a54c92 to aee4101 February 11, 2026 19:34
stashuk-olek added a commit to stashuk-olek/multimodal that referenced this pull request Feb 12, 2026
… weights in FLAVA (facebookresearch#535)

Summary:

The `attentions` field on `TransformerOutput` and `return_attn_weights`/`head_mask` parameters in the FLAVA encoder stack were never used by any consumer. 

This diff cleans them up. The intent is to later simplify attention usage and move to a common API for it.

Differential Revision: D92927086
stashuk-olek added a commit to stashuk-olek/multimodal that referenced this pull request Feb 13, 2026
… weights in FLAVA (facebookresearch#535)

Summary:

The `attentions` field on `TransformerOutput` and `return_attn_weights`/`head_mask` parameters in the FLAVA encoder stack were never used by any consumer. 

This diff cleans them up. The intent is to later simplify attention usage and move to a common API for it.

Reviewed By: OmarPavel

Differential Revision: D92927086
stashuk-olek added a commit to stashuk-olek/multimodal that referenced this pull request Feb 25, 2026
… weights in FLAVA (facebookresearch#535)

Summary:

The `attentions` field on `TransformerOutput` and `return_attn_weights`/`head_mask` parameters in the FLAVA encoder stack were never used by any consumer. 

This diff cleans them up. The intent is to later simplify attention usage and move to a common API for it.

Reviewed By: OmarPavel

Differential Revision: D92927086
… weights in FLAVA (facebookresearch#535)

Summary:
Pull Request resolved: facebookresearch#535

The `attentions` field on `TransformerOutput` and `return_attn_weights`/`head_mask` parameters in the FLAVA encoder stack were never used by any consumer.

This diff cleans them up. The intent is to later simplify attention usage and move to a common API for it.

Reviewed By: OmarPavel

Differential Revision: D92927086
Labels

CLA Signed, fb-exported, meta-exported