Skip to content

Feat(embedder) use bf16 and fix the interface of attn_implementation in embedder.decode_only#1566

Merged
hanhainebula merged 4 commits intoFlagOpen:masterfrom
lnxtree:feat-fix-embedder_use_bf16
Mar 26, 2026
Merged

Feat(embedder) use bf16 and fix the interface of attn_implementation in embedder.decode_only#1566
hanhainebula merged 4 commits intoFlagOpen:masterfrom
lnxtree:feat-fix-embedder_use_bf16

Conversation

@lnxtree
Copy link
Contributor

@lnxtree lnxtree commented Mar 26, 2026

  • add use_bf16 support and unify inference dtype behavior
  • add use_bf16 to auto embedder and all embedder constructors
  • fix the interface of attn_implementation in embedder.decode_only..load_model and reranker.decode_only..load_model

@hanhainebula hanhainebula merged commit ac7a274 into FlagOpen:master Mar 26, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants