feat(oci): add xai.grok-4.3 model metadata#28776
Conversation
Oracle Cloud Infrastructure Generative AI service has launched the xAI Grok 4.3 model (reasoning model, 1M-token context window shared between prompt and response, knowledge cutoff December 2025). The model id `xai.grok-4.3` is already listed as a supported model in the OCI provider docs (litellm-docs PR BerriAI#197, merged 2026-05-23), but the model cost catalog has no entry for it, so it does not appear in the proxy UI "Add Model" dropdown and spend is attributed to the default rate. Add cost + capability metadata for `oci/xai.grok-4.3` so the UI surfaces the model and `litellm.cost_calculator` attributes spend accurately. Pricing is taken from the Oracle public pricing API (apexapps.oracle.com/pls/apex/cetools/api/v1/products/, PAY_AS_YOU_GO SKUs B112080-B112085): - input: $1.25 / 1M tokens (<=200k), $2.50 / 1M tokens (>200k) - output: $2.50 / 1M tokens (<=200k), $5.00 / 1M tokens (>200k) - cached: $0.20 / 1M tokens (<=200k), $0.40 / 1M tokens (>200k) Capability flags follow the OCI Grok 4.3 model documentation (https://docs.oracle.com/en-us/iaas/Content/generative-ai/xai-grok-4-3.htm): - function_calling: yes ("Function Calling: Yes, through the API.") - vision: yes ("Multimodal support: Input text and images and get a text output.") - prompt_caching: yes ("Cached Input Tokens: Yes") - reasoning: yes ("reasoning model designed for complex … tasks") - response_schema: false (consistent with existing OCI Grok entries) Context window: - 1,000,000 tokens shared between prompt + response ("Context Length: 1 million tokens (maximum prompt + response length is 1 million tokens for keeping the context).") - The playground caps a single response at 131,000 tokens, but the API has no separate output-only limit beyond the shared context. So max_input_tokens / max_output_tokens / max_tokens are all set to 1,000,000, matching the xai/grok-4.3 catalog entry pattern. Tests follow the pattern from BerriAI#27154 (xai/grok-4.3 entry). Signed-off-by: BobDu <i@bobdu.cc>
Greptile SummaryAdds the
Confidence Score: 5/5Safe to merge — the change is a self-contained metadata addition to two JSON catalog files with no logic changes anywhere in the codebase. All three changed files are narrow in scope: two JSON catalog files receive an identical new entry, and the test file uses only stdlib to read those files locally. The new entry's pricing values mirror the canonical xai/grok-4.3 entry exactly, capability flags are consistent with existing OCI Grok entries, and the backup file is kept in sync. There are no logic changes, no new code paths, and no risk of regressions. No files require special attention.
|
| Filename | Overview |
|---|---|
| model_prices_and_context_window.json | Adds oci/xai.grok-4.3 entry with pricing tiers, 1M context window, and capability flags; correctly mirrors the canonical xai/grok-4.3 pricing and sets supports_response_schema: false consistent with other OCI Grok entries |
| litellm/model_prices_and_context_window_backup.json | Backup copy updated with an identical oci/xai.grok-4.3 entry, kept in sync with the main catalog file |
| tests/test_litellm/test_oci_xai_grok_4_3_model_metadata.py | New test file with two pure-stdlib tests; validates all pricing/capability fields on oci/xai.grok-4.3 and asserts the backup catalog matches the main one — no network calls, complies with the unit-test-only constraint for this directory |
Reviews (1): Last reviewed commit: "feat(oci): add xai.grok-4.3 model metada..." | Re-trigger Greptile
Codecov Report✅ All modified and coverable lines are covered by tests. 📢 Thoughts on this report? Let us know! |
Relevant issues
N/A — model is already listed as supported in the OCI provider docs
(BerriAI/litellm-docs#197, merged 2026-05-23), this PR only adds the
matching entry to the model cost catalog so the proxy UI surfaces it
and spend tracking attributes cost correctly.
Linear ticket
N/A (external contributor)
Pre-Submission checklist
tests/test_litellm/test_oci_xai_grok_4_3_model_metadata.pymake test-unitoci/xai.grok-4.3entry to the model cost catalog (and its backup) + a metadata regression test@greptileaiand received a Confidence Score of at least 4/5 before requesting a maintainer reviewSummary
Oracle Cloud Infrastructure Generative AI has launched the xAI Grok
4.3 model — a reasoning model with a 1M-token context window (shared
between prompt and response) and a knowledge cutoff of December 2025.
The model id
xai.grok-4.3is already listed as supported in the OCIprovider docs (link below), but the model cost catalog does not yet
have an entry for it, so it does not appear in the proxy UI "Add Model"
dropdown and spend is attributed to the default rate.
This PR adds
oci/xai.grok-4.3to bothmodel_prices_and_context_window.jsonand the bundledlitellm/model_prices_and_context_window_backup.json, with pricing andcapability metadata sourced from Oracle's public documentation and
pricing API.
Pricing (PAY_AS_YOU_GO)
Sourced from the Oracle public pricing API
(
https://apexapps.oracle.com/pls/apex/cetools/api/v1/products/,SKUs B112080-B112085):
Capability flags
Per the OCI Grok 4.3 model documentation
(https://docs.oracle.com/en-us/iaas/Content/generative-ai/xai-grok-4-3.htm):
supports_function_callingsupports_visionsupports_prompt_cachingsupports_reasoningsupports_response_schemamax_input_tokensmax_output_tokensmax_tokensSources
xai/grok-4.3entry, merged 2026-05-07)Testing
tests/test_litellm/test_oci_xai_grok_4_3_model_metadata.pycovers:oci/xai.grok-4.3