fix(recommandation): Fix chart recommandation by alexandrusoare · Pull Request #39886 · apache/superset

alexandrusoare · 2026-05-05T09:56:20Z

SUMMARY

recommended_visualizations in get_chart_data responses are based purely on column count and column name string matching. A table chart with 2 columns gets "scatter plot" recommended; any column with "time" in the name triggers "line chart" even if it's not actually temporal.

Root Cause

Two problems:

Temporal columns are invisible — The DataColumn.data_type inference uses a Python isinstance heuristic that checks for int/float/bool. Datetime values from SQL arrive as strings, so temporal columns are always classified as "string". Meanwhile, the query result already contains coltypes — a list of GenericDataType enum values (NUMERIC, STRING, TEMPORAL, BOOLEAN) derived from actual SQL types via extract_dataframe_dtypes() — but this field was never read.
Recommendation logic ignores context — The old code only checked if any column name contains "time"/"date" and if column count <= 3. It didn't consider chart.viz_type (so it recommends the same type you already have), actual data types, or column cardinality (so high-cardinality ID columns trigger "scatter plot").

Fix

Read coltypes from the query result and use it to populate DataColumn.data_type with accurate SQL-derived types (falling back to the isinstance heuristic when coltypes is unavailable)
Replace the inline recommendation block with _recommend_visualizations(viz_type, columns, row_count) which:
- Classifies columns by type (temporal, numeric, categorical based on cardinality)
- Applies data-shape rules (temporal+numeric → line/area; categorical+numeric → bar/pie; multi-numeric → scatter; etc.)
- Excludes the chart's current viz_type category from suggestions
- Caps output at 4 recommendations

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TESTING INSTRUCTIONS

ADDITIONAL INFORMATION

Has associated issue:
Required feature flags:
Changes UI
Includes DB Migration (follow approval process in SIP-59)
- Migration is atomic, supports rollback & is backwards-compatible
- Confirm DB migration upgrade and downgrade tested
- Runtime estimates and downtime expectations provided
Introduces new feature or API
Removes existing feature or API

bito-code-review · 2026-05-05T09:56:30Z

Code Review Agent Run #04a47b

Actionable Suggestions - 0

Review Details

Files reviewed - 2 · Commit Range: 8b2a8d2..8b2a8d2
- superset/mcp_service/chart/tool/get_chart_data.py
- tests/unit_tests/mcp_service/chart/tool/test_get_chart_data.py
Files skipped - 0
Tools
- Whispers (Secret Scanner) - ✔︎ Successful
- Detect-secrets (Secret Scanner) - ✔︎ Successful
- MyPy (Static Code Analysis) - ✔︎ Successful
- Astral Ruff (Static Code Analysis) - ✔︎ Successful

Bito Usage Guide

Commands

Type the following command in the pull request comment and save the comment.

/review - Manually triggers a full AI review.
/pause - Pauses automatic reviews on this pull request.
/resume - Resumes automatic reviews.
/resolve - Marks all Bito-posted review comments as resolved.
/abort - Cancels all in-progress reviews.

Refer to the documentation for additional commands.

Configuration

This repository uses Superset You can customize the agent settings here or contact your Bito workspace admin at evan@preset.io.

Documentation & Help

AI Code Review powered by

netlify · 2026-05-05T09:58:48Z

✅ Deploy Preview for superset-docs-preview ready!

Name	Link
🔨 Latest commit	`8b2a8d2`
🔍 Latest deploy log	https://app.netlify.com/projects/superset-docs-preview/deploys/69f9bec7fe13600008c71fec
😎 Deploy Preview	https://deploy-preview-39886--superset-docs-preview.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.
🤖 Make changes	Run an agent on this branch

To edit notification comments on pull requests, go to your Netlify project configuration.

codecov · 2026-05-05T10:00:10Z

Codecov Report

❌ Patch coverage is 15.58442% with 65 lines in your changes missing coverage. Please review.
✅ Project coverage is 64.35%. Comparing base (dc1c0f6) to head (81d6fc2).
⚠️ Report is 17 commits behind head on master.

Files with missing lines	Patch %	Lines
superset/mcp_service/chart/tool/get_chart_data.py	15.58%	65 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #39886      +/-   ##
==========================================
- Coverage   64.35%   64.35%   -0.01%     
==========================================
  Files        2569     2569              
  Lines      134680   134750      +70     
  Branches    31254    31272      +18     
==========================================
+ Hits        86679    86718      +39     
- Misses      46505    46534      +29     
- Partials     1496     1498       +2

Flag	Coverage Δ
hive	`39.65% <15.58%> (?)`
mysql	`59.89% <15.58%> (-0.05%)`	⬇️
postgres	`59.97% <15.58%> (-0.05%)`	⬇️
presto	`41.40% <15.58%> (-0.03%)`	⬇️
python	`61.50% <15.58%> (-0.01%)`	⬇️
sqlite	`59.59% <15.58%> (-0.05%)`	⬇️
unit	`100.00% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

bito-code-review · 2026-05-05T12:03:58Z

Code Review Agent Run #b70632

Actionable Suggestions - 0

Additional Suggestions - 1

superset/mcp_service/chart/tool/get_chart_data.py - 1
- Incorrect pie chart logic · Line 160-160
  
  The pie chart suggestion logic checks only the first categorical column's unique_count, but should verify if any categorical column has unique_count <= 10. This ensures pie charts are recommended when suitable low-cardinality categorical data exists, even if not in the first position. The old code filtered categorical columns upfront, so this check was implicitly for any; the refactor removed that filter but didn't update the pie condition accordingly.
  Code suggestion
  @@ -160,1 +160,1 @@ - if len(numeric) == 1 and categorical and categorical[0].unique_count <= 10: + if len(numeric) == 1 and categorical and any(c.unique_count <= 10 for c in categorical):

Review Details

Files reviewed - 2 · Commit Range: 8b2a8d2..81d6fc2
- superset/mcp_service/chart/tool/get_chart_data.py
- tests/unit_tests/mcp_service/chart/tool/test_get_chart_data.py
Files skipped - 0
Tools
- Whispers (Secret Scanner) - ✔︎ Successful
- Detect-secrets (Secret Scanner) - ✔︎ Successful
- MyPy (Static Code Analysis) - ✔︎ Successful
- Astral Ruff (Static Code Analysis) - ✔︎ Successful

Bito Usage Guide

Commands

Type the following command in the pull request comment and save the comment.

/review - Manually triggers a full AI review.
/pause - Pauses automatic reviews on this pull request.
/resume - Resumes automatic reviews.
/resolve - Marks all Bito-posted review comments as resolved.
/abort - Cancels all in-progress reviews.

Refer to the documentation for additional commands.

Configuration

This repository uses Superset You can customize the agent settings here or contact your Bito workspace admin at evan@preset.io.

Documentation & Help

AI Code Review powered by

msyavuz · 2026-05-05T14:41:14Z

+    temporal = [c for c in columns if c.data_type == "temporal"]
+    numeric = [c for c in columns if c.data_type == "numeric"]
+    categorical = [c for c in columns if c.data_type in ("string", "boolean")]


This can probably be one loop right? Would that be better for the performance overhead?

msyavuz · 2026-05-05T14:43:05Z

+    if temporal and numeric:
+        return _candidates_temporal_numeric(numeric, row_count)
+    if categorical and numeric:
+        return _candidates_categorical_numeric(numeric, categorical)
+    if len(numeric) >= 2:
+        return _candidates_multi_numeric(numeric, categorical)
+    if len(numeric) == 1 and not temporal and not categorical:
+        return _candidates_single_numeric(numeric[0], row_count)
+    return []


Should this be a single function call with multiple arguments instead?

msyavuz · 2026-05-05T14:44:22Z

+def _candidates_temporal_numeric(
+    numeric: list[DataColumn], row_count: int
+) -> list[str]:
+    # Few data points are better as a bar chart than a line
+    if row_count < 5:
+        candidates = ["bar chart", "table"]
+    else:
+        candidates = ["line chart", "area chart", "bar chart"]
+        if len(numeric) > 1:
+            candidates.append("multi-line chart")
+    return candidates
+
+
+def _candidates_categorical_numeric(
+    numeric: list[DataColumn],
+    categorical: list[DataColumn],
+) -> list[str]:
+    candidates = ["bar chart"]
+    if len(numeric) == 1 and categorical[0].unique_count <= 10:
+        candidates.append("pie chart")
+    if len(numeric) >= 2:
+        candidates.append("scatter plot")
+        candidates.append("heatmap")
+    if any(c.unique_count > 5 for c in categorical):
+        candidates.append("treemap")
+    return candidates
+
+
+def _candidates_single_numeric(col: DataColumn, row_count: int) -> list[str]:
+    candidates = ["big number / KPI", "gauge chart"]
+    if row_count > 20 and col.unique_count > 10:
+        candidates.insert(0, "histogram")
+    return candidates
+
+
+def _candidates_multi_numeric(
+    numeric: list[DataColumn],
+    categorical: list[DataColumn],
+) -> list[str]:
+    candidates = ["scatter plot"]
+    if len(numeric) >= 3:
+        candidates.append("bubble chart")
+    if categorical:
+        candidates.append("heatmap")
+    return candidates


It looks a bit clearer with multiple functions though, not sure

msyavuz · 2026-05-05T14:47:15Z

-                    if all(isinstance(v, (int, float)) for v in sample_values):
-                        data_type = "numeric"
-                    elif all(isinstance(v, bool) for v in sample_values):
+                if idx < len(coltypes):


When is this not available?

msyavuz · 2026-05-05T14:48:02Z

+    from superset.mcp_service.chart.tool.get_chart_data import _GENERIC_TYPE_MAP
+    from superset.utils.core import GenericDataType


Maybe module level imports?

fix(recommandation): fix chart recommandation

8b2a8d2

pull-request-size Bot added the size/L label May 5, 2026

codeant-ai-for-open-source Bot reviewed May 5, 2026

View reviewed changes

Comment thread superset/mcp_service/chart/tool/get_chart_data.py

improvements

81d6fc2

msyavuz reviewed May 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(recommandation): Fix chart recommandation#39886

fix(recommandation): Fix chart recommandation#39886
alexandrusoare wants to merge 2 commits intomasterfrom
alexandrusoare/fix/get_chart_recommandation

alexandrusoare commented May 5, 2026

Uh oh!

bito-code-review Bot commented May 5, 2026 •

edited

Loading

Code Review Agent Run #04a47b

Uh oh!

netlify Bot commented May 5, 2026

Uh oh!

codecov Bot commented May 5, 2026 •

edited

Loading

Uh oh!

Uh oh!

bito-code-review Bot commented May 5, 2026 •

edited

Loading

Code Review Agent Run #b70632

Uh oh!

msyavuz May 5, 2026

Uh oh!

msyavuz May 5, 2026

Uh oh!

msyavuz May 5, 2026

Uh oh!

msyavuz May 5, 2026

Uh oh!

msyavuz May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		from superset.mcp_service.chart.tool.get_chart_data import _GENERIC_TYPE_MAP
		from superset.utils.core import GenericDataType

Conversation

alexandrusoare commented May 5, 2026

SUMMARY

Root Cause

Fix

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TESTING INSTRUCTIONS

ADDITIONAL INFORMATION

Uh oh!

bito-code-review Bot commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review Agent Run #04a47b

Uh oh!

netlify Bot commented May 5, 2026

✅ Deploy Preview for superset-docs-preview ready!

Uh oh!

codecov Bot commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

bito-code-review Bot commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review Agent Run #b70632

Uh oh!

msyavuz May 5, 2026

Choose a reason for hiding this comment

Uh oh!

msyavuz May 5, 2026

Choose a reason for hiding this comment

Uh oh!

msyavuz May 5, 2026

Choose a reason for hiding this comment

Uh oh!

msyavuz May 5, 2026

Choose a reason for hiding this comment

Uh oh!

msyavuz May 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bito-code-review Bot commented May 5, 2026 •

edited

Loading

codecov Bot commented May 5, 2026 •

edited

Loading

bito-code-review Bot commented May 5, 2026 •

edited

Loading