⚡ Bolt: Optimize deduplication in push_rules by abhimehro · Pull Request #788 · abhimehro/ctrld-sync

abhimehro · 2026-05-14T11:52:03Z

💡 What: Deduplicate hostnames using dict.fromkeys() before filtering against the existing_rules set.
🎯 Why: Reduces redundant hash map lookups and Python interpreter overhead when processing rules.
📊 Impact: ~2-3x speedup on dict comprehension in push_rules for large rule lists containing duplicates.
🔬 Measurement: Tested via custom microbenchmark; time dropped from ~0.0165s to ~0.0056s.

trunk-io · 2026-05-14T11:52:06Z

Merging to main in this repository is managed by Trunk.

To merge this pull request, check the box to the left or comment /trunk merge below.

After your PR is submitted to the merge queue, this comment will be automatically updated with its status. If the PR fails, failure details will also be posted here

cursor · 2026-05-14T11:52:09Z

PR Summary

Low Risk
Low risk performance-only change that preserves filtering semantics while reducing hash lookups; main risk is subtle behavioral drift if ordering/duplicate handling is relied on elsewhere.

Overview
Improves push_rules performance by deduplicating hostnames via dict.fromkeys() before filtering against ctx.existing_rules, reducing redundant membership checks for duplicate-heavy rule lists.

No API behavior changes; this is a hot-path optimization in the rule-prep step prior to safety validation and batch submission.

^{Reviewed by Cursor Bugbot for commit 15ff5b1. Configure here.}

codescene-delta-analysis

Gates Passed
6 Quality Gates Passed

See analysis details in CodeScene

Quality Gate Profile: Pay Down Tech Debt
Install CodeScene MCP: safeguard and uplift AI-generated code. Catch issues early with our IDE extension and CLI tool.

devin-ai-integration

Devin Review found 1 potential issue.

devin-ai-integration · 2026-05-14T11:53:35Z

+        # ⚡ Bolt: Deduplicate hostnames before filtering against existing_rules.
+        # This significantly reduces redundant hash map lookups for inputs with
+        # many duplicates, yielding up to a 3x speedup on this comprehension step.
+        unique_hostnames_dict = {
+            h: None for h in dict.fromkeys(hostnames) if h not in existing_rules
+        }


📝 Info: Functional equivalence of deduplication reorder confirmed

The old code {h: None for h in hostnames if h not in existing_rules} and the new code {h: None for h in dict.fromkeys(hostnames) if h not in existing_rules} produce identical dictionaries. In both cases, the output contains exactly the unique hostnames from the input that are not in existing_rules, preserving first-occurrence order. The intermediate dict.fromkeys(hostnames) just removes duplicates earlier so that each unique hostname is only checked once against existing_rules. The downstream duplicates_count at main.py:2223 (original_count - len(filtered_hostnames) - skipped_unsafe) is unaffected because original_count still reflects the raw input length, and filtered_hostnames/skipped_unsafe derive from unique_hostnames_dict which has the same contents either way.

Was this helpful? React with 👍 or 👎 to provide feedback.

Copilot

Pull request overview

This PR optimizes push_rules by deduplicating hostnames before checking them against existing rules, aiming to reduce repeated lookups for duplicate-heavy rule lists.

Changes:

Updates the existing_rules filtering path in push_rules.
Adds comments describing the intended duplicate-heavy performance improvement.

+        unique_hostnames_dict = {
+            h: None for h in dict.fromkeys(hostnames) if h not in existing_rules
+        }


⚡ Bolt: Optimize deduplication in push_rules

15ff5b1

Copilot AI review requested due to automatic review settings May 14, 2026 11:52

github-actions Bot added the python label May 14, 2026

Copilot started reviewing on behalf of abhimehro May 14, 2026 11:52 View session

codescene-delta-analysis Bot approved these changes May 14, 2026

View reviewed changes

devin-ai-integration Bot reviewed May 14, 2026

View reviewed changes

Copilot AI reviewed May 14, 2026

View reviewed changes

Comment thread main.py

Comment on lines +2199 to +2201

unique_hostnames_dict = {

h: None for h in dict.fromkeys(hostnames) if h not in existing_rules

}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

⚡ Bolt: Optimize deduplication in push_rules#788

⚡ Bolt: Optimize deduplication in push_rules#788
abhimehro wants to merge 1 commit into
mainfrom
jules-3793304560845718993-0e1b0ce9

abhimehro commented May 14, 2026 •

edited by devin-ai-integration Bot

Loading

Uh oh!

trunk-io Bot commented May 14, 2026

Uh oh!

cursor Bot commented May 14, 2026 •

edited

Loading

Uh oh!

codescene-delta-analysis Bot left a comment

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot May 14, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

abhimehro commented May 14, 2026 • edited by devin-ai-integration Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

trunk-io Bot commented May 14, 2026

Uh oh!

cursor Bot commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Summary

Uh oh!

codescene-delta-analysis Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot May 14, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

abhimehro commented May 14, 2026 •

edited by devin-ai-integration Bot

Loading

cursor Bot commented May 14, 2026 •

edited

Loading