Fix hardcoded zeros for `encode(0, ...)` and padding by sd-yip · Pull Request #27 · suminb/base62

sd-yip · 2024-02-02T02:51:05Z

Currently, decode(encode(0, charset=MY_CHARSET), charset=MY_CHARSET) may not yield 0 due to hardcoding. This proposed change eliminates hardcoding instances and resolves the issue.

google-cla · 2024-02-02T02:51:09Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

sd-yip · 2024-02-06T04:36:40Z

@suminb Would you mind taking a look?

Copilot

Pull request overview

Fixes incorrect roundtrips for encode(0, charset=...) and leading-zero byte padding by removing hardcoded "0" assumptions and consistently using charset[0] as the zero digit / marker.

Changes:

Update encode(0, charset=...) to return charset[0] instead of the hardcoded "0".
Update encodebytes/decodebytes leading-zero padding logic to use charset[0] instead of "0".
Add a new test validating encode/decode behavior with a non-default (QWERTY) charset.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`base62.py`	Removes hardcoded `"0"` usage by switching to `charset[0]` for zero and byte-padding markers.
`tests/test_basic.py`	Adds coverage for `encode(0)` / `decode()` with a custom charset to prevent regressions.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-10T09:15:30Z

base62.py

    # Encode the leading zeros as "0" followed by a character indicating the count.
    # This pattern may occur several times if there are many leading zeros.


The comment describing leading-zero padding is now inaccurate: the marker is no longer the literal string "0", but charset[0]. Please update the comment so it matches the new behavior (and avoids confusing readers when using non-default charsets).

Suggested change

# Encode the leading zeros as "0" followed by a character indicating the count.

# This pattern may occur several times if there are many leading zeros.

# Encode the leading zeros using charset[0] as a marker, followed by a character

# indicating the count. This pattern may occur several times if there are many

# leading zeros.

Copilot · 2026-02-10T09:15:30Z

tests/test_basic.py

+def test_basic_qwerty():
+    kwargs = {"charset": CHARSET_QWERTY}
+
+    assert base62.encode(0, **kwargs) == "1"
+    assert base62.decode("1", **kwargs) == 0
+    assert base62.decode("1111", **kwargs) == 0
+    assert base62.decode("111112", **kwargs) == 1


This adds coverage for encode(0) / decode() with a non-default charset, but the PR also changes the non-default-charset behavior of encodebytes() / decodebytes() leading-zero padding. Please add assertions that encodebytes/decodebytes roundtrip (including leading null bytes) with CHARSET_QWERTY (and/or CHARSET_INVERTED) so the padding changes are exercised by tests.

sd-yip added 2 commits February 2, 2024 11:17

Add test_basic_qwerty

8db024c

Fix hardcoded zeros for encode(0, ...) and padding

906aa73

suminb requested a review from Copilot February 10, 2026 09:13

Copilot started reviewing on behalf of suminb February 10, 2026 09:13 View session

Copilot AI reviewed Feb 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Fix hardcoded zeros for `encode(0, ...)` and padding#27

Fix hardcoded zeros for `encode(0, ...)` and padding#27
sd-yip wants to merge 2 commits intosuminb:developfrom
sd-yip:fix-hardcoded-zeros

sd-yip commented Feb 2, 2024

Uh oh!

google-cla bot commented Feb 2, 2024

Uh oh!

sd-yip commented Feb 6, 2024

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 10, 2026

Uh oh!

Copilot AI Feb 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		# Encode the leading zeros as "0" followed by a character indicating the count.
		# This pattern may occur several times if there are many leading zeros.

-    # Encode the leading zeros as "0" followed by a character indicating the count.
-    # This pattern may occur several times if there are many leading zeros.
+    # Encode the leading zeros using charset[0] as a marker, followed by a character
+    # indicating the count. This pattern may occur several times if there are many
+    # leading zeros.

Comments

Conversation

sd-yip commented Feb 2, 2024

Uh oh!

google-cla bot commented Feb 2, 2024

Uh oh!

sd-yip commented Feb 6, 2024

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant