Skip to content

Comments

handle invalid UTF-8 more cleanly - replacement character#18

Open
slippycheeze wants to merge 1 commit intomlsmithjr:masterfrom
slippycheeze:utf8-decode-replace-invalid
Open

handle invalid UTF-8 more cleanly - replacement character#18
slippycheeze wants to merge 1 commit intomlsmithjr:masterfrom
slippycheeze:utf8-decode-replace-invalid

Conversation

@slippycheeze
Copy link

This updates the UTF-8 decoding in pytranscoder to treat non-UTF-8
output without error: it replaces invalid characters with the Unicode
REPLACEMENT CHARACTER U+FFFD.

These characters only occur in "user"-supplied metadata in media, such
as title, series, or encoder tags, so there is little to no risk of loss
of any useful information in the change.

This updates the UTF-8 decoding in pytranscoder to treat non-UTF-8
output without error: it replaces invalid characters with the Unicode
`REPLACEMENT CHARACTER` `U+FFFD`.

These characters only occur in "user"-supplied metadata in media, such
as title, series, or encoder tags, so there is little to no risk of loss
of any useful information in the change.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant