Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

files with invalid utf-8 characters cause a crash #4

Open
pierstitus opened this issue Jan 25, 2022 · 0 comments
Open

files with invalid utf-8 characters cause a crash #4

pierstitus opened this issue Jan 25, 2022 · 0 comments

Comments

@pierstitus
Copy link
Owner

UnicodeDecodeError is raised:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x92 in position 2717: invalid start byte

options would be to use errors='backslashreplace' in the open() calls in klonpatiencediff.py, or else doing the whole diff algorithm in binary and leaving the errors in. errors='surrogateescape' doesn't work because the surrogates are not allowed by python to be converted back.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant