gh-94823: Improve coverage in tokenizer.c:valid_utf8 #94856
When loading a source file from disk, there is a separate UTF-8 validator distinct from the one in `unicode_decode_utf8`. This exercises that code path with the same set of invalid inputs as we use for testing the "other" UTF-8 decoder.
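A minimal sketch of how this code path can be exercised by hand, assuming the only way to reach the tokenizer's validator is to run a real file from disk. `run_source_bytes` is a hypothetical helper written for illustration, not the test added in this PR. It embeds the raw bytes in a string literal so the file is otherwise valid Python, then checks only that the interpreter exits with a non-zero status.

```python
import os
import subprocess
import sys
import tempfile


def run_source_bytes(payload: bytes) -> subprocess.CompletedProcess:
    """Run a temporary source file whose string literal contains `payload`.

    Hypothetical helper: the file is written in binary mode so that the
    bytes reach the interpreter unmodified.
    """
    # A few leading newlines put the string literal on line 4 of the file.
    source = b'\n\n\n"' + payload + b'"\n'
    fd, path = tempfile.mkstemp(suffix=".py")
    try:
        with os.fdopen(fd, "wb") as fp:
            fp.write(source)
        # Run the file with the same interpreter and capture its output.
        return subprocess.run([sys.executable, path], capture_output=True)
    finally:
        os.unlink(path)


if __name__ == "__main__":
    for payload in (b"abc", b"\xff", b"\x80"):
        result = run_source_bytes(payload)
        print(payload, "exit status:", result.returncode)
```

The captured `stderr` on the returned object can be inspected to see which error the interpreter reported and on which line.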
6db7d84 to e52f328 (July 15, 2022 18:11)
ericsnowcurrently left a comment
Thanks for working on this! It mostly looks good. I noticed a couple of small typos.
A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated. Once you have made the requested changes, please leave a comment on this pull request containing the phrase
I have made the requested changes; please review again
Thanks for making the requested changes! @ericsnowcurrently: please review the changes made to this pull request.
ericsnowcurrently left a comment
LGTM
Thanks @mdboom for the PR 🌮🎉.. I'm working now to backport this PR to: 3.11.
…94856) When loading a source file from disk, there is a separate UTF-8 validator distinct from the one in `unicode_decode_utf8`. This exercises that code path with the same set of invalid inputs as we use for testing the "other" UTF-8 decoder. (cherry picked from commit f215d7c) Co-authored-by: Michael Droettboom <mdboom@gmail.com>
Thanks for the test!
When loading a source file from disk, there is a separate UTF-8 validator
distinct from the one in `unicode_decode_utf8`. This exercises that code path
with the same set of invalid inputs as we use for testing the "other" UTF-8
decoder.
Automerge-Triggered-By: GH:ericsnowcurrently