gh-109052: Use the base opcode when comparing code objects#109107
Conversation
There was a problem hiding this comment.
Looks better?
diff --git a/Objects/codeobject.c b/Objects/codeobject.c
index 70a0c2ebd6..4180d216a3 100644
--- a/Objects/codeobject.c
+++ b/Objects/codeobject.c
@@ -1788,20 +1788,24 @@ code_richcompare(PyObject *self, PyObject *other, int op)
if (co_code == ENTER_EXECUTOR) {
const int exec_index = co_arg;
_PyExecutorObject *exec = co->co_executors->executors[exec_index];
- co_code = exec->vm_data.opcode;
+ co_code = _PyOpcode_Deopt[exec->vm_data.opcode];
co_arg = exec->vm_data.oparg;
}
+ else {
+ co_code = _Py_GetBaseOpcode(co, i);
+ }
assert(co_code != ENTER_EXECUTOR);
- co_code = _PyOpcode_Deopt[co_code];
if (cp_code == ENTER_EXECUTOR) {
const int exec_index = cp_arg;
_PyExecutorObject *exec = cp->co_executors->executors[exec_index];
- cp_code = exec->vm_data.opcode;
+ cp_code = _PyOpcode_Deopt[exec->vm_data.opcode];
cp_arg = exec->vm_data.oparg;
}
+ else {
+ cp_code = _Py_GetBaseOpcode(cp, i);
+ }
assert(cp_code != ENTER_EXECUTOR);
- cp_code = _PyOpcode_Deopt[cp_code];
if (co_code != cp_code || co_arg != cp_arg) {
goto unequal;
Sorry, something went wrong.
|
A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated. Once you have made the requested changes, please leave a comment on this pull request containing the phrase And if you don't make the requested changes, you will be put in the comfy chair! |
Sorry, something went wrong.
|
I don't think the current code is "wrong" functionality wise, as I'm not sure if |
Sorry, something went wrong.
vstinner
left a comment
There was a problem hiding this comment.
I dislike this approach, iterating on each bytecode at a specific index, deoptimize it, and then look for the next bytecode. The problem is that there are cases where i++ doesn't give you a bytecode but a cache :-( You have to take care of ENTER_EXECUTOR. The exact implementation of bytecode changes often these days, and it's hard to keep all functions consuming bytecode to remain correct.
Would it be possible instead to write a function which creates a copy of the deoptimized code in one shot? Something like PyCode_GetOriginalBytecode() which would create a bytes object.
Sorry, something went wrong.
I'm thinking at bug gh-107082 which has been fixed by commit 233b878. cc @gvanrossum |
Sorry, something went wrong.
I'm not saying this is the best way to do it (it was there already to compare the code objects), but the logic here is correct (at least for now). A function like |
Sorry, something went wrong.
Correct, but it's easier to implement when you iterate on a single code object, and this function can be reused in other places, and it can be moved closer to functions which modify bytecode. |
Sorry, something went wrong.
|
Also I believe this piece of code was originally written to avoid allocating a new piece of memory. A new function returning bytes would contradict that - not saying that's the wrong way to go, just to mention. |
Sorry, something went wrong.
Honestly, I don't think that code1==code2 operation matters for Python performance. It's usually used in the compiler/parser, not "at runtime". It shouldn't be part of "hot code". |
Sorry, something went wrong.
|
By the way, for me, it's surprising that _Py_GetBaseOpcode() can still return ENTER_EXECUTOR and require to get the executor (opcode, oparg). Each caller has to take care of that. Would it be possible to have a function which returns the original base case, so don't return ENTER_EXECUTOR? Maybe an "iterator-like" API which also gives an offset to the next "original" instruction (the next instruction which, one decoded, will give the original bytecode)? |
Sorry, something went wrong.
I'm just saying, this code is explicitly converted to the current status from what you wanted in bd2e47c . Do you think we should revert that change? I would guess there might be reasons behind it. |
Sorry, something went wrong.
I don't think that copying the original object is not beneficial for the scenario as you commented. |
Sorry, something went wrong.
corona10
left a comment
There was a problem hiding this comment.
Fixing itself looks good to me.
For overall policy, it would be a different issue.
I will leave the rest of review to @gvanrossum
Sorry, something went wrong.
|
I prefer to abstraint myself from reviewing this PR :-) I didn't follow recent developments about optimization, so I don't have a strong opinion. I just shared my opinion and feedback on the recent code changes and issues that I saw ;-) |
Sorry, something went wrong.
|
I confirm that this change fix issue #109052 crash. Without this change, Python does crash with the following command: With this change, the test does pass as expected, and Python doesn't crash. |
Sorry, something went wrong.
brandtbucher
left a comment
There was a problem hiding this comment.
I think this is a good fix for the problem at hand. Zooming out, I agree that we should put all of this "deopting" logic in one place (we're pretty close with _Py_GetBaseOpcode) and actually use it everywhere.
I'll merge in a couple of hours if nobody has any other concerns.
Sorry, something went wrong.
Let's just merge! |
Sorry, something went wrong.
After instrumentation, the opcode of the code object would become
INSTRUMENTED_LINEorINSTRUMENTED_INSTRUCTION, we should make sure to get the actual opcode when we compare them.This also belongs to gh-107265.