Fix fix_test.py#6415

youknowone

with better collecting, async support and win32

I'd like to rename the script to mark_test_failures or something like that to shows better what it is

Summary by CodeRabbit

Chores
- Improved test automation script with enhanced reliability, better handling of test identification, and cross-platform compatibility improvements.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai

Walkthrough

The test-fixing script undergoes substantial refactoring to replace in-place line-wise modifications with a locate-modify-apply sequence. Results parsing is updated to extract failing tests from targeted patterns, new helper functions locate test definitions and apply modifications in reverse order, and file I/O transitions to Path-based methods with Windows executable support.

Changes

Cohort / File(s)	Summary
Test result parsing and modification flow `scripts/fix_test.py`	Results parsing now collects only FAIL/ERROR results and extracts test names via targeted pattern matching; replaces previous general test line parsing with new `find_test_lineno()` helper to locate test function definitions by class and method name; introduces `apply_modifications()` to apply multiple modifications in reverse line order; removes old `modify_test`/`modify_test_v2` functions; updates file I/O to use `Path.read_text()` and `write_text()`; adds Windows support for RustPython executable naming (`.exe` suffix); adds input path validation; improves logging to report modification targets and counts.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20–25 minutes

find_test_lineno() logic: Verify AST-based test function location handles both regular and async test methods correctly, and edge cases where tests cannot be located.
apply_modifications() logic: Confirm reverse-order application prevents line number offset issues when inserting decorators and comments.
Results parsing: Ensure FAIL/ERROR extraction correctly parses test output and extracts class/method names from target patterns.
Deduplication and file I/O: Verify modifications list deduplicates by (class, method) and that Path-based I/O handles encoding/edge cases appropriately.

Suggested reviewers

reactive-firewall

Poem

🐰 Hops through test logs with gleeful delight,
Locating lost tests and making them right,
No more line shifts—we reverse with care,
Each modification finds its true lair!
With Path and with parsing, the script now shines,
A refactored warren of fix-perfect lines! 🌙

Pre-merge checks and finishing touches

❌ Failed checks (1 warning, 1 inconclusive)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 40.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.
Title check	❓ Inconclusive	The title 'Fix fix_test.py' is vague and generic. While it references the file being modified, it does not meaningfully convey the specific improvements made (better collecting, async support, Windows compatibility, or the refactored test-handling approach).	Revise the title to be more descriptive of the key improvements, such as 'Refactor fix_test.py with async support and Windows compatibility' or 'Improve test failure collection and apply modifications robustly'.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (3)

scripts/fix_test.py (3)
105-124: find_test_lineno() should iterate a.body directly and avoid over-restrictive structure assumptions.
Current ast.iter_fields() + key == "body" works but is indirect; also consider catching SyntaxError from ast.parse (script should warn and skip instead of crashing on a bad file).
 def find_test_lineno(file: str, test: list[str]) -> tuple[int, int] | None:
@@
-    a = ast.parse(file)
-    for key, node in ast.iter_fields(a):
-        if key == "body":
-            for n in node:
+    try:
+        a = ast.parse(file)
+    except SyntaxError:
+        return None
+
+    for n in a.body:
                 match n:
                     case ast.ClassDef():
                         if len(test) == 2 and test[0] == n.name:
                             for fn in n.body:
                                 match fn:
                                     case ast.FunctionDef() | ast.AsyncFunctionDef():
                                         if fn.name == test[-1]:
                                             return (fn.lineno, fn.col_offset)
                     case ast.FunctionDef() | ast.AsyncFunctionDef():
                         if n.name == test[0] and len(test) == 1:
                             return (n.lineno, n.col_offset)
     return None
140-153: Add a timeout (and surface stderr) to avoid hanging forever on known-hanging tests.
Docs say “skip the tests that hang” but the script itself can still hang in subprocess.run(...).
-    result = subprocess.run(
+    result = subprocess.run(
         [rustpython_location, "-m", "test", "-v", test_name],
         capture_output=True,
         text=True,
+        timeout=60 * 20,
     )
Optionally catch subprocess.TimeoutExpired and print a clear message with next steps.

156-188: Batch “locate → apply in reverse” flow is a good direction; just ensure the “platform/force” flags aren’t dead.
The new Path-based I/O + reverse-apply avoids offset bugs—nice. But --force and --platform are currently unused in this file (at least in the shown code); either wire them in (e.g., allow re-marking / platform-gated TODO text) or remove them to avoid misleading CLI. Based on repo guidance, ensure the inserted markers match the project’s required patterns for Lib/test/** edits (unittest.expectedFailure / unittest.skip("TODO: RustPython ...") + # TODO: RUSTPYTHON ...). Based on learnings, apply only those modifications to tests.

📜 Review details

Configuration used: Path: .coderabbit.yml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5a5b721 and 7bd59fa.

📒 Files selected for processing (1)

scripts/fix_test.py (3 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

**/*.py

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

**/*.py: In most cases, Python code should not be edited; bug fixes should be made through Rust code modifications only
Follow PEP 8 style for custom Python code
Use ruff for linting Python code

Files:

scripts/fix_test.py

🧠 Learnings (12)

📓 Common learnings

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to Lib/test/**/*.py : Only acceptable modifications to test files are: (1) Removing `unittest.expectedFailure` decorators and upper TODO comments when tests actually pass, (2) Adding `unittest.expectedFailure` decorators when tests cannot be fixed

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to Lib/test/**/*.py : Only acceptable modifications to test files are: (1) Removing `unittest.expectedFailure` decorators and upper TODO comments when tests actually pass, (2) Adding `unittest.expectedFailure` decorators when tests cannot be fixed

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to Lib/test/**/*.py : When modifying test files in `Lib/test/`, use `unittest.skip("TODO: RustPython <reason>")` or `unittest.expectedFailure` with `# TODO: RUSTPYTHON <reason>` comment to mark unsupported features

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to Lib/test/**/*.py : NEVER comment out or delete test code lines except for removing `unittest.expectedFailure` decorators and upper TODO comments

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to Lib/test/**/*.py : When a test cannot pass due to missing language features, keep it as expectedFailure and document the reason instead of modifying test code

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Use `cargo run -- script.py` instead of the standard `python` command to run and test Python code with RustPython

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Run `cargo test --workspace --exclude rustpython_wasm` to execute Rust unit tests

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-09-28T22:22:55.921Z

Learnt from: reactive-firewall
Repo: RustPython/RustPython PR: 6176
File: .github/workflows/Check_Tests.yml:133-141
Timestamp: 2025-09-28T22:22:55.921Z
Learning: In the RustPython project's CI-5974-Test-RustPython-Integration action, the override-rustpython-path input is marked as required but has runtime fallback logic that defaults to RUSTPYTHONPATH environment variable or "Lib" if neither is provided, making explicit specification unnecessary in most cases.

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to Lib/**/*.py : Minimize modifications to CPython standard library files in the `Lib/` directory; modifications should be minimal and only to work around RustPython limitations

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to Lib/**/*.py : Add a `# TODO: RUSTPYTHON` comment when modifications are made to files in the `Lib/` directory

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Applies to **/*.py : In most cases, Python code should not be edited; bug fixes should be made through Rust code modifications only

Applied to files:

scripts/fix_test.py

📚 Learning: 2025-11-29T12:17:28.606Z

Learnt from: CR
Repo: RustPython/RustPython PR: 0
File: .github/copilot-instructions.md:0-0
Timestamp: 2025-11-29T12:17:28.606Z
Learning: Use the `python` command explicitly when comparing behavior with CPython or running test suites against CPython

Applied to files:

scripts/fix_test.py

🪛 Flake8 (7.3.0)

scripts/fix_test.py

[error] 19-19: 'itertools' imported but unused

(F401)

[error] 20-20: 'platform' imported but unused

(F401)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Check Rust code with clippy

🔇 Additional comments (1)

scripts/fix_test.py (1)

55-95: Python code modifications not appropriate per project guidelines.

While the parsing logic concern has merit (the code relies on specific output markers like "Run tests sequentially" that could be fragile), the coding guidelines explicitly state: "In most cases, Python code should not be edited; bug fixes should be made through Rust code modifications only." If the underlying test runner or output format is problematic, address the root cause in Rust rather than modifying the Python parsing code.

youknowone requested a review from arihant2math December 12, 2025 10:46

youknowone added the skip:ci Skip running the ci label Dec 12, 2025

Fix fix_test.py

7bd59fa

youknowone force-pushed the fix-test branch from 72ae328 to 7bd59fa Compare December 12, 2025 10:46

youknowone marked this pull request as ready for review December 12, 2025 10:46

coderabbitai Bot reviewed Dec 12, 2025

View reviewed changes

ShaharNaveh reviewed Dec 12, 2025

View reviewed changes

arihant2math approved these changes Dec 19, 2025

View reviewed changes

youknowone deleted the fix-test branch December 20, 2025 00:55

coderabbitai Bot mentioned this pull request Jan 10, 2026

preparing step to integrate fix_test into lib_updater #6698

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix fix_test.py#6415