bpo-34060: Report system load when running test suite for Windows#8357

ammaraskar

This is (mostly) a pure Python implementation of the other PR. It leverages the typeperf command which monitors performance counters and outputs them at a given interval. So every 5 seconds, typeperf can output the processor queue length into stdout.

subprocess.stdout.readline is a blocking call. Using a thread seemed like an obvious solution, but we can't achieve this with multiprocessing or a thread, because like Victor speculated in the previous bug report on this, it conflicts with test_multiprocessing and test_threading. Hence, I opted to use the asynchronous/overlapped IO API which was designed for async. Most of the diff actually just pertains to using this rather low level API.

This is almost a pure python implementation but there was one edge case where this would fail. Namely, when the python interpreter running the test suite crashes, this leaves an orphaned typeperf process running which refuses to die. This means that when the test suite is run with -j x and this situation happens:

python -m test -j2
├── python <test runner>
│   └── typeperf.exe
└── python *CRASHED*
    └── typeperf.exe

The big test coordinating python process will wait forever on the crashed python and consequently typeperf to terminate, which just doesn't happen by default in Windows. After reading up on the APIs, the right way to fix this is by using a Job Object to ask the OS to kill the child when the parent dies. Hence, there is a change in _winapi to make this happen. Unlike the last PR, this API is actually reusable and fit to be exposed to the public. It could even allow implementing things like bpo-5115 to be a lot easier.

https://bugs.python.org/issue34060

zooba

I like this! And I'll be happy to have support for job objects in there too :)

Looking at the test runs, the numbers seem to be consistent with other platforms, but since I don't have as good a feel for what to expect here, I'd like someone else who's been involved in this or the previous PR to sign off as well.

ammaraskar

I'd like someone else who's been involved in this or the previous PR to sign off as well.

Yeah I'd definitely like to get @vstinner's take on it since he is likely familiar with normal load values.

vstinner

subprocess.stdout.readline is a blocking call. Using a thread seemed like an obvious solution, but we can't achieve this with multiprocessing or a thread, because like Victor speculated in the previous bug report on this, it conflicts with test_multiprocessing and test_threading.

Did I say thatI? A thread is fine here. regrtest already uses threads to run tests in subprocesses when using -jN. faulthandler also uses a C thread to implement an hard timeout (dumping the Python traceback on timeout). regrtest is full of threads :-) Overlapped IO may be more complicated than a thread, no?

ammaraskar

This is the failure that shows up when using a thread:

0:00:21 load avg: 0.07 [ 16/417] test__xxsubinterpreters
test test__xxsubinterpreters failed -- Traceback (most recent call last):
  File "C:\Users\ammar\workspace\cpython\lib\test\test__xxsubinterpreters.py", line 473, in test_main
    self.assertTrue(interpreters.is_running(main))
RuntimeError: interpreter has more than one thread

ammaraskar

So I think I jumped the gun early with the job grouping stuff. There was a much easier solution to dealing with interpreter crashes in -jN mode. Only run the load tracking subprocess in the main interpreter coordinating the children. It's the only one that needs the information since it prints out the progress reports.

This is now actually just pure python and consequently a lot simpler.

bedevere-bot

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase I have made the requested changes; please review again. I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

And if you don't make the requested changes, you will be poked with soft cushions!

ammaraskar

I have made the requested changes; please review again

bedevere-bot

Thanks for making the requested changes!

@zooba, @vstinner: please review the changes made to this pull request.

csabella

It looks like there was a lot of interest and activity on this PR a few months ago and @zooba had approved it. @ammaraskar, could you resolve the merge conflict? I think that might be all that is needed, along with @vstinner's approval for merging.

Thanks!

It seems like my comments have been addressed.

vstinner

I'm sorry but I don't have the bandwidth right to review this change (test it manually).

@ammaraskar: You have to update your PR, there is now a conflict.

@zware: If you are confident that the change is good, please go ahead and merge it (once CI tests and the conflict is solved).

zware

I haven't researched whether this is the best way to do this to any extent, but this looks fine to me.

While Windows exposes the system processor queue length: the raw value used for load calculations on Unix systems, it does not provide an API to access the averaged value. Hence to calculate the load we must track and average it ourselves. We can't use multiprocessing or a thread to read it in the background while the tests run since using those would conflict with test_multiprocessing and test_xxsubprocess. Thus, we use Window's asynchronous IO API to run the tracker in the background with it sampling at the correct rate. When we wish to access the load we check to see if there's new data on the stream, if there is, we update our load values.

ammaraskar

Thanks for the reminder Cheryl, appreciate it! :)

@zware Just fixed the merge conflict, please take a look.
@zooba If you're available for a re-review, that would be appreciated as well.

csabella

Based on @zooba's approval and the other consensus on this, I've merged the PR. Thanks @ammaraskar for the PR and to @zware, @eryksun, @vstinner, and @zooba for the reviews! 🙂

vstinner

Thanks @ammaraskar!

* Clean up code which checked presence of os.{stat,lstat,chmod} (GH-11643) (cherry picked from commit 8377cd4) * bpo-36725: regrtest: add TestResult type (GH-12960) * Add TestResult and MultiprocessResult types to ensure that results always have the same fields. * runtest() now handles KeyboardInterrupt * accumulate_result() and format_test_result() now takes a TestResult * cleanup_test_droppings() is now called by runtest() and mark the test as ENV_CHANGED if the test leaks support.TESTFN file. * runtest() now includes code "around" the test in the test timing * Add print_warning() in test.libregrtest.utils to standardize how libregrtest logs warnings to ease parsing the test output. * support.unload() is now called with abstest rather than test_name * Rename 'test' variable/parameter to 'test_name' * dash_R(): remove unused the_module parameter * Remove unused imports (cherry picked from commit 4d29983) * bpo-36725: Refactor regrtest multiprocessing code (GH-12961) Rewrite run_tests_multiprocess() function as a new MultiprocessRunner class with multiple methods to better report errors and stop immediately when needed. Changes: * Worker processes are now killed immediately if tests are interrupted or if a test does crash (CHILD_ERROR): worker processes are killed. * Rewrite how errors in a worker thread are reported to the main thread. No longer ignore BaseException or parsing errors silently. * Remove 'finished' variable: use worker.is_alive() instead * Always compute omitted tests. Add Regrtest.get_executed() method. (cherry picked from commit 3cde440) * bpo-36719: regrtest always detect uncollectable objects (GH-12951) regrtest now always detects uncollectable objects. Previously, the check was only enabled by --findleaks. The check now also works with -jN/--multiprocess N. --findleaks becomes a deprecated alias to --fail-env-changed. (cherry picked from commit 75120d2) * bpo-34060: Report system load when running test suite for Windows (GH-8357) While Windows exposes the system processor queue length, the raw value used for load calculations on Unix systems, it does not provide an API to access the averaged value. Hence to calculate the load we must track and average it ourselves. We can't use multiprocessing or a thread to read it in the background while the tests run since using those would conflict with test_multiprocessing and test_xxsubprocess. Thus, we use Window's asynchronous IO API to run the tracker in the background with it sampling at the correct rate. When we wish to access the load we check to see if there's new data on the stream, if there is, we update our load values. (cherry picked from commit e16467a) * bpo-36719: Fix regrtest re-run (GH-12964) Properly handle a test which fail but then pass. Add test_rerun_success() unit test. (cherry picked from commit 837acc1) * bpo-36719: regrtest closes explicitly WindowsLoadTracker (GH-12965) Regrtest.finalize() now closes explicitly the WindowsLoadTracker instance. (cherry picked from commit 00db7c7)

the-knights-who-say-ni added the CLA signed label Jul 20, 2018

bedevere-bot added the awaiting review label Jul 20, 2018

ammaraskar force-pushed the windows_load2 branch from ede0a5a to da62440 Compare July 20, 2018 20:40

zooba approved these changes Jul 20, 2018

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting review labels Jul 20, 2018

zooba reviewed Jul 20, 2018

View reviewed changes

eryksun reviewed Jul 20, 2018

View reviewed changes

eryksun reviewed Jul 21, 2018

View reviewed changes

ammaraskar force-pushed the windows_load2 branch from fc8c05d to 63b57b2 Compare July 21, 2018 12:35

vstinner previously requested changes Jul 23, 2018

View reviewed changes

bedevere-bot removed the awaiting merge label Jul 23, 2018

bedevere-bot added the awaiting changes label Jul 23, 2018

ammaraskar force-pushed the windows_load2 branch 2 times, most recently from 6990d2c to a51f181 Compare July 25, 2018 02:38

bedevere-bot added awaiting change review and removed awaiting changes labels Aug 10, 2018

Move windows specific code to its own file

00a0895

Move imports to top of file

5c0e275

ammaraskar force-pushed the windows_load2 branch 3 times, most recently from fb0a14d to 31a71df Compare February 9, 2019 22:44

Add comment explaining check in libregrtest

a8df864

ammaraskar force-pushed the windows_load2 branch from 31a71df to a8df864 Compare February 10, 2019 02:13

csabella merged commit e16467a into python:master Apr 9, 2019

bedevere-bot removed the awaiting change review label Apr 9, 2019

ammaraskar mentioned this pull request Apr 10, 2019

bpo-34060: Report system load when running test suite for Windows #8287

Closed

Conversation

ammaraskar commented Jul 20, 2018 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zooba left a comment

Choose a reason for hiding this comment

Uh oh!

ammaraskar commented Jul 20, 2018

Uh oh!

vstinner commented Jul 21, 2018

Uh oh!

ammaraskar commented Jul 21, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ammaraskar commented Jul 21, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bedevere-bot commented Jul 23, 2018

Uh oh!

ammaraskar commented Aug 10, 2018

Uh oh!

bedevere-bot commented Aug 10, 2018

Uh oh!

csabella commented Jan 20, 2019

Uh oh!

vstinner commented Jan 23, 2019

Uh oh!

zware commented Feb 1, 2019

Uh oh!

ammaraskar commented Feb 10, 2019

Uh oh!

csabella commented Apr 9, 2019

Uh oh!

vstinner commented Apr 11, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

ammaraskar commented Jul 20, 2018 •

edited by bedevere-bot

Loading

ammaraskar commented Jul 21, 2018 •

edited

Loading

ammaraskar commented Jul 21, 2018 •

edited

Loading