Change python invocation #1908

8W9aG · 2024-08-26T18:18:19Z

* Use -B to stop Python from writing .pyc files; writing them is wasted effort because the container is ephemeral.
* Set --check-hash-based-pycs to never, so Python does not read each source file in full and hash it to validate a cached .pyc; validation falls back to the timestamp check instead.
* Use -OO to enable runtime optimizations: assert statements, code conditional on __debug__, and docstrings are dropped.
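
As a rough illustration of what these flags change, the sketch below starts a child interpreter with the flags from the description and asks it how it configured itself. The helper name `child_flags` is hypothetical and is not part of this PR; the flags themselves are standard CPython command-line options.

```python
import subprocess
import sys


def child_flags(*interpreter_args: str) -> str:
    """Start a child interpreter with the given flags and report its settings.

    Hypothetical helper for illustration only; it is not code from this PR.
    """
    # The child prints whether bytecode writing is disabled and its -O level.
    probe = "import sys; print(sys.dont_write_bytecode, sys.flags.optimize)"
    result = subprocess.run(
        [sys.executable, *interpreter_args, "-c", probe],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


if __name__ == "__main__":
    # -B disables .pyc writing, -OO sets the optimization level to 2.
    print(child_flags("-B", "--check-hash-based-pycs", "never", "-OO"))  # True 2
```

The `--check-hash-based-pycs` setting is not visible through a public `sys` attribute, so the probe only reports the other two flags.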


nickstenning · 2024-08-27T09:49:19Z

I'm sure these optimizations improve things, but how do we know that and how are we tracking regressions? Can you add some benchmarks to the test suite that demonstrate these are improvements?

8W9aG · 2024-08-27T17:47:08Z

> I'm sure these optimizations improve things, but how do we know that and how are we tracking regressions? Can you add some benchmarks to the test suite that demonstrate these are improvements?

This is part of the "Compile All" tests in the Python Interpreter Speed Tests spreadsheet. Note that this PR was split out of a larger one. The follow-up PR will generate compiled Python bytecode using find, but we first need to build base images with findutils installed, hence the split. Going from no precompiled bytecode to precompiled bytecode yields a 50% decrease in interpreter boot time. The flags in this PR make the interpreter behave in a way that does not stomp on the precompiled bytecode. They should have other benefits in our case as well, but that has yet to be studied in isolation.
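
To make the precompilation step concrete, here is a minimal sketch that compiles every module under a directory ahead of time. It uses the standard library's `compileall` module as a stand-in; the follow-up PR described above is said to use find, and its exact command is not shown in this thread. The function name `precompile` is hypothetical.

```python
import compileall
import pathlib


def precompile(tree: pathlib.Path) -> list[pathlib.Path]:
    """Compile every .py file under `tree` and return the resulting .pyc files.

    Illustrative stand-in only; not the approach taken by the follow-up PR.
    """
    # quiet=1 suppresses the per-file listing but still reports errors.
    compileall.compile_dir(str(tree), quiet=1)
    # By default the compiled files land in __pycache__ directories.
    return sorted(tree.rglob("*.pyc"))
```

With the bytecode in place at build time, -B keeps the running container from trying to write it again.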

Sadly, I don't have any great ideas for preventing regressions here: general noise can be louder than the signal, and the study above repeats each measurement 100 times to establish the signal. We could in theory run the same test in our own test suite, but it would take on the order of hours, and running it on people's dev machines might introduce even more noise, making the results very non-deterministic. Let me know if you have any ideas on how to prevent regressions here.
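
For a sense of what such a measurement involves, the sketch below times interpreter startup repeatedly and takes the median to dampen noise. It is an assumption about the general shape of a startup benchmark, not the methodology of the spreadsheet; the name `startup_seconds` and the choice of the median are illustrative, while the default of 100 runs mirrors the repetition count mentioned above.

```python
import statistics
import subprocess
import sys
import time


def startup_seconds(interpreter_args: tuple[str, ...] = (), runs: int = 100) -> float:
    """Return the median wall-clock time to start the interpreter and exit.

    Hypothetical benchmark sketch; not the study referenced in this thread.
    """
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        # "-c pass" makes the child exit as soon as startup has finished.
        subprocess.run([sys.executable, *interpreter_args, "-c", "pass"], check=True)
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


if __name__ == "__main__":
    baseline = startup_seconds()
    tuned = startup_seconds(("-B", "--check-hash-based-pycs", "never"))
    print(f"baseline={baseline:.4f}s tuned={tuned:.4f}s")
```

On a bare interpreter with nothing to import, the two numbers will likely be close; the difference discussed in this thread concerns images with real dependencies and precompiled bytecode.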

technillogue · 2024-08-27T19:29:57Z

this won't help in prod as we override the entrypoint, the same change needs to be repeated elsewhere

8W9aG · 2024-08-27T20:28:23Z

> this won't help in prod as we override the entrypoint, the same change needs to be repeated elsewhere

I'll make sure the change is repeated on the k8s side. I do think the two should be kept as similar as possible, though, to minimise drift between prod and dev.
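
One way to keep the behaviour the same when the entrypoint is overridden is to set it through environment variables rather than command-line flags, which is what the commits on this PR describe (PYTHONDONTWRITEBYTECODE for -B, PYTHONOPTIMIZE for -OO). The sketch below checks that equivalence in a child interpreter; the helper name `child_settings` is hypothetical.

```python
import os
import subprocess
import sys


def child_settings(env_overrides: dict[str, str]) -> str:
    """Start a child interpreter with extra environment variables set.

    Hypothetical helper for illustration only; it is not code from this PR.
    """
    probe = "import sys; print(sys.dont_write_bytecode, sys.flags.optimize)"
    result = subprocess.run(
        [sys.executable, "-c", probe],
        env={**os.environ, **env_overrides},  # no flags on the command line
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


if __name__ == "__main__":
    # Same effect as passing -B and -OO on the command line.
    print(child_settings({"PYTHONDONTWRITEBYTECODE": "1", "PYTHONOPTIMIZE": "2"}))  # True 2
```

Because the variables travel with the image environment, they apply no matter which command ends up launching Python.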

8W9aG · 2024-09-05T21:38:58Z

Removed the optimize flags in line with the cluster discussion.

tempusfrangit

I don't see anything that stands out as wrong with this but I'm going to tag in @mattt for a second set of eyes.

mattt

This approach seems well-reasoned. I also don't have a good answer to @nickstenning's question about how to track regressions other than to try rolling this out and looking for aggregate error rates and performance metrics.

mattt

Approving so that we can test this out internally before cutting a release. Following the same general approach described in #1858 (review).

8W9aG requested review from mattt, nevillelyh, andreasjansson and tempusfrangit August 26, 2024 18:18

8W9aG mentioned this pull request Aug 27, 2024

Add strip to cog builds #1902

Merged

8W9aG added 5 commits August 28, 2024 16:19

Add performance environment variables to preamble

f300e3d

* PYTHONDONTWRITEBYTECODE is the same as specifying -B
* PYTHONOPTIMIZE is the same as specifying -OO

Merge branch 'main' into python-invocation-speedup

43e95eb

Signed-off-by: Will Sackfield <[email protected]>

Remove optimization flag and option

fcc7ee0

* Mirroring the server side

Merge branch 'main' into python-invocation-speedup

c1e9950

Signed-off-by: Will Sackfield <[email protected]>

Fix test to show correct python invocation

32f6452

tempusfrangit reviewed Sep 6, 2024

mattt reviewed Sep 9, 2024

mattt approved these changes Sep 9, 2024
