Feature: cross validate timings #233
base: main
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

@@            Coverage Diff             @@
##              main      #233       +/-   ##
============================================
  Coverage   100.00%   100.00%
============================================
  Files           45        59      +14
  Lines         2242      3893    +1651
============================================
+ Hits          2242      3893    +1651

☔ View full report in Codecov by Sentry.
if compute_timings:
    for data in actual["metrics"]:
        assert len(expected_keys.intersection(set(data.keys()))) == 2
We are checking only the intersection of keys because the timings differ a lot between runs? If that is the case, let's just round the actual results to some reasonable number of digits and check the values. Or check that these values are less than some threshold.
Also, we need to check that we didn't mess up the metrics results, so we need to check all of the values like you do below for the case when timings are not needed (but, as I wrote in another place, you don't need to check the compute_timings=False scenario in this test).
done, chose threshold = 0.5
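For reference, a minimal sketch of how that threshold check could look (the "recommend_time" key name is an assumption, only "fit_time" is visible in this diff; the helper name is illustrative, not the PR's actual test code):

def assert_timings_within_threshold(actual: dict, threshold: float = 0.5) -> None:
    # Timings vary between runs, so instead of comparing exact values we only
    # require each recorded timing to be non-negative and below the threshold.
    for split_metrics in actual["metrics"]:
        for key in ("fit_time", "recommend_time"):
            assert 0.0 <= split_metrics[key] < threshold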
if timings is not None:
    start_time = time.time()
    yield
    timings[label] = round(time.time() - start_time, 2)
We needed to round in the tests below, not in the actual code; we are adding rounding just to pass the tests, so it shouldn't affect the framework code itself. Let's round to 5 digits here.
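As an illustration only (the helper name and signature here are guesses based on the snippet above, not the PR's actual code), rounding to 5 digits inside the timing context manager would look roughly like this:

import time
from contextlib import contextmanager

@contextmanager
def log_time(timings, label):
    # Store the elapsed wall-clock time of the wrapped block under `label`,
    # rounded to 5 digits so the precision loss is negligible.
    if timings is not None:
        start_time = time.time()
        yield
        timings[label] = round(time.time() - start_time, 5)
    else:
        yield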
"precision@2": 0.375, | ||
"recall@1": 0.5, | ||
"intersection_popular": 0.75, | ||
"fit_time": 0.0, |
Let's drop the timings from the expected dicts and keep the threshold comparison: just pop the timings from the actual dict.
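A hedged sketch of that comparison (function and key names are assumptions; only "fit_time" appears in the diff above): timings are popped out of each actual per-split dict and only range-checked, while the remaining metric values are compared exactly against expected dicts that contain no timing keys.

def assert_metrics_with_timings(actual, expected_metrics, threshold=0.5):
    # Remove the timing keys before the exact comparison, checking only that
    # each timing falls within a reasonable range.
    for split_metrics, expected in zip(actual["metrics"], expected_metrics):
        for key in ("fit_time", "recommend_time"):
            timing = split_metrics.pop(key)
            assert 0.0 <= timing < threshold
        assert split_metrics == expected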
"ref_models,validate_ref_models,expected_metrics,compute_timings", | ||
( | ||
( | ||
["popular"], |
Let's keep only ["popular"] and not put ref_models in parametrize. Let's iterate over validate_ref_models and compute_timings; only 4 test cases are needed.
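A possible shape for that parametrization (the test name and body are made up for illustration): stacking two parametrize decorators over the boolean flags yields the 2 x 2 = 4 cases, while ref_models stays fixed inside the test.

import pytest

@pytest.mark.parametrize("compute_timings", (False, True))
@pytest.mark.parametrize("validate_ref_models", (False, True))
def test_cross_validate_timings(validate_ref_models, compute_timings):
    # ref_models is fixed instead of being parametrized.
    ref_models = ["popular"]
    ...  # call cross_validate and check metrics/timings as discussed above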
Description
Closes issue #138
Type of change
How Has This Been Tested?
Before submitting a PR, please check yourself against the following list. It would save us quite a lot of time.