Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix,Doc] Fix BATCHED_PIPE_TIMEOUT refs and doc #2695

Open
wants to merge 1 commit into
base: gh/vmoens/67/base
Choose a base branch
from

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 14, 2025

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Jan 14, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2695

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 8 Unrelated Failures

As of commit 8c92c9c with merge base 61e05b3 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Jan 14, 2025
ghstack-source-id: 6e43c4ff1c319545cf0952abf6f35f3e7ed473e0
Pull Request resolved: #2695
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 14, 2025
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}5$. Worsened: $\large\color{#d91a1a}7$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.5267s 0.4441s 2.2515 Ops/s 2.2622 Ops/s $\color{#d91a1a}-0.47\%$
test_transformed 0.7127s 0.6269s 1.5952 Ops/s 1.6073 Ops/s $\color{#d91a1a}-0.75\%$
test_serial 1.4555s 1.3687s 0.7306 Ops/s 0.7219 Ops/s $\color{#35bf28}+1.20\%$
test_parallel 1.3014s 1.2043s 0.8304 Ops/s 0.8309 Ops/s $\color{#d91a1a}-0.07\%$
test_step_mdp_speed[True-True-True-True-True] 0.2263ms 30.9550μs 32.3049 KOps/s 32.1604 KOps/s $\color{#35bf28}+0.45\%$
test_step_mdp_speed[True-True-True-True-False] 55.4030μs 17.8390μs 56.0569 KOps/s 55.3263 KOps/s $\color{#35bf28}+1.32\%$
test_step_mdp_speed[True-True-True-False-True] 51.6160μs 17.0457μs 58.6658 KOps/s 57.3273 KOps/s $\color{#35bf28}+2.33\%$
test_step_mdp_speed[True-True-True-False-False] 41.5170μs 10.0741μs 99.2646 KOps/s 98.0213 KOps/s $\color{#35bf28}+1.27\%$
test_step_mdp_speed[True-True-False-True-True] 0.1250ms 34.1563μs 29.2772 KOps/s 30.6923 KOps/s $\color{#d91a1a}-4.61\%$
test_step_mdp_speed[True-True-False-True-False] 52.0460μs 19.7454μs 50.6448 KOps/s 49.9346 KOps/s $\color{#35bf28}+1.42\%$
test_step_mdp_speed[True-True-False-False-True] 48.7010μs 18.9352μs 52.8117 KOps/s 51.9531 KOps/s $\color{#35bf28}+1.65\%$
test_step_mdp_speed[True-True-False-False-False] 42.0680μs 11.9559μs 83.6408 KOps/s 82.5263 KOps/s $\color{#35bf28}+1.35\%$
test_step_mdp_speed[True-False-True-True-True] 65.8020μs 34.1772μs 29.2593 KOps/s 28.9374 KOps/s $\color{#35bf28}+1.11\%$
test_step_mdp_speed[True-False-True-True-False] 71.3710μs 21.6632μs 46.1612 KOps/s 45.5369 KOps/s $\color{#35bf28}+1.37\%$
test_step_mdp_speed[True-False-True-False-True] 69.8400μs 19.0521μs 52.4877 KOps/s 52.3631 KOps/s $\color{#35bf28}+0.24\%$
test_step_mdp_speed[True-False-True-False-False] 31.5390μs 11.9542μs 83.6528 KOps/s 82.6149 KOps/s $\color{#35bf28}+1.26\%$
test_step_mdp_speed[True-False-False-True-True] 90.2680μs 35.7867μs 27.9434 KOps/s 27.5120 KOps/s $\color{#35bf28}+1.57\%$
test_step_mdp_speed[True-False-False-True-False] 91.5000μs 23.3744μs 42.7818 KOps/s 42.3639 KOps/s $\color{#35bf28}+0.99\%$
test_step_mdp_speed[True-False-False-False-True] 71.7840μs 20.6034μs 48.5356 KOps/s 47.6056 KOps/s $\color{#35bf28}+1.95\%$
test_step_mdp_speed[True-False-False-False-False] 37.9310μs 13.7540μs 72.7061 KOps/s 72.2617 KOps/s $\color{#35bf28}+0.61\%$
test_step_mdp_speed[False-True-True-True-True] 95.4000μs 34.2286μs 29.2154 KOps/s 28.9303 KOps/s $\color{#35bf28}+0.99\%$
test_step_mdp_speed[False-True-True-True-False] 74.9280μs 21.8411μs 45.7852 KOps/s 45.1067 KOps/s $\color{#35bf28}+1.50\%$
test_step_mdp_speed[False-True-True-False-True] 45.1350μs 21.8581μs 45.7496 KOps/s 46.2014 KOps/s $\color{#d91a1a}-0.98\%$
test_step_mdp_speed[False-True-True-False-False] 65.2210μs 13.4568μs 74.3118 KOps/s 74.6081 KOps/s $\color{#d91a1a}-0.40\%$
test_step_mdp_speed[False-True-False-True-True] 71.7130μs 35.7849μs 27.9447 KOps/s 27.8683 KOps/s $\color{#35bf28}+0.27\%$
test_step_mdp_speed[False-True-False-True-False] 77.1040μs 23.5132μs 42.5294 KOps/s 42.2846 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[False-True-False-False-True] 2.5911ms 23.3741μs 42.7823 KOps/s 41.3063 KOps/s $\color{#35bf28}+3.57\%$
test_step_mdp_speed[False-True-False-False-False] 63.7890μs 15.1081μs 66.1898 KOps/s 65.6396 KOps/s $\color{#35bf28}+0.84\%$
test_step_mdp_speed[False-False-True-True-True] 0.1310ms 37.9668μs 26.3388 KOps/s 26.2352 KOps/s $\color{#35bf28}+0.39\%$
test_step_mdp_speed[False-False-True-True-False] 79.3800μs 25.3674μs 39.4206 KOps/s 39.6279 KOps/s $\color{#d91a1a}-0.52\%$
test_step_mdp_speed[False-False-True-False-True] 56.3450μs 23.9803μs 41.7009 KOps/s 42.2594 KOps/s $\color{#d91a1a}-1.32\%$
test_step_mdp_speed[False-False-True-False-False] 46.2770μs 15.2714μs 65.4820 KOps/s 66.0480 KOps/s $\color{#d91a1a}-0.86\%$
test_step_mdp_speed[False-False-False-True-True] 0.1064ms 39.4217μs 25.3667 KOps/s 25.0199 KOps/s $\color{#35bf28}+1.39\%$
test_step_mdp_speed[False-False-False-True-False] 52.2770μs 26.8549μs 37.2371 KOps/s 37.1186 KOps/s $\color{#35bf28}+0.32\%$
test_step_mdp_speed[False-False-False-False-True] 75.1500μs 24.9549μs 40.0722 KOps/s 39.1630 KOps/s $\color{#35bf28}+2.32\%$
test_step_mdp_speed[False-False-False-False-False] 47.9490μs 16.9615μs 58.9572 KOps/s 58.9939 KOps/s $\color{#d91a1a}-0.06\%$
test_values[generalized_advantage_estimate-True-True] 12.0766ms 9.9634ms 100.3678 Ops/s 103.4808 Ops/s $\color{#d91a1a}-3.01\%$
test_values[vec_generalized_advantage_estimate-True-True] 35.4978ms 33.3018ms 30.0284 Ops/s 29.9167 Ops/s $\color{#35bf28}+0.37\%$
test_values[td0_return_estimate-False-False] 0.2462ms 0.1781ms 5.6142 KOps/s 5.7288 KOps/s $\color{#d91a1a}-2.00\%$
test_values[td1_return_estimate-False-False] 26.4733ms 24.7290ms 40.4384 Ops/s 41.9881 Ops/s $\color{#d91a1a}-3.69\%$
test_values[vec_td1_return_estimate-False-False] 35.5771ms 33.3823ms 29.9560 Ops/s 29.8421 Ops/s $\color{#35bf28}+0.38\%$
test_values[td_lambda_return_estimate-True-False] 37.7727ms 35.8928ms 27.8607 Ops/s 29.4245 Ops/s $\textbf{\color{#d91a1a}-5.31\%}$
test_values[vec_td_lambda_return_estimate-True-False] 35.0203ms 33.4721ms 29.8756 Ops/s 29.9276 Ops/s $\color{#d91a1a}-0.17\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 11.7120ms 8.7126ms 114.7760 Ops/s 119.5051 Ops/s $\color{#d91a1a}-3.96\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.3054ms 1.8810ms 531.6415 Ops/s 515.7706 Ops/s $\color{#35bf28}+3.08\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5659ms 0.3620ms 2.7623 KOps/s 2.8034 KOps/s $\color{#d91a1a}-1.47\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 43.9912ms 41.9501ms 23.8378 Ops/s 23.5373 Ops/s $\color{#35bf28}+1.28\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.8331ms 3.0346ms 329.5289 Ops/s 329.9876 Ops/s $\color{#d91a1a}-0.14\%$
test_dqn_speed[False-None] 8.4442ms 1.4287ms 699.9454 Ops/s 716.1298 Ops/s $\color{#d91a1a}-2.26\%$
test_dqn_speed[False-backward] 3.1620ms 2.2141ms 451.6516 Ops/s 530.1276 Ops/s $\textbf{\color{#d91a1a}-14.80\%}$
test_dqn_speed[True-None] 0.1651s 0.5721ms 1.7481 KOps/s 2.1027 KOps/s $\textbf{\color{#d91a1a}-16.86\%}$
test_dqn_speed[True-backward] 0.9442ms 0.9001ms 1.1109 KOps/s 944.2108 Ops/s $\textbf{\color{#35bf28}+17.66\%}$
test_dqn_speed[reduce-overhead-None] 0.8151ms 0.4894ms 2.0435 KOps/s 2.0980 KOps/s $\color{#d91a1a}-2.60\%$
test_dqn_speed[reduce-overhead-backward] 0.9913ms 0.9098ms 1.0991 KOps/s 1.1177 KOps/s $\color{#d91a1a}-1.66\%$
test_ddpg_speed[False-None] 3.7480ms 2.9587ms 337.9845 Ops/s 348.8972 Ops/s $\color{#d91a1a}-3.13\%$
test_ddpg_speed[False-backward] 5.1916ms 4.1606ms 240.3519 Ops/s 245.9597 Ops/s $\color{#d91a1a}-2.28\%$
test_ddpg_speed[True-None] 1.2041ms 1.0378ms 963.5496 Ops/s 976.1319 Ops/s $\color{#d91a1a}-1.29\%$
test_ddpg_speed[True-backward] 2.0051ms 1.9402ms 515.4220 Ops/s 512.6499 Ops/s $\color{#35bf28}+0.54\%$
test_ddpg_speed[reduce-overhead-None] 1.5827ms 1.0522ms 950.4325 Ops/s 962.1331 Ops/s $\color{#d91a1a}-1.22\%$
test_ddpg_speed[reduce-overhead-backward] 2.0732ms 1.9578ms 510.7688 Ops/s 515.1053 Ops/s $\color{#d91a1a}-0.84\%$
test_sac_speed[False-None] 10.4848ms 8.2920ms 120.5979 Ops/s 123.5523 Ops/s $\color{#d91a1a}-2.39\%$
test_sac_speed[False-backward] 15.4698ms 11.2288ms 89.0569 Ops/s 92.4254 Ops/s $\color{#d91a1a}-3.64\%$
test_sac_speed[True-None] 2.3775ms 1.8728ms 533.9674 Ops/s 542.2521 Ops/s $\color{#d91a1a}-1.53\%$
test_sac_speed[True-backward] 3.9595ms 3.6821ms 271.5859 Ops/s 278.5352 Ops/s $\color{#d91a1a}-2.49\%$
test_sac_speed[reduce-overhead-None] 2.1769ms 1.8759ms 533.0833 Ops/s 541.0188 Ops/s $\color{#d91a1a}-1.47\%$
test_sac_speed[reduce-overhead-backward] 3.8813ms 3.6476ms 274.1556 Ops/s 285.2453 Ops/s $\color{#d91a1a}-3.89\%$
test_redq_speed[False-None] 15.4693ms 13.2433ms 75.5098 Ops/s 78.0074 Ops/s $\color{#d91a1a}-3.20\%$
test_redq_speed[False-backward] 23.9873ms 22.6100ms 44.2282 Ops/s 44.6993 Ops/s $\color{#d91a1a}-1.05\%$
test_redq_speed[True-None] 6.3285ms 4.9263ms 202.9904 Ops/s 197.6163 Ops/s $\color{#35bf28}+2.72\%$
test_redq_speed[True-backward] 13.0396ms 12.5882ms 79.4392 Ops/s 82.7181 Ops/s $\color{#d91a1a}-3.96\%$
test_redq_speed[reduce-overhead-None] 5.6468ms 4.9345ms 202.6555 Ops/s 210.3831 Ops/s $\color{#d91a1a}-3.67\%$
test_redq_speed[reduce-overhead-backward] 13.7004ms 12.6466ms 79.0727 Ops/s 82.7528 Ops/s $\color{#d91a1a}-4.45\%$
test_redq_deprec_speed[False-None] 15.3377ms 13.4878ms 74.1411 Ops/s 76.5752 Ops/s $\color{#d91a1a}-3.18\%$
test_redq_deprec_speed[False-backward] 21.0157ms 19.2827ms 51.8600 Ops/s 53.3697 Ops/s $\color{#d91a1a}-2.83\%$
test_redq_deprec_speed[True-None] 4.4994ms 3.7196ms 268.8492 Ops/s 276.5603 Ops/s $\color{#d91a1a}-2.79\%$
test_redq_deprec_speed[True-backward] 10.1869ms 8.7311ms 114.5331 Ops/s 115.9759 Ops/s $\color{#d91a1a}-1.24\%$
test_redq_deprec_speed[reduce-overhead-None] 4.8948ms 3.7809ms 264.4908 Ops/s 273.4926 Ops/s $\color{#d91a1a}-3.29\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.5245ms 8.6019ms 116.2533 Ops/s 121.3143 Ops/s $\color{#d91a1a}-4.17\%$
test_td3_speed[False-None] 8.6761ms 8.2389ms 121.3749 Ops/s 118.8277 Ops/s $\color{#35bf28}+2.14\%$
test_td3_speed[False-backward] 11.8565ms 10.8518ms 92.1508 Ops/s 93.2161 Ops/s $\color{#d91a1a}-1.14\%$
test_td3_speed[True-None] 1.9336ms 1.7751ms 563.3604 Ops/s 571.8389 Ops/s $\color{#d91a1a}-1.48\%$
test_td3_speed[True-backward] 3.6329ms 3.4236ms 292.0890 Ops/s 300.9664 Ops/s $\color{#d91a1a}-2.95\%$
test_td3_speed[reduce-overhead-None] 1.9222ms 1.7567ms 569.2571 Ops/s 574.8937 Ops/s $\color{#d91a1a}-0.98\%$
test_td3_speed[reduce-overhead-backward] 3.7761ms 3.4152ms 292.8108 Ops/s 300.5930 Ops/s $\color{#d91a1a}-2.59\%$
test_cql_speed[False-None] 40.5257ms 37.1951ms 26.8853 Ops/s 27.1089 Ops/s $\color{#d91a1a}-0.82\%$
test_cql_speed[False-backward] 51.9016ms 47.5527ms 21.0293 Ops/s 21.2059 Ops/s $\color{#d91a1a}-0.83\%$
test_cql_speed[True-None] 17.2071ms 16.0170ms 62.4338 Ops/s 62.7333 Ops/s $\color{#d91a1a}-0.48\%$
test_cql_speed[True-backward] 25.0540ms 23.0140ms 43.4518 Ops/s 44.8610 Ops/s $\color{#d91a1a}-3.14\%$
test_cql_speed[reduce-overhead-None] 16.9282ms 15.9517ms 62.6892 Ops/s 62.8735 Ops/s $\color{#d91a1a}-0.29\%$
test_cql_speed[reduce-overhead-backward] 24.1213ms 23.0498ms 43.3844 Ops/s 44.1252 Ops/s $\color{#d91a1a}-1.68\%$
test_a2c_speed[False-None] 8.8081ms 7.4260ms 134.6625 Ops/s 137.3126 Ops/s $\color{#d91a1a}-1.93\%$
test_a2c_speed[False-backward] 15.6610ms 14.6384ms 68.3134 Ops/s 69.8350 Ops/s $\color{#d91a1a}-2.18\%$
test_a2c_speed[True-None] 5.2566ms 4.2649ms 234.4694 Ops/s 235.1568 Ops/s $\color{#d91a1a}-0.29\%$
test_a2c_speed[True-backward] 11.9177ms 10.8047ms 92.5525 Ops/s 92.8727 Ops/s $\color{#d91a1a}-0.34\%$
test_a2c_speed[reduce-overhead-None] 4.7004ms 4.2347ms 236.1432 Ops/s 236.3788 Ops/s $\color{#d91a1a}-0.10\%$
test_a2c_speed[reduce-overhead-backward] 11.9343ms 10.8636ms 92.0509 Ops/s 91.9329 Ops/s $\color{#35bf28}+0.13\%$
test_ppo_speed[False-None] 8.6476ms 7.5474ms 132.4959 Ops/s 132.2406 Ops/s $\color{#35bf28}+0.19\%$
test_ppo_speed[False-backward] 15.5371ms 15.0159ms 66.5960 Ops/s 68.0300 Ops/s $\color{#d91a1a}-2.11\%$
test_ppo_speed[True-None] 4.0847ms 3.7109ms 269.4738 Ops/s 258.2551 Ops/s $\color{#35bf28}+4.34\%$
test_ppo_speed[True-backward] 10.7085ms 9.6726ms 103.3847 Ops/s 103.5285 Ops/s $\color{#d91a1a}-0.14\%$
test_ppo_speed[reduce-overhead-None] 4.7591ms 3.7155ms 269.1403 Ops/s 268.2246 Ops/s $\color{#35bf28}+0.34\%$
test_ppo_speed[reduce-overhead-backward] 10.1746ms 9.6474ms 103.6548 Ops/s 102.5569 Ops/s $\color{#35bf28}+1.07\%$
test_reinforce_speed[False-None] 7.7535ms 6.5799ms 151.9771 Ops/s 150.5927 Ops/s $\color{#35bf28}+0.92\%$
test_reinforce_speed[False-backward] 10.9065ms 9.8546ms 101.4758 Ops/s 98.7806 Ops/s $\color{#35bf28}+2.73\%$
test_reinforce_speed[True-None] 3.5246ms 2.6865ms 372.2310 Ops/s 371.1484 Ops/s $\color{#35bf28}+0.29\%$
test_reinforce_speed[True-backward] 9.0226ms 8.6151ms 116.0757 Ops/s 112.7865 Ops/s $\color{#35bf28}+2.92\%$
test_reinforce_speed[reduce-overhead-None] 3.3286ms 2.6635ms 375.4499 Ops/s 372.8133 Ops/s $\color{#35bf28}+0.71\%$
test_reinforce_speed[reduce-overhead-backward] 9.7305ms 8.5986ms 116.2983 Ops/s 116.6439 Ops/s $\color{#d91a1a}-0.30\%$
test_iql_speed[False-None] 39.8317ms 33.3868ms 29.9520 Ops/s 30.3449 Ops/s $\color{#d91a1a}-1.29\%$
test_iql_speed[False-backward] 50.6639ms 45.8308ms 21.8194 Ops/s 15.6369 Ops/s $\textbf{\color{#35bf28}+39.54\%}$
test_iql_speed[True-None] 11.8571ms 10.7583ms 92.9516 Ops/s 92.3157 Ops/s $\color{#35bf28}+0.69\%$
test_iql_speed[True-backward] 23.0870ms 21.6500ms 46.1893 Ops/s 45.2060 Ops/s $\color{#35bf28}+2.18\%$
test_iql_speed[reduce-overhead-None] 12.1842ms 10.7169ms 93.3105 Ops/s 91.4356 Ops/s $\color{#35bf28}+2.05\%$
test_iql_speed[reduce-overhead-backward] 23.2641ms 21.8265ms 45.8158 Ops/s 45.2733 Ops/s $\color{#35bf28}+1.20\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 0.3153s 6.6030ms 151.4469 Ops/s 200.5582 Ops/s $\textbf{\color{#d91a1a}-24.49\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8686ms 0.5249ms 1.9052 KOps/s 1.9100 KOps/s $\color{#d91a1a}-0.25\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8300ms 0.4962ms 2.0152 KOps/s 2.0025 KOps/s $\color{#35bf28}+0.64\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.8383ms 4.8447ms 206.4123 Ops/s 208.1659 Ops/s $\color{#d91a1a}-0.84\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.9425ms 0.5133ms 1.9480 KOps/s 1.9438 KOps/s $\color{#35bf28}+0.22\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7755ms 0.4920ms 2.0327 KOps/s 1.9845 KOps/s $\color{#35bf28}+2.43\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.2969ms 1.6596ms 602.5493 Ops/s 600.4268 Ops/s $\color{#35bf28}+0.35\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.3287ms 1.5653ms 638.8718 Ops/s 633.4724 Ops/s $\color{#35bf28}+0.85\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 9.2461ms 5.0459ms 198.1810 Ops/s 201.4850 Ops/s $\color{#d91a1a}-1.64\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1012ms 0.6499ms 1.5388 KOps/s 1.5320 KOps/s $\color{#35bf28}+0.44\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.1065ms 0.6284ms 1.5913 KOps/s 1.5406 KOps/s $\color{#35bf28}+3.29\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.5870ms 4.8394ms 206.6358 Ops/s 210.5492 Ops/s $\color{#d91a1a}-1.86\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.6577ms 0.5209ms 1.9197 KOps/s 1.9126 KOps/s $\color{#35bf28}+0.37\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8159ms 0.5014ms 1.9943 KOps/s 2.0069 KOps/s $\color{#d91a1a}-0.63\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.0114ms 4.6920ms 213.1300 Ops/s 214.0778 Ops/s $\color{#d91a1a}-0.44\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9693ms 0.5028ms 1.9889 KOps/s 528.4678 Ops/s $\textbf{\color{#35bf28}+276.35\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.9997ms 0.5057ms 1.9773 KOps/s 2.0636 KOps/s $\color{#d91a1a}-4.18\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.2865ms 4.8176ms 207.5743 Ops/s 195.7226 Ops/s $\textbf{\color{#35bf28}+6.06\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9847ms 0.6467ms 1.5462 KOps/s 1.5120 KOps/s $\color{#35bf28}+2.26\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9957ms 0.6288ms 1.5903 KOps/s 1.5891 KOps/s $\color{#35bf28}+0.08\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.7480ms 4.4042ms 227.0579 Ops/s 229.8751 Ops/s $\color{#d91a1a}-1.23\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.2923ms 2.4212ms 413.0255 Ops/s 426.3130 Ops/s $\color{#d91a1a}-3.12\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 2.2595ms 1.3483ms 741.6854 Ops/s 720.4376 Ops/s $\color{#35bf28}+2.95\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.9108ms 4.8550ms 205.9749 Ops/s 225.4862 Ops/s $\textbf{\color{#d91a1a}-8.65\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 5.3977ms 2.3153ms 431.9134 Ops/s 438.4227 Ops/s $\color{#d91a1a}-1.48\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9175ms 1.4201ms 704.2000 Ops/s 871.5296 Ops/s $\textbf{\color{#d91a1a}-19.20\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4183s 12.8614ms 77.7523 Ops/s 237.1605 Ops/s $\textbf{\color{#d91a1a}-67.22\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 7.7603ms 2.5232ms 396.3208 Ops/s 408.8542 Ops/s $\color{#d91a1a}-3.07\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 5.3237ms 1.4530ms 688.2172 Ops/s 648.3171 Ops/s $\textbf{\color{#35bf28}+6.15\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 14.4482ms 13.1516ms 76.0361 Ops/s 72.9674 Ops/s $\color{#35bf28}+4.21\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 16.1519ms 14.8753ms 67.2257 Ops/s 67.5079 Ops/s $\color{#d91a1a}-0.42\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 23.4501ms 21.6699ms 46.1469 Ops/s 44.4798 Ops/s $\color{#35bf28}+3.75\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.3918ms 14.9937ms 66.6945 Ops/s 67.0132 Ops/s $\color{#d91a1a}-0.48\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 24.2125ms 21.7627ms 45.9502 Ops/s 44.9565 Ops/s $\color{#35bf28}+2.21\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 17.6204ms 16.3011ms 61.3455 Ops/s 62.0377 Ops/s $\color{#d91a1a}-1.12\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}27$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.8320s 0.7456s 1.3413 Ops/s 1.3370 Ops/s $\color{#35bf28}+0.32\%$
test_transformed 0.9791s 0.9747s 1.0259 Ops/s 0.9993 Ops/s $\color{#35bf28}+2.67\%$
test_serial 2.1511s 2.1460s 0.4660 Ops/s 0.4581 Ops/s $\color{#35bf28}+1.72\%$
test_parallel 1.8199s 1.8005s 0.5554 Ops/s 0.5448 Ops/s $\color{#35bf28}+1.94\%$
test_step_mdp_speed[True-True-True-True-True] 0.2419ms 40.9933μs 24.3942 KOps/s 24.8288 KOps/s $\color{#d91a1a}-1.75\%$
test_step_mdp_speed[True-True-True-True-False] 99.5820μs 24.0181μs 41.6352 KOps/s 42.5714 KOps/s $\color{#d91a1a}-2.20\%$
test_step_mdp_speed[True-True-True-False-True] 52.4610μs 22.7482μs 43.9595 KOps/s 44.9258 KOps/s $\color{#d91a1a}-2.15\%$
test_step_mdp_speed[True-True-True-False-False] 46.5400μs 13.1734μs 75.9103 KOps/s 77.1333 KOps/s $\color{#d91a1a}-1.59\%$
test_step_mdp_speed[True-True-False-True-True] 76.5620μs 43.8342μs 22.8132 KOps/s 23.4326 KOps/s $\color{#d91a1a}-2.64\%$
test_step_mdp_speed[True-True-False-True-False] 77.7720μs 25.7518μs 38.8323 KOps/s 39.4155 KOps/s $\color{#d91a1a}-1.48\%$
test_step_mdp_speed[True-True-False-False-True] 54.3410μs 25.7272μs 38.8693 KOps/s 40.7485 KOps/s $\color{#d91a1a}-4.61\%$
test_step_mdp_speed[True-True-False-False-False] 45.5900μs 15.4596μs 64.6847 KOps/s 65.5162 KOps/s $\color{#d91a1a}-1.27\%$
test_step_mdp_speed[True-False-True-True-True] 83.3620μs 46.9262μs 21.3100 KOps/s 22.2590 KOps/s $\color{#d91a1a}-4.26\%$
test_step_mdp_speed[True-False-True-True-False] 53.7110μs 28.5638μs 35.0093 KOps/s 35.9136 KOps/s $\color{#d91a1a}-2.52\%$
test_step_mdp_speed[True-False-True-False-True] 56.8010μs 25.7074μs 38.8992 KOps/s 40.9530 KOps/s $\textbf{\color{#d91a1a}-5.02\%}$
test_step_mdp_speed[True-False-True-False-False] 42.9700μs 15.5090μs 64.4789 KOps/s 66.2617 KOps/s $\color{#d91a1a}-2.69\%$
test_step_mdp_speed[True-False-False-True-True] 85.5110μs 49.1428μs 20.3488 KOps/s 21.1621 KOps/s $\color{#d91a1a}-3.84\%$
test_step_mdp_speed[True-False-False-True-False] 73.5110μs 30.8763μs 32.3873 KOps/s 33.1545 KOps/s $\color{#d91a1a}-2.31\%$
test_step_mdp_speed[True-False-False-False-True] 59.4110μs 27.9011μs 35.8409 KOps/s 37.4580 KOps/s $\color{#d91a1a}-4.32\%$
test_step_mdp_speed[True-False-False-False-False] 44.4210μs 17.9430μs 55.7322 KOps/s 57.2958 KOps/s $\color{#d91a1a}-2.73\%$
test_step_mdp_speed[False-True-True-True-True] 78.8810μs 46.7663μs 21.3829 KOps/s 22.2228 KOps/s $\color{#d91a1a}-3.78\%$
test_step_mdp_speed[False-True-True-True-False] 56.2910μs 28.7541μs 34.7776 KOps/s 35.6429 KOps/s $\color{#d91a1a}-2.43\%$
test_step_mdp_speed[False-True-True-False-True] 54.9610μs 29.0888μs 34.3775 KOps/s 35.8143 KOps/s $\color{#d91a1a}-4.01\%$
test_step_mdp_speed[False-True-True-False-False] 43.0910μs 17.3985μs 57.4762 KOps/s 58.5599 KOps/s $\color{#d91a1a}-1.85\%$
test_step_mdp_speed[False-True-False-True-True] 82.5410μs 48.3906μs 20.6652 KOps/s 20.8838 KOps/s $\color{#d91a1a}-1.05\%$
test_step_mdp_speed[False-True-False-True-False] 67.0610μs 31.0178μs 32.2395 KOps/s 32.7048 KOps/s $\color{#d91a1a}-1.42\%$
test_step_mdp_speed[False-True-False-False-True] 3.1350ms 32.0637μs 31.1880 KOps/s 31.9541 KOps/s $\color{#d91a1a}-2.40\%$
test_step_mdp_speed[False-True-False-False-False] 45.8110μs 19.4474μs 51.4208 KOps/s 51.0143 KOps/s $\color{#35bf28}+0.80\%$
test_step_mdp_speed[False-False-True-True-True] 80.9810μs 52.0502μs 19.2122 KOps/s 19.9412 KOps/s $\color{#d91a1a}-3.66\%$
test_step_mdp_speed[False-False-True-True-False] 61.4810μs 33.4553μs 29.8906 KOps/s 30.3377 KOps/s $\color{#d91a1a}-1.47\%$
test_step_mdp_speed[False-False-True-False-True] 68.6810μs 31.6935μs 31.5522 KOps/s 32.5267 KOps/s $\color{#d91a1a}-3.00\%$
test_step_mdp_speed[False-False-True-False-False] 68.5320μs 19.6384μs 50.9206 KOps/s 51.2097 KOps/s $\color{#d91a1a}-0.56\%$
test_step_mdp_speed[False-False-False-True-True] 89.1020μs 52.8540μs 18.9201 KOps/s 19.3645 KOps/s $\color{#d91a1a}-2.30\%$
test_step_mdp_speed[False-False-False-True-False] 61.6810μs 35.5575μs 28.1234 KOps/s 28.4318 KOps/s $\color{#d91a1a}-1.08\%$
test_step_mdp_speed[False-False-False-False-True] 60.4720μs 33.5927μs 29.7684 KOps/s 30.5344 KOps/s $\color{#d91a1a}-2.51\%$
test_step_mdp_speed[False-False-False-False-False] 48.7410μs 21.8221μs 45.8251 KOps/s 46.6902 KOps/s $\color{#d91a1a}-1.85\%$
test_values[generalized_advantage_estimate-True-True] 25.5677ms 25.0846ms 39.8652 Ops/s 39.5139 Ops/s $\color{#35bf28}+0.89\%$
test_values[vec_generalized_advantage_estimate-True-True] 98.1070ms 2.8657ms 348.9528 Ops/s 351.5237 Ops/s $\color{#d91a1a}-0.73\%$
test_values[td0_return_estimate-False-False] 0.1038ms 81.5863μs 12.2570 KOps/s 12.1989 KOps/s $\color{#35bf28}+0.48\%$
test_values[td1_return_estimate-False-False] 58.3357ms 56.4742ms 17.7072 Ops/s 17.4942 Ops/s $\color{#35bf28}+1.22\%$
test_values[vec_td1_return_estimate-False-False] 1.3405ms 1.0916ms 916.0828 Ops/s 907.1238 Ops/s $\color{#35bf28}+0.99\%$
test_values[td_lambda_return_estimate-True-False] 92.9034ms 90.6039ms 11.0370 Ops/s 11.0402 Ops/s $\color{#d91a1a}-0.03\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.4142ms 1.0899ms 917.5434 Ops/s 895.0181 Ops/s $\color{#35bf28}+2.52\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.4883ms 25.1112ms 39.8228 Ops/s 37.2483 Ops/s $\textbf{\color{#35bf28}+6.91\%}$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0402ms 0.7615ms 1.3132 KOps/s 1.2901 KOps/s $\color{#35bf28}+1.79\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7850ms 0.6811ms 1.4683 KOps/s 1.4402 KOps/s $\color{#35bf28}+1.95\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5493ms 1.4873ms 672.3375 Ops/s 664.8353 Ops/s $\color{#35bf28}+1.13\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7954ms 0.6983ms 1.4322 KOps/s 1.4184 KOps/s $\color{#35bf28}+0.97\%$
test_dqn_speed[False-None] 6.9153ms 1.5381ms 650.1645 Ops/s 645.7707 Ops/s $\color{#35bf28}+0.68\%$
test_dqn_speed[False-backward] 2.2453ms 2.1493ms 465.2709 Ops/s 457.8510 Ops/s $\color{#35bf28}+1.62\%$
test_dqn_speed[True-None] 0.6759ms 0.5480ms 1.8249 KOps/s 1.7457 KOps/s $\color{#35bf28}+4.54\%$
test_dqn_speed[True-backward] 1.2729ms 1.2199ms 819.7130 Ops/s 882.5789 Ops/s $\textbf{\color{#d91a1a}-7.12\%}$
test_dqn_speed[reduce-overhead-None] 0.6164ms 0.5666ms 1.7650 KOps/s 1.7463 KOps/s $\color{#35bf28}+1.07\%$
test_dqn_speed[reduce-overhead-backward] 1.1145ms 1.0759ms 929.4626 Ops/s 1.0251 KOps/s $\textbf{\color{#d91a1a}-9.33\%}$
test_ddpg_speed[False-None] 3.2130ms 2.8808ms 347.1250 Ops/s 344.7979 Ops/s $\color{#35bf28}+0.67\%$
test_ddpg_speed[False-backward] 4.7330ms 4.2926ms 232.9585 Ops/s 238.7605 Ops/s $\color{#d91a1a}-2.43\%$
test_ddpg_speed[True-None] 1.1739ms 1.0805ms 925.5267 Ops/s 917.0449 Ops/s $\color{#35bf28}+0.92\%$
test_ddpg_speed[True-backward] 2.3707ms 2.2991ms 434.9600 Ops/s 460.4986 Ops/s $\textbf{\color{#d91a1a}-5.55\%}$
test_ddpg_speed[reduce-overhead-None] 1.2620ms 1.0983ms 910.5353 Ops/s 907.1136 Ops/s $\color{#35bf28}+0.38\%$
test_ddpg_speed[reduce-overhead-backward] 1.8088ms 1.7768ms 562.8120 Ops/s 601.6952 Ops/s $\textbf{\color{#d91a1a}-6.46\%}$
test_sac_speed[False-None] 8.4670ms 8.0732ms 123.8660 Ops/s 122.0971 Ops/s $\color{#35bf28}+1.45\%$
test_sac_speed[False-backward] 11.8235ms 11.3173ms 88.3603 Ops/s 89.2981 Ops/s $\color{#d91a1a}-1.05\%$
test_sac_speed[True-None] 1.6253ms 1.5290ms 654.0257 Ops/s 628.9997 Ops/s $\color{#35bf28}+3.98\%$
test_sac_speed[True-backward] 3.2575ms 3.2152ms 311.0232 Ops/s 307.3176 Ops/s $\color{#35bf28}+1.21\%$
test_sac_speed[reduce-overhead-None] 23.6252ms 12.8532ms 77.8015 Ops/s 77.8182 Ops/s $\color{#d91a1a}-0.02\%$
test_sac_speed[reduce-overhead-backward] 1.5227ms 1.3570ms 736.9083 Ops/s 647.1499 Ops/s $\textbf{\color{#35bf28}+13.87\%}$
test_redq_speed[False-None] 8.3831ms 7.5446ms 132.5446 Ops/s 130.4672 Ops/s $\color{#35bf28}+1.59\%$
test_redq_speed[False-backward] 12.3282ms 11.3954ms 87.7546 Ops/s 83.7143 Ops/s $\color{#35bf28}+4.83\%$
test_redq_speed[True-None] 2.0633ms 1.9846ms 503.8867 Ops/s 497.8270 Ops/s $\color{#35bf28}+1.22\%$
test_redq_speed[True-backward] 3.7082ms 3.6475ms 274.1585 Ops/s 270.6596 Ops/s $\color{#35bf28}+1.29\%$
test_redq_speed[reduce-overhead-None] 2.0855ms 1.9970ms 500.7606 Ops/s 472.3764 Ops/s $\textbf{\color{#35bf28}+6.01\%}$
test_redq_speed[reduce-overhead-backward] 3.7957ms 3.6506ms 273.9251 Ops/s 256.9738 Ops/s $\textbf{\color{#35bf28}+6.60\%}$
test_redq_deprec_speed[False-None] 9.5276ms 9.1145ms 109.7155 Ops/s 107.0289 Ops/s $\color{#35bf28}+2.51\%$
test_redq_deprec_speed[False-backward] 12.5626ms 12.1096ms 82.5789 Ops/s 78.9065 Ops/s $\color{#35bf28}+4.65\%$
test_redq_deprec_speed[True-None] 2.4346ms 2.3384ms 427.6355 Ops/s 423.2166 Ops/s $\color{#35bf28}+1.04\%$
test_redq_deprec_speed[True-backward] 4.4796ms 4.0349ms 247.8349 Ops/s 235.9693 Ops/s $\textbf{\color{#35bf28}+5.03\%}$
test_redq_deprec_speed[reduce-overhead-None] 2.5643ms 2.4067ms 415.5120 Ops/s 416.0047 Ops/s $\color{#d91a1a}-0.12\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.0860ms 3.9823ms 251.1108 Ops/s 235.7854 Ops/s $\textbf{\color{#35bf28}+6.50\%}$
test_td3_speed[False-None] 34.9347ms 8.2478ms 121.2442 Ops/s 122.4158 Ops/s $\color{#d91a1a}-0.96\%$
test_td3_speed[False-backward] 10.8310ms 10.3732ms 96.4019 Ops/s 92.7334 Ops/s $\color{#35bf28}+3.96\%$
test_td3_speed[True-None] 1.6428ms 1.5884ms 629.5630 Ops/s 629.5247 Ops/s $+0.01\%$
test_td3_speed[True-backward] 3.1622ms 3.1074ms 321.8153 Ops/s 300.9686 Ops/s $\textbf{\color{#35bf28}+6.93\%}$
test_td3_speed[reduce-overhead-None] 59.4958ms 26.4059ms 37.8703 Ops/s 37.5057 Ops/s $\color{#35bf28}+0.97\%$
test_td3_speed[reduce-overhead-backward] 1.4004ms 1.3149ms 760.5007 Ops/s 674.2684 Ops/s $\textbf{\color{#35bf28}+12.79\%}$
test_cql_speed[False-None] 18.1044ms 17.0612ms 58.6124 Ops/s 58.0241 Ops/s $\color{#35bf28}+1.01\%$
test_cql_speed[False-backward] 23.0254ms 22.1530ms 45.1406 Ops/s 43.5737 Ops/s $\color{#35bf28}+3.60\%$
test_cql_speed[True-None] 3.2825ms 2.9817ms 335.3825 Ops/s 340.0173 Ops/s $\color{#d91a1a}-1.36\%$
test_cql_speed[True-backward] 5.6664ms 5.1273ms 195.0342 Ops/s 187.7019 Ops/s $\color{#35bf28}+3.91\%$
test_cql_speed[reduce-overhead-None] 0.3620s 14.8004ms 67.5660 Ops/s 74.1369 Ops/s $\textbf{\color{#d91a1a}-8.86\%}$
test_cql_speed[reduce-overhead-backward] 1.6482ms 1.5499ms 645.2075 Ops/s 575.6688 Ops/s $\textbf{\color{#35bf28}+12.08\%}$
test_a2c_speed[False-None] 3.6601ms 3.2406ms 308.5876 Ops/s 293.5489 Ops/s $\textbf{\color{#35bf28}+5.12\%}$
test_a2c_speed[False-backward] 6.8279ms 6.2227ms 160.7022 Ops/s 153.8701 Ops/s $\color{#35bf28}+4.44\%$
test_a2c_speed[True-None] 1.0729ms 1.0098ms 990.2707 Ops/s 967.8909 Ops/s $\color{#35bf28}+2.31\%$
test_a2c_speed[True-backward] 2.7146ms 2.6005ms 384.5415 Ops/s 356.7738 Ops/s $\textbf{\color{#35bf28}+7.78\%}$
test_a2c_speed[reduce-overhead-None] 21.8037ms 11.6828ms 85.5959 Ops/s 85.6745 Ops/s $\color{#d91a1a}-0.09\%$
test_a2c_speed[reduce-overhead-backward] 1.0168ms 0.9784ms 1.0221 KOps/s 859.6455 Ops/s $\textbf{\color{#35bf28}+18.90\%}$
test_ppo_speed[False-None] 4.0596ms 3.7952ms 263.4932 Ops/s 261.9791 Ops/s $\color{#35bf28}+0.58\%$
test_ppo_speed[False-backward] 7.4721ms 6.9823ms 143.2184 Ops/s 138.2399 Ops/s $\color{#35bf28}+3.60\%$
test_ppo_speed[True-None] 1.1209ms 0.9632ms 1.0382 KOps/s 1.0077 KOps/s $\color{#35bf28}+3.03\%$
test_ppo_speed[True-backward] 2.6650ms 2.5612ms 390.4400 Ops/s 365.0458 Ops/s $\textbf{\color{#35bf28}+6.96\%}$
test_ppo_speed[reduce-overhead-None] 0.6806ms 0.5389ms 1.8556 KOps/s 68.8736 Ops/s $\textbf{\color{#35bf28}+2594.18\%}$
test_ppo_speed[reduce-overhead-backward] 1.0268ms 0.9732ms 1.0276 KOps/s 838.5369 Ops/s $\textbf{\color{#35bf28}+22.54\%}$
test_reinforce_speed[False-None] 2.4400ms 2.2901ms 436.6695 Ops/s 428.5400 Ops/s $\color{#35bf28}+1.90\%$
test_reinforce_speed[False-backward] 3.8009ms 3.3199ms 301.2151 Ops/s 287.1434 Ops/s $\color{#35bf28}+4.90\%$
test_reinforce_speed[True-None] 0.8859ms 0.8319ms 1.2020 KOps/s 1.1266 KOps/s $\textbf{\color{#35bf28}+6.69\%}$
test_reinforce_speed[True-backward] 2.4668ms 2.3974ms 417.1202 Ops/s 378.8973 Ops/s $\textbf{\color{#35bf28}+10.09\%}$
test_reinforce_speed[reduce-overhead-None] 0.2953s 12.1362ms 82.3981 Ops/s 88.1379 Ops/s $\textbf{\color{#d91a1a}-6.51\%}$
test_reinforce_speed[reduce-overhead-backward] 1.0917ms 1.0416ms 960.0302 Ops/s 834.8303 Ops/s $\textbf{\color{#35bf28}+15.00\%}$
test_iql_speed[False-None] 9.9409ms 9.4083ms 106.2894 Ops/s 105.5317 Ops/s $\color{#35bf28}+0.72\%$
test_iql_speed[False-backward] 13.6518ms 13.1732ms 75.9115 Ops/s 74.0285 Ops/s $\color{#35bf28}+2.54\%$
test_iql_speed[True-None] 1.8721ms 1.7600ms 568.1794 Ops/s 563.6353 Ops/s $\color{#35bf28}+0.81\%$
test_iql_speed[True-backward] 4.3136ms 4.2223ms 236.8359 Ops/s 224.3211 Ops/s $\textbf{\color{#35bf28}+5.58\%}$
test_iql_speed[reduce-overhead-None] 20.0070ms 11.5495ms 86.5839 Ops/s 86.0014 Ops/s $\color{#35bf28}+0.68\%$
test_iql_speed[reduce-overhead-backward] 1.4900ms 1.4429ms 693.0713 Ops/s 605.7692 Ops/s $\textbf{\color{#35bf28}+14.41\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 8.1222ms 6.5403ms 152.8988 Ops/s 153.7230 Ops/s $\color{#d91a1a}-0.54\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.5022ms 0.2705ms 3.6965 KOps/s 3.6611 KOps/s $\color{#35bf28}+0.97\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4112ms 0.2505ms 3.9914 KOps/s 3.8737 KOps/s $\color{#35bf28}+3.04\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.8199ms 6.2489ms 160.0270 Ops/s 160.6383 Ops/s $\color{#d91a1a}-0.38\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9391ms 0.2592ms 3.8580 KOps/s 3.3194 KOps/s $\textbf{\color{#35bf28}+16.23\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5631ms 0.3295ms 3.0352 KOps/s 3.3649 KOps/s $\textbf{\color{#d91a1a}-9.80\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.6573ms 1.4003ms 714.1214 Ops/s 776.5192 Ops/s $\textbf{\color{#d91a1a}-8.04\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.5300ms 1.3051ms 766.2372 Ops/s 799.7254 Ops/s $\color{#d91a1a}-4.19\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5779ms 6.4293ms 155.5391 Ops/s 155.5964 Ops/s $\color{#d91a1a}-0.04\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.6370ms 0.4684ms 2.1351 KOps/s 2.4247 KOps/s $\textbf{\color{#d91a1a}-11.94\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7914ms 0.3865ms 2.5871 KOps/s 2.5498 KOps/s $\color{#35bf28}+1.46\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5221ms 6.2777ms 159.2928 Ops/s 159.3659 Ops/s $\color{#d91a1a}-0.05\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7026ms 0.2715ms 3.6826 KOps/s 2.8488 KOps/s $\textbf{\color{#35bf28}+29.27\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4562ms 0.2486ms 4.0231 KOps/s 3.8752 KOps/s $\color{#35bf28}+3.82\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3993ms 6.1439ms 162.7618 Ops/s 159.8114 Ops/s $\color{#35bf28}+1.85\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.5175ms 0.2891ms 3.4590 KOps/s 3.1870 KOps/s $\textbf{\color{#35bf28}+8.54\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6910ms 0.2659ms 3.7614 KOps/s 3.3720 KOps/s $\textbf{\color{#35bf28}+11.55\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5402ms 6.4025ms 156.1890 Ops/s 156.0456 Ops/s $\color{#35bf28}+0.09\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0161ms 0.5091ms 1.9643 KOps/s 2.0535 KOps/s $\color{#d91a1a}-4.35\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7501ms 0.4858ms 2.0586 KOps/s 2.1647 KOps/s $\color{#d91a1a}-4.90\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.1488ms 5.5320ms 180.7668 Ops/s 183.4344 Ops/s $\color{#d91a1a}-1.45\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.1743ms 2.0420ms 489.7100 Ops/s 433.6098 Ops/s $\textbf{\color{#35bf28}+12.94\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.9749ms 1.2640ms 791.1238 Ops/s 836.3709 Ops/s $\textbf{\color{#d91a1a}-5.41\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.4385ms 5.5937ms 178.7721 Ops/s 183.8535 Ops/s $\color{#d91a1a}-2.76\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.6403ms 2.0980ms 476.6460 Ops/s 424.8726 Ops/s $\textbf{\color{#35bf28}+12.19\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.2552ms 1.2879ms 776.4339 Ops/s 864.5750 Ops/s $\textbf{\color{#d91a1a}-10.19\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.5098s 15.8950ms 62.9130 Ops/s 32.7787 Ops/s $\textbf{\color{#35bf28}+91.93\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.9655ms 2.2743ms 439.7004 Ops/s 536.6427 Ops/s $\textbf{\color{#d91a1a}-18.06\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.5639ms 1.3647ms 732.7792 Ops/s 802.8909 Ops/s $\textbf{\color{#d91a1a}-8.73\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 16.4623ms 15.6225ms 64.0104 Ops/s 62.9899 Ops/s $\color{#35bf28}+1.62\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 19.7103ms 17.5624ms 56.9399 Ops/s 56.6249 Ops/s $\color{#35bf28}+0.56\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 20.7648ms 19.7901ms 50.5304 Ops/s 48.9784 Ops/s $\color{#35bf28}+3.17\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 19.8845ms 17.8539ms 56.0102 Ops/s 55.7892 Ops/s $\color{#35bf28}+0.40\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 20.6669ms 19.7950ms 50.5177 Ops/s 49.9113 Ops/s $\color{#35bf28}+1.21\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 20.9607ms 19.0461ms 52.5042 Ops/s 50.5141 Ops/s $\color{#35bf28}+3.94\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants