[CI] Fix nightly build #861

Merged (8 commits) on Jul 9, 2024

Conversation

@vmoens (Contributor) commented Jul 9, 2024

Attempt to fix nightly builds with actions/checkout#1809

@facebook-github-bot added the CLA Signed label Jul 9, 2024

github-actions bot commented Jul 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 144. Improved: $\large\color{#35bf28}26$. Worsened: $\large\color{#d91a1a}3$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 45.4940μs 15.6233μs 64.0071 KOps/s 57.5041 KOps/s $\textbf{\color{#35bf28}+11.31\%}$
test_plain_set_stack_nested 41.2670μs 15.8219μs 63.2034 KOps/s 56.6656 KOps/s $\textbf{\color{#35bf28}+11.54\%}$
test_plain_set_nested_inplace 71.2390μs 17.8172μs 56.1257 KOps/s 51.6468 KOps/s $\textbf{\color{#35bf28}+8.67\%}$
test_plain_set_stack_nested_inplace 0.1040ms 17.8468μs 56.0325 KOps/s 49.9104 KOps/s $\textbf{\color{#35bf28}+12.27\%}$
test_items 43.1410μs 2.6006μs 384.5261 KOps/s 373.5889 KOps/s $\color{#35bf28}+2.93\%$
test_items_nested 0.4356ms 0.2699ms 3.7049 KOps/s 3.6963 KOps/s $\color{#35bf28}+0.23\%$
test_items_nested_locked 1.4869ms 0.2765ms 3.6169 KOps/s 3.6510 KOps/s $\color{#d91a1a}-0.93\%$
test_items_nested_leaf 0.1402ms 78.2088μs 12.7863 KOps/s 12.4817 KOps/s $\color{#35bf28}+2.44\%$
test_items_stack_nested 0.4706ms 0.2746ms 3.6415 KOps/s 3.7029 KOps/s $\color{#d91a1a}-1.66\%$
test_items_stack_nested_leaf 0.1413ms 80.3831μs 12.4404 KOps/s 12.4324 KOps/s $\color{#35bf28}+0.06\%$
test_items_stack_nested_locked 1.1720ms 0.2749ms 3.6376 KOps/s 3.6617 KOps/s $\color{#d91a1a}-0.66\%$
test_keys 29.7950μs 3.8227μs 261.5973 KOps/s 257.7293 KOps/s $\color{#35bf28}+1.50\%$
test_keys_nested 0.2379ms 0.1396ms 7.1633 KOps/s 7.2071 KOps/s $\color{#d91a1a}-0.61\%$
test_keys_nested_locked 0.7565ms 0.1452ms 6.8853 KOps/s 6.9840 KOps/s $\color{#d91a1a}-1.41\%$
test_keys_nested_leaf 0.2046ms 0.1184ms 8.4484 KOps/s 8.5258 KOps/s $\color{#d91a1a}-0.91\%$
test_keys_stack_nested 0.2416ms 0.1390ms 7.1967 KOps/s 7.1810 KOps/s $\color{#35bf28}+0.22\%$
test_keys_stack_nested_leaf 0.1983ms 0.1182ms 8.4631 KOps/s 8.5618 KOps/s $\color{#d91a1a}-1.15\%$
test_keys_stack_nested_locked 0.2529ms 0.1436ms 6.9645 KOps/s 6.9707 KOps/s $\color{#d91a1a}-0.09\%$
test_values 11.2460μs 1.1357μs 880.5273 KOps/s 864.5501 KOps/s $\color{#35bf28}+1.85\%$
test_values_nested 0.1033ms 50.8263μs 19.6749 KOps/s 19.6702 KOps/s $\color{#35bf28}+0.02\%$
test_values_nested_locked 99.5250μs 51.0708μs 19.5807 KOps/s 19.6310 KOps/s $\color{#d91a1a}-0.26\%$
test_values_nested_leaf 0.1377ms 46.2923μs 21.6019 KOps/s 21.6148 KOps/s $\color{#d91a1a}-0.06\%$
test_values_stack_nested 91.0200μs 51.7041μs 19.3408 KOps/s 19.3047 KOps/s $\color{#35bf28}+0.19\%$
test_values_stack_nested_leaf 97.7620μs 46.1313μs 21.6773 KOps/s 20.9248 KOps/s $\color{#35bf28}+3.60\%$
test_values_stack_nested_locked 90.9290μs 51.6911μs 19.3457 KOps/s 19.4823 KOps/s $\color{#d91a1a}-0.70\%$
test_membership 18.0130μs 1.3530μs 739.1240 KOps/s 739.7339 KOps/s $\color{#d91a1a}-0.08\%$
test_membership_nested 32.5510μs 3.4348μs 291.1386 KOps/s 292.8043 KOps/s $\color{#d91a1a}-0.57\%$
test_membership_nested_leaf 31.1780μs 3.4008μs 294.0475 KOps/s 290.4903 KOps/s $\color{#35bf28}+1.22\%$
test_membership_stacked_nested 30.5260μs 3.3822μs 295.6640 KOps/s 294.2876 KOps/s $\color{#35bf28}+0.47\%$
test_membership_stacked_nested_leaf 47.4390μs 3.4309μs 291.4645 KOps/s 284.3147 KOps/s $\color{#35bf28}+2.51\%$
test_membership_nested_last 59.5710μs 4.1555μs 240.6450 KOps/s 234.3644 KOps/s $\color{#35bf28}+2.68\%$
test_membership_nested_leaf_last 34.0940μs 4.1450μs 241.2552 KOps/s 232.9035 KOps/s $\color{#35bf28}+3.59\%$
test_membership_stacked_nested_last 42.3990μs 4.0894μs 244.5362 KOps/s 233.5285 KOps/s $\color{#35bf28}+4.71\%$
test_membership_stacked_nested_leaf_last 39.6740μs 4.1194μs 242.7566 KOps/s 234.6394 KOps/s $\color{#35bf28}+3.46\%$
test_nested_getleaf 60.5730μs 10.7233μs 93.2549 KOps/s 93.3565 KOps/s $\color{#d91a1a}-0.11\%$
test_nested_get 61.9350μs 10.1395μs 98.6244 KOps/s 98.5584 KOps/s $\color{#35bf28}+0.07\%$
test_stacked_getleaf 49.6120μs 10.6650μs 93.7647 KOps/s 96.7057 KOps/s $\color{#d91a1a}-3.04\%$
test_stacked_get 61.2150μs 10.1229μs 98.7857 KOps/s 100.3080 KOps/s $\color{#d91a1a}-1.52\%$
test_nested_getitemleaf 57.9080μs 11.1709μs 89.5186 KOps/s 89.5671 KOps/s $\color{#d91a1a}-0.05\%$
test_nested_getitem 50.4840μs 10.3435μs 96.6793 KOps/s 97.3995 KOps/s $\color{#d91a1a}-0.74\%$
test_stacked_getitemleaf 61.0940μs 11.0654μs 90.3717 KOps/s 89.5453 KOps/s $\color{#35bf28}+0.92\%$
test_stacked_getitem 53.0090μs 10.2840μs 97.2383 KOps/s 96.6142 KOps/s $\color{#35bf28}+0.65\%$
test_lock_nested 58.5290ms 0.3947ms 2.5334 KOps/s 3.0245 KOps/s $\textbf{\color{#d91a1a}-16.24\%}$
test_lock_stack_nested 0.4022ms 0.3019ms 3.3122 KOps/s 3.3010 KOps/s $\color{#35bf28}+0.34\%$
test_unlock_nested 0.8331ms 0.3451ms 2.8978 KOps/s 2.9904 KOps/s $\color{#d91a1a}-3.10\%$
test_unlock_stack_nested 0.3946ms 0.3108ms 3.2177 KOps/s 3.2183 KOps/s $\color{#d91a1a}-0.02\%$
test_flatten_speed 0.2407ms 98.1613μs 10.1873 KOps/s 10.0424 KOps/s $\color{#35bf28}+1.44\%$
test_unflatten_speed 0.9796ms 0.4119ms 2.4277 KOps/s 2.4284 KOps/s $\color{#d91a1a}-0.03\%$
test_common_ops 4.2445ms 0.7147ms 1.3991 KOps/s 1.3083 KOps/s $\textbf{\color{#35bf28}+6.94\%}$
test_creation 45.8360μs 2.0033μs 499.1872 KOps/s 529.8916 KOps/s $\textbf{\color{#d91a1a}-5.79\%}$
test_creation_empty 43.7320μs 8.4716μs 118.0415 KOps/s 86.2007 KOps/s $\textbf{\color{#35bf28}+36.94\%}$
test_creation_nested_1 32.5800μs 11.0686μs 90.3454 KOps/s 70.7695 KOps/s $\textbf{\color{#35bf28}+27.66\%}$
test_creation_nested_2 44.6140μs 14.3059μs 69.9013 KOps/s 56.2706 KOps/s $\textbf{\color{#35bf28}+24.22\%}$
test_clone 0.1007ms 12.9799μs 77.0421 KOps/s 75.7758 KOps/s $\color{#35bf28}+1.67\%$
test_getitem[int] 40.2950μs 11.0441μs 90.5461 KOps/s 89.0083 KOps/s $\color{#35bf28}+1.73\%$
test_getitem[slice_int] 75.4810μs 21.5122μs 46.4852 KOps/s 44.6122 KOps/s $\color{#35bf28}+4.20\%$
test_getitem[range] 91.4600μs 61.5308μs 16.2520 KOps/s 16.7934 KOps/s $\color{#d91a1a}-3.22\%$
test_getitem[tuple] 56.0950μs 17.9559μs 55.6919 KOps/s 53.5622 KOps/s $\color{#35bf28}+3.98\%$
test_getitem[list] 0.1933ms 38.7438μs 25.8106 KOps/s 24.4895 KOps/s $\textbf{\color{#35bf28}+5.39\%}$
test_setitem_dim[int] 62.7670μs 30.8344μs 32.4313 KOps/s 28.9310 KOps/s $\textbf{\color{#35bf28}+12.10\%}$
test_setitem_dim[slice_int] 0.1060ms 58.3490μs 17.1383 KOps/s 16.5176 KOps/s $\color{#35bf28}+3.76\%$
test_setitem_dim[range] 0.1662ms 77.0283μs 12.9822 KOps/s 12.1386 KOps/s $\textbf{\color{#35bf28}+6.95\%}$
test_setitem_dim[tuple] 91.0500μs 46.3748μs 21.5634 KOps/s 20.0343 KOps/s $\textbf{\color{#35bf28}+7.63\%}$
test_setitem 90.9100μs 18.3561μs 54.4778 KOps/s 48.5663 KOps/s $\textbf{\color{#35bf28}+12.17\%}$
test_set 76.6530μs 18.4906μs 54.0815 KOps/s 49.2638 KOps/s $\textbf{\color{#35bf28}+9.78\%}$
test_set_shared 4.1408ms 0.1734ms 5.7670 KOps/s 5.8141 KOps/s $\color{#d91a1a}-0.81\%$
test_update 0.1786ms 20.0702μs 49.8250 KOps/s 41.2384 KOps/s $\textbf{\color{#35bf28}+20.82\%}$
test_update_nested 95.8280μs 28.7217μs 34.8169 KOps/s 29.9607 KOps/s $\textbf{\color{#35bf28}+16.21\%}$
test_update__nested 96.1590μs 25.7688μs 38.8066 KOps/s 37.7407 KOps/s $\color{#35bf28}+2.82\%$
test_set_nested 66.8850μs 19.9819μs 50.0453 KOps/s 45.5335 KOps/s $\textbf{\color{#35bf28}+9.91\%}$
test_set_nested_new 92.6930μs 24.6961μs 40.4922 KOps/s 37.9633 KOps/s $\textbf{\color{#35bf28}+6.66\%}$
test_select 0.1368ms 39.9279μs 25.0451 KOps/s 24.0085 KOps/s $\color{#35bf28}+4.32\%$
test_select_nested 1.0209ms 56.7750μs 17.6134 KOps/s 17.4105 KOps/s $\color{#35bf28}+1.17\%$
test_exclude_nested 0.2532ms 0.1173ms 8.5219 KOps/s 8.5207 KOps/s $\color{#35bf28}+0.01\%$
test_empty[True] 0.8890ms 0.3960ms 2.5250 KOps/s 2.5273 KOps/s $\color{#d91a1a}-0.09\%$
test_empty[False] 21.5822μs 1.0437μs 958.1726 KOps/s 974.0931 KOps/s $\color{#d91a1a}-1.63\%$
test_unbind_speed 0.3398ms 0.2473ms 4.0441 KOps/s 4.0465 KOps/s $\color{#d91a1a}-0.06\%$
test_unbind_speed_stack0 0.3940ms 0.2460ms 4.0655 KOps/s 4.0531 KOps/s $\color{#35bf28}+0.30\%$
test_unbind_speed_stack1 75.6163ms 0.7208ms 1.3874 KOps/s 1.4219 KOps/s $\color{#d91a1a}-2.43\%$
test_split 70.9983ms 1.6072ms 622.2019 Ops/s 638.2019 Ops/s $\color{#d91a1a}-2.51\%$
test_chunk 76.9778ms 1.6183ms 617.9220 Ops/s 639.7713 Ops/s $\color{#d91a1a}-3.42\%$
test_creation[device0] 0.2595ms 94.1328μs 10.6233 KOps/s 10.7145 KOps/s $\color{#d91a1a}-0.85\%$
test_creation_from_tensor 3.6444ms 97.3220μs 10.2752 KOps/s 10.1658 KOps/s $\color{#35bf28}+1.08\%$
test_add_one[memmap_tensor0] 99.7360μs 5.5393μs 180.5270 KOps/s 181.2974 KOps/s $\color{#d91a1a}-0.42\%$
test_contiguous[memmap_tensor0] 30.7170μs 0.6334μs 1.5787 MOps/s 1.5851 MOps/s $\color{#d91a1a}-0.40\%$
test_stack[memmap_tensor0] 56.3950μs 3.6236μs 275.9714 KOps/s 272.5539 KOps/s $\color{#35bf28}+1.25\%$
test_memmaptd_index 1.0577ms 0.2590ms 3.8615 KOps/s 3.8739 KOps/s $\color{#d91a1a}-0.32\%$
test_memmaptd_index_astensor 0.6014ms 0.3320ms 3.0124 KOps/s 3.0134 KOps/s $\color{#d91a1a}-0.03\%$
test_memmaptd_index_op 0.9666ms 0.5906ms 1.6933 KOps/s 1.5859 KOps/s $\textbf{\color{#35bf28}+6.77\%}$
test_serialize_model 0.2059s 0.1393s 7.1808 Ops/s 7.3327 Ops/s $\color{#d91a1a}-2.07\%$
test_serialize_model_pickle 0.4506s 0.3929s 2.5452 Ops/s 2.5504 Ops/s $\color{#d91a1a}-0.20\%$
test_serialize_weights 0.1261s 0.1198s 8.3460 Ops/s 8.1780 Ops/s $\color{#35bf28}+2.05\%$
test_serialize_weights_returnearly 0.2354s 0.1733s 5.7707 Ops/s 5.6576 Ops/s $\color{#35bf28}+2.00\%$
test_serialize_weights_pickle 0.5621s 0.4409s 2.2679 Ops/s 2.3840 Ops/s $\color{#d91a1a}-4.87\%$
test_serialize_weights_filesystem 0.1505s 0.1455s 6.8740 Ops/s 6.9520 Ops/s $\color{#d91a1a}-1.12\%$
test_serialize_model_filesystem 0.2142s 0.1572s 6.3616 Ops/s 6.0641 Ops/s $\color{#35bf28}+4.91\%$
test_reshape_pytree 69.2090μs 25.4504μs 39.2921 KOps/s 38.9584 KOps/s $\color{#35bf28}+0.86\%$
test_reshape_td 73.7470μs 34.0495μs 29.3690 KOps/s 29.8217 KOps/s $\color{#d91a1a}-1.52\%$
test_view_pytree 71.9940μs 25.5624μs 39.1199 KOps/s 39.4399 KOps/s $\color{#d91a1a}-0.81\%$
test_view_td 91.9110μs 38.6540μs 25.8706 KOps/s 26.3210 KOps/s $\color{#d91a1a}-1.71\%$
test_unbind_pytree 73.1260μs 29.0820μs 34.3855 KOps/s 34.1342 KOps/s $\color{#35bf28}+0.74\%$
test_unbind_td 0.3565ms 36.2976μs 27.5500 KOps/s 27.4298 KOps/s $\color{#35bf28}+0.44\%$
test_split_pytree 67.6870μs 29.2220μs 34.2208 KOps/s 33.7143 KOps/s $\color{#35bf28}+1.50\%$
test_split_td 0.1196ms 39.6989μs 25.1896 KOps/s 25.2190 KOps/s $\color{#d91a1a}-0.12\%$
test_add_pytree 79.2570μs 35.6855μs 28.0226 KOps/s 28.9154 KOps/s $\color{#d91a1a}-3.09\%$
test_add_td 0.1203ms 52.4468μs 19.0669 KOps/s 17.8140 KOps/s $\textbf{\color{#35bf28}+7.03\%}$
test_distributed 0.2871ms 0.1306ms 7.6567 KOps/s 7.4607 KOps/s $\color{#35bf28}+2.63\%$
test_tdmodule 79.6580μs 16.5809μs 60.3104 KOps/s 52.7499 KOps/s $\textbf{\color{#35bf28}+14.33\%}$
test_tdmodule_dispatch 61.4850μs 32.5359μs 30.7353 KOps/s 26.9508 KOps/s $\textbf{\color{#35bf28}+14.04\%}$
test_tdseq 44.2330μs 19.0596μs 52.4671 KOps/s 47.6347 KOps/s $\textbf{\color{#35bf28}+10.14\%}$
test_tdseq_dispatch 73.4570μs 36.6113μs 27.3140 KOps/s 24.3222 KOps/s $\textbf{\color{#35bf28}+12.30\%}$
test_instantiation_functorch 1.6229ms 1.3161ms 759.8412 Ops/s 759.7014 Ops/s $\color{#35bf28}+0.02\%$
test_instantiation_td 1.8260ms 1.0213ms 979.1352 Ops/s 950.0655 Ops/s $\color{#35bf28}+3.06\%$
test_exec_functorch 0.2634ms 0.1673ms 5.9767 KOps/s 6.2301 KOps/s $\color{#d91a1a}-4.07\%$
test_exec_functional_call 0.2966ms 0.1533ms 6.5238 KOps/s 6.6873 KOps/s $\color{#d91a1a}-2.45\%$
test_exec_td 0.2625ms 0.1536ms 6.5118 KOps/s 6.6558 KOps/s $\color{#d91a1a}-2.16\%$
test_exec_td_decorator 0.5199ms 0.2292ms 4.3629 KOps/s 4.4716 KOps/s $\color{#d91a1a}-2.43\%$
test_vmap_mlp_speed[True-True] 0.8301ms 0.5068ms 1.9730 KOps/s 2.0250 KOps/s $\color{#d91a1a}-2.57\%$
test_vmap_mlp_speed[True-False] 0.7933ms 0.4955ms 2.0183 KOps/s 2.0497 KOps/s $\color{#d91a1a}-1.53\%$
test_vmap_mlp_speed[False-True] 0.6483ms 0.4081ms 2.4505 KOps/s 2.5179 KOps/s $\color{#d91a1a}-2.68\%$
test_vmap_mlp_speed[False-False] 0.7317ms 0.4104ms 2.4369 KOps/s 2.5231 KOps/s $\color{#d91a1a}-3.42\%$
test_vmap_mlp_speed_decorator[True-True] 1.1226ms 0.5673ms 1.7628 KOps/s 1.7754 KOps/s $\color{#d91a1a}-0.71\%$
test_vmap_mlp_speed_decorator[True-False] 0.8237ms 0.5662ms 1.7662 KOps/s 1.7816 KOps/s $\color{#d91a1a}-0.86\%$
test_vmap_mlp_speed_decorator[False-True] 0.7682ms 0.4735ms 2.1119 KOps/s 2.1708 KOps/s $\color{#d91a1a}-2.71\%$
test_vmap_mlp_speed_decorator[False-False] 0.7714ms 0.4776ms 2.0939 KOps/s 2.1556 KOps/s $\color{#d91a1a}-2.87\%$
test_to_module_speed[True] 2.4202ms 1.6878ms 592.4826 Ops/s 588.2228 Ops/s $\color{#35bf28}+0.72\%$
test_to_module_speed[False] 2.4243ms 1.6544ms 604.4663 Ops/s 606.9532 Ops/s $\color{#d91a1a}-0.41\%$
test_tc_init 98.0330μs 52.1231μs 19.1853 KOps/s 16.4641 KOps/s $\textbf{\color{#35bf28}+16.53\%}$
test_tc_init_nested 0.1974ms 0.1028ms 9.7239 KOps/s 8.4577 KOps/s $\textbf{\color{#35bf28}+14.97\%}$
test_tc_first_layer_tensor 33.6420μs 8.2136μs 121.7495 KOps/s 121.1492 KOps/s $\color{#35bf28}+0.50\%$
test_tc_first_layer_nontensor 30.9480μs 8.1501μs 122.6975 KOps/s 121.9920 KOps/s $\color{#35bf28}+0.58\%$
test_tc_second_layer_tensor 23.3530μs 2.4801μs 403.2128 KOps/s 404.8420 KOps/s $\color{#d91a1a}-0.40\%$
test_tc_second_layer_nontensor 52.0580μs 9.2074μs 108.6083 KOps/s 108.1362 KOps/s $\color{#35bf28}+0.44\%$
test_unbind 0.1006s 15.7518ms 63.4849 Ops/s 63.7622 Ops/s $\color{#d91a1a}-0.43\%$
test_full_like 10.9012ms 8.3555ms 119.6820 Ops/s 130.2116 Ops/s $\textbf{\color{#d91a1a}-8.09\%}$
test_zeros_like 11.2812ms 7.4928ms 133.4611 Ops/s 134.7485 Ops/s $\color{#d91a1a}-0.96\%$
test_ones_like 16.8291ms 7.8448ms 127.4731 Ops/s 126.4381 Ops/s $\color{#35bf28}+0.82\%$
test_clone 17.4082ms 9.8104ms 101.9329 Ops/s 105.0471 Ops/s $\color{#d91a1a}-2.96\%$
test_squeeze 94.6870μs 12.5192μs 79.8774 KOps/s 77.9966 KOps/s $\color{#35bf28}+2.41\%$
test_unsqueeze 0.1698ms 95.9130μs 10.4261 KOps/s 10.1815 KOps/s $\color{#35bf28}+2.40\%$
test_split 0.5071ms 0.2736ms 3.6556 KOps/s 3.5870 KOps/s $\color{#35bf28}+1.91\%$
test_permute 0.4220ms 0.2188ms 4.5709 KOps/s 4.4600 KOps/s $\color{#35bf28}+2.49\%$
test_stack 29.9480ms 25.6367ms 39.0066 Ops/s 38.7831 Ops/s $\color{#35bf28}+0.58\%$
test_cat 34.5386ms 25.0651ms 39.8961 Ops/s 38.6804 Ops/s $\color{#35bf28}+3.14\%$
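
The Change column appears to be the relative difference between the Ops and Ops on Repo HEAD columns, and the rows counted as Improved or Worsened seem to be the bolded ones, whose change exceeds roughly 5% in magnitude. A minimal Python sketch of that arithmetic follows; the function names and the 5% threshold are assumptions inferred from the table values, not taken from the benchmark workflow itself.

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Relative throughput change of the PR against repo HEAD, in percent."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption inferred from the table."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) in KOps/s, copied from three rows of the CPU table.
rows = {
    "test_plain_set_nested": (64.0071, 57.5041),
    "test_lock_nested": (2.5334, 3.0245),
    "test_items": (384.5261, 373.5889),
}

for name, (ops_pr, ops_head) in rows.items():
    change = relative_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Under this reading, a positive change means higher throughput on the PR branch than on HEAD. Swings of opposite sign for the same test on CPU and GPU (for example test_creation_empty) suggest that a good part of the variation is runner noise rather than an effect of a CI-only change.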


github-actions bot commented Jul 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 152. Improved: $\large\color{#35bf28}5$. Worsened: $\large\color{#d91a1a}26$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 28.1920μs 13.1086μs 76.2859 KOps/s 86.7898 KOps/s $\textbf{\color{#d91a1a}-12.10\%}$
test_plain_set_stack_nested 28.1710μs 13.3632μs 74.8326 KOps/s 86.4818 KOps/s $\textbf{\color{#d91a1a}-13.47\%}$
test_plain_set_nested_inplace 36.7620μs 14.4593μs 69.1598 KOps/s 77.7195 KOps/s $\textbf{\color{#d91a1a}-11.01\%}$
test_plain_set_stack_nested_inplace 38.7920μs 14.4418μs 69.2435 KOps/s 77.5039 KOps/s $\textbf{\color{#d91a1a}-10.66\%}$
test_items 17.1510μs 4.7428μs 210.8474 KOps/s 211.4929 KOps/s $\color{#d91a1a}-0.31\%$
test_items_nested 0.3676ms 0.3350ms 2.9850 KOps/s 2.9009 KOps/s $\color{#35bf28}+2.90\%$
test_items_nested_locked 0.3708ms 0.3427ms 2.9183 KOps/s 2.9094 KOps/s $\color{#35bf28}+0.31\%$
test_items_nested_leaf 0.1077ms 83.4477μs 11.9836 KOps/s 12.1405 KOps/s $\color{#d91a1a}-1.29\%$
test_items_stack_nested 0.3634ms 0.3377ms 2.9615 KOps/s 2.9446 KOps/s $\color{#35bf28}+0.58\%$
test_items_stack_nested_leaf 0.1072ms 82.7093μs 12.0905 KOps/s 12.0043 KOps/s $\color{#35bf28}+0.72\%$
test_items_stack_nested_locked 0.3817ms 0.3406ms 2.9360 KOps/s 2.9160 KOps/s $\color{#35bf28}+0.69\%$
test_keys 20.0710μs 4.3715μs 228.7560 KOps/s 228.3713 KOps/s $\color{#35bf28}+0.17\%$
test_keys_nested 89.1760μs 69.9891μs 14.2879 KOps/s 14.5388 KOps/s $\color{#d91a1a}-1.73\%$
test_keys_nested_locked 2.3928ms 75.6624μs 13.2166 KOps/s 13.3030 KOps/s $\color{#d91a1a}-0.65\%$
test_keys_nested_leaf 76.1660μs 60.5781μs 16.5076 KOps/s 16.5892 KOps/s $\color{#d91a1a}-0.49\%$
test_keys_stack_nested 90.1950μs 68.0514μs 14.6948 KOps/s 14.7117 KOps/s $\color{#d91a1a}-0.11\%$
test_keys_stack_nested_leaf 75.2940μs 57.6110μs 17.3578 KOps/s 17.2692 KOps/s $\color{#35bf28}+0.51\%$
test_keys_stack_nested_locked 95.6850μs 72.9613μs 13.7059 KOps/s 13.7666 KOps/s $\color{#d91a1a}-0.44\%$
test_values 8.0173μs 1.8017μs 555.0369 KOps/s 550.8679 KOps/s $\color{#35bf28}+0.76\%$
test_values_nested 59.0350μs 35.8496μs 27.8943 KOps/s 28.6676 KOps/s $\color{#d91a1a}-2.70\%$
test_values_nested_locked 53.3430μs 37.4623μs 26.6935 KOps/s 27.0288 KOps/s $\color{#d91a1a}-1.24\%$
test_values_nested_leaf 50.7530μs 31.6860μs 31.5596 KOps/s 32.0782 KOps/s $\color{#d91a1a}-1.62\%$
test_values_stack_nested 60.1530μs 36.5655μs 27.3482 KOps/s 27.8156 KOps/s $\color{#d91a1a}-1.68\%$
test_values_stack_nested_leaf 53.3630μs 32.0987μs 31.1539 KOps/s 31.6235 KOps/s $\color{#d91a1a}-1.48\%$
test_values_stack_nested_locked 56.6930μs 38.3917μs 26.0473 KOps/s 26.6625 KOps/s $\color{#d91a1a}-2.31\%$
test_membership 2.3302μs 0.7046μs 1.4192 MOps/s 1.4069 MOps/s $\color{#35bf28}+0.87\%$
test_membership_nested 19.7410μs 2.5966μs 385.1234 KOps/s 379.0871 KOps/s $\color{#35bf28}+1.59\%$
test_membership_nested_leaf 30.5710μs 2.6143μs 382.5095 KOps/s 380.1982 KOps/s $\color{#35bf28}+0.61\%$
test_membership_stacked_nested 20.9610μs 2.6158μs 382.2868 KOps/s 381.9352 KOps/s $\color{#35bf28}+0.09\%$
test_membership_stacked_nested_leaf 21.2810μs 2.6403μs 378.7478 KOps/s 383.2376 KOps/s $\color{#d91a1a}-1.17\%$
test_membership_nested_last 34.4920μs 3.1627μs 316.1812 KOps/s 317.3689 KOps/s $\color{#d91a1a}-0.37\%$
test_membership_nested_leaf_last 20.1220μs 3.1636μs 316.0917 KOps/s 317.8665 KOps/s $\color{#d91a1a}-0.56\%$
test_membership_stacked_nested_last 39.8320μs 9.8299μs 101.7300 KOps/s 101.8331 KOps/s $\color{#d91a1a}-0.10\%$
test_membership_stacked_nested_leaf_last 27.2120μs 9.8007μs 102.0336 KOps/s 101.8386 KOps/s $\color{#35bf28}+0.19\%$
test_nested_getleaf 33.1320μs 8.4256μs 118.6857 KOps/s 119.2715 KOps/s $\color{#d91a1a}-0.49\%$
test_nested_get 29.1820μs 7.9377μs 125.9815 KOps/s 127.4818 KOps/s $\color{#d91a1a}-1.18\%$
test_stacked_getleaf 94.4450μs 8.4251μs 118.6929 KOps/s 119.7146 KOps/s $\color{#d91a1a}-0.85\%$
test_stacked_get 36.4520μs 7.9075μs 126.4622 KOps/s 127.5277 KOps/s $\color{#d91a1a}-0.84\%$
test_nested_getitemleaf 25.0610μs 8.6307μs 115.8661 KOps/s 117.7888 KOps/s $\color{#d91a1a}-1.63\%$
test_nested_getitem 31.4610μs 8.1297μs 123.0056 KOps/s 124.7965 KOps/s $\color{#d91a1a}-1.44\%$
test_stacked_getitemleaf 33.0520μs 8.6142μs 116.0870 KOps/s 117.5255 KOps/s $\color{#d91a1a}-1.22\%$
test_stacked_getitem 23.8320μs 8.0730μs 123.8704 KOps/s 123.8316 KOps/s $\color{#35bf28}+0.03\%$
test_lock_nested 58.0382ms 0.3975ms 2.5156 KOps/s 2.5106 KOps/s $\color{#35bf28}+0.20\%$
test_lock_stack_nested 0.3208ms 0.2885ms 3.4660 KOps/s 3.4729 KOps/s $\color{#d91a1a}-0.20\%$
test_unlock_nested 59.7712ms 0.3991ms 2.5059 KOps/s 2.5089 KOps/s $\color{#d91a1a}-0.12\%$
test_unlock_stack_nested 0.3185ms 0.2966ms 3.3719 KOps/s 3.3765 KOps/s $\color{#d91a1a}-0.14\%$
test_flatten_speed 0.3700ms 0.1010ms 9.9023 KOps/s 9.6911 KOps/s $\color{#35bf28}+2.18\%$
test_unflatten_speed 0.3360ms 0.2905ms 3.4429 KOps/s 3.4445 KOps/s $\color{#d91a1a}-0.04\%$
test_common_ops 1.0407ms 0.5849ms 1.7096 KOps/s 1.8261 KOps/s $\textbf{\color{#d91a1a}-6.38\%}$
test_creation 15.3600μs 1.6505μs 605.8721 KOps/s 621.3168 KOps/s $\color{#d91a1a}-2.49\%$
test_creation_empty 43.1820μs 8.9268μs 112.0218 KOps/s 159.7915 KOps/s $\textbf{\color{#d91a1a}-29.90\%}$
test_creation_nested_1 50.6630μs 10.6144μs 94.2113 KOps/s 123.8357 KOps/s $\textbf{\color{#d91a1a}-23.92\%}$
test_creation_nested_2 41.8220μs 13.0626μs 76.5544 KOps/s 97.2137 KOps/s $\textbf{\color{#d91a1a}-21.25\%}$
test_clone 88.0650μs 11.5010μs 86.9489 KOps/s 86.1439 KOps/s $\color{#35bf28}+0.93\%$
test_getitem[int] 25.7810μs 10.7298μs 93.1986 KOps/s 95.2596 KOps/s $\color{#d91a1a}-2.16\%$
test_getitem[slice_int] 41.4030μs 20.2988μs 49.2640 KOps/s 49.8660 KOps/s $\color{#d91a1a}-1.21\%$
test_getitem[range] 67.5340μs 49.6386μs 20.1456 KOps/s 20.0356 KOps/s $\color{#35bf28}+0.55\%$
test_getitem[tuple] 41.0330μs 18.1829μs 54.9966 KOps/s 53.9935 KOps/s $\color{#35bf28}+1.86\%$
test_getitem[list] 0.1359ms 33.8223μs 29.5663 KOps/s 29.2304 KOps/s $\color{#35bf28}+1.15\%$
test_setitem_dim[int] 42.9420μs 26.7999μs 37.3136 KOps/s 39.3371 KOps/s $\textbf{\color{#d91a1a}-5.14\%}$
test_setitem_dim[slice_int] 76.6440μs 48.6466μs 20.5564 KOps/s 21.5512 KOps/s $\color{#d91a1a}-4.62\%$
test_setitem_dim[range] 89.9950μs 66.1902μs 15.1080 KOps/s 15.3939 KOps/s $\color{#d91a1a}-1.86\%$
test_setitem_dim[tuple] 61.7240μs 42.3210μs 23.6290 KOps/s 24.8147 KOps/s $\color{#d91a1a}-4.78\%$
test_setitem 49.1930μs 16.5691μs 60.3533 KOps/s 66.3305 KOps/s $\textbf{\color{#d91a1a}-9.01\%}$
test_set 52.1930μs 16.2166μs 61.6652 KOps/s 70.0068 KOps/s $\textbf{\color{#d91a1a}-11.92\%}$
test_set_shared 1.8691ms 0.1002ms 9.9780 KOps/s 10.0497 KOps/s $\color{#d91a1a}-0.71\%$
test_update 87.5750μs 18.9583μs 52.7473 KOps/s 62.1502 KOps/s $\textbf{\color{#d91a1a}-15.13\%}$
test_update_nested 80.2850μs 23.9478μs 41.7575 KOps/s 46.6941 KOps/s $\textbf{\color{#d91a1a}-10.57\%}$
test_update__nested 62.9530μs 21.6909μs 46.1023 KOps/s 45.4328 KOps/s $\color{#35bf28}+1.47\%$
test_set_nested 63.9040μs 16.9435μs 59.0198 KOps/s 64.1595 KOps/s $\textbf{\color{#d91a1a}-8.01\%}$
test_set_nested_new 60.1030μs 19.5462μs 51.1609 KOps/s 54.4770 KOps/s $\textbf{\color{#d91a1a}-6.09\%}$
test_select 68.6440μs 32.2558μs 31.0022 KOps/s 31.6657 KOps/s $\color{#d91a1a}-2.10\%$
test_select_nested 0.6297ms 53.1755μs 18.8056 KOps/s 19.3210 KOps/s $\color{#d91a1a}-2.67\%$
test_exclude_nested 0.1479ms 0.1077ms 9.2842 KOps/s 9.3070 KOps/s $\color{#d91a1a}-0.25\%$
test_empty[True] 0.3693ms 0.3430ms 2.9156 KOps/s 2.9301 KOps/s $\color{#d91a1a}-0.50\%$
test_empty[False] 2.2001μs 0.8080μs 1.2376 MOps/s 1.2447 MOps/s $\color{#d91a1a}-0.58\%$
test_to 91.0650μs 61.2998μs 16.3133 KOps/s 16.8763 KOps/s $\color{#d91a1a}-3.34\%$
test_to_nonblocking 64.8340μs 36.5905μs 27.3295 KOps/s 27.3925 KOps/s $\color{#d91a1a}-0.23\%$
test_unbind_speed 0.3022ms 0.2573ms 3.8858 KOps/s 3.8651 KOps/s $\color{#35bf28}+0.53\%$
test_unbind_speed_stack0 0.3123ms 0.2530ms 3.9522 KOps/s 3.9408 KOps/s $\color{#35bf28}+0.29\%$
test_unbind_speed_stack1 75.3551ms 0.7806ms 1.2811 KOps/s 1.2800 KOps/s $\color{#35bf28}+0.09\%$
test_split 75.9541ms 1.6277ms 614.3793 Ops/s 610.6614 Ops/s $\color{#35bf28}+0.61\%$
test_chunk 75.6327ms 1.6213ms 616.7792 Ops/s 612.5219 Ops/s $\color{#35bf28}+0.70\%$
test_creation[device0] 0.1210ms 57.1601μs 17.4947 KOps/s 17.2623 KOps/s $\color{#35bf28}+1.35\%$
test_creation_from_tensor 0.1272ms 53.1019μs 18.8317 KOps/s 17.9220 KOps/s $\textbf{\color{#35bf28}+5.08\%}$
test_add_one[memmap_tensor0] 69.9640μs 7.6439μs 130.8233 KOps/s 130.0260 KOps/s $\color{#35bf28}+0.61\%$
test_contiguous[memmap_tensor0] 12.4610μs 0.6725μs 1.4869 MOps/s 1.4774 MOps/s $\color{#35bf28}+0.64\%$
test_stack[memmap_tensor0] 20.5210μs 4.8205μs 207.4491 KOps/s 204.2079 KOps/s $\color{#35bf28}+1.59\%$
test_memmaptd_index 1.0967ms 0.2785ms 3.5902 KOps/s 3.5378 KOps/s $\color{#35bf28}+1.48\%$
test_memmaptd_index_astensor 0.5970ms 0.3402ms 2.9397 KOps/s 2.9024 KOps/s $\color{#35bf28}+1.29\%$
test_memmaptd_index_op 0.9596ms 0.6532ms 1.5309 KOps/s 1.6196 KOps/s $\textbf{\color{#d91a1a}-5.48\%}$
test_serialize_model 94.5110ms 90.6665ms 11.0294 Ops/s 10.3628 Ops/s $\textbf{\color{#35bf28}+6.43\%}$
test_serialize_model_pickle 1.3496s 1.2356s 0.8093 Ops/s 0.8086 Ops/s $\color{#35bf28}+0.10\%$
test_serialize_weights 91.9345ms 88.2938ms 11.3258 Ops/s 9.7649 Ops/s $\textbf{\color{#35bf28}+15.99\%}$
test_serialize_weights_returnearly 0.1762s 72.5662ms 13.7805 Ops/s 13.6401 Ops/s $\color{#35bf28}+1.03\%$
test_serialize_weights_pickle 1.3506s 1.2483s 0.8011 Ops/s 0.7960 Ops/s $\color{#35bf28}+0.64\%$
test_reshape_pytree 54.9430μs 26.5998μs 37.5943 KOps/s 37.7774 KOps/s $\color{#d91a1a}-0.48\%$
test_reshape_td 58.0440μs 31.9457μs 31.3031 KOps/s 31.3527 KOps/s $\color{#d91a1a}-0.16\%$
test_view_pytree 50.8020μs 26.1199μs 38.2850 KOps/s 38.2000 KOps/s $\color{#35bf28}+0.22\%$
test_view_td 66.2550μs 36.9628μs 27.0542 KOps/s 26.9256 KOps/s $\color{#35bf28}+0.48\%$
test_unbind_pytree 54.3330μs 31.6921μs 31.5536 KOps/s 31.3835 KOps/s $\color{#35bf28}+0.54\%$
test_unbind_td 0.4613ms 40.4020μs 24.7512 KOps/s 24.5888 KOps/s $\color{#35bf28}+0.66\%$
test_split_pytree 54.7130μs 35.1976μs 28.4111 KOps/s 28.7241 KOps/s $\color{#d91a1a}-1.09\%$
test_split_td 0.1031ms 37.7783μs 26.4702 KOps/s 26.4311 KOps/s $\color{#35bf28}+0.15\%$
test_add_pytree 58.5030μs 38.9544μs 25.6711 KOps/s 25.3365 KOps/s $\color{#35bf28}+1.32\%$
test_add_td 86.6940μs 52.4841μs 19.0534 KOps/s 19.9774 KOps/s $\color{#d91a1a}-4.63\%$
test_distributed 2.1092ms 87.4346μs 11.4371 KOps/s 14.2344 KOps/s $\textbf{\color{#d91a1a}-19.65\%}$
test_tdmodule 33.5310μs 16.0959μs 62.1278 KOps/s 73.7799 KOps/s $\textbf{\color{#d91a1a}-15.79\%}$
test_tdmodule_dispatch 55.5030μs 31.6077μs 31.6378 KOps/s 37.8145 KOps/s $\textbf{\color{#d91a1a}-16.33\%}$
test_tdseq 33.3420μs 17.4403μs 57.3384 KOps/s 67.2187 KOps/s $\textbf{\color{#d91a1a}-14.70\%}$
test_tdseq_dispatch 51.7920μs 34.4240μs 29.0495 KOps/s 34.3330 KOps/s $\textbf{\color{#d91a1a}-15.39\%}$
test_instantiation_functorch 1.5051ms 1.4024ms 713.0789 Ops/s 714.8022 Ops/s $\color{#d91a1a}-0.24\%$
test_instantiation_td 78.6640ms 1.0698ms 934.7520 Ops/s 930.5434 Ops/s $\color{#35bf28}+0.45\%$
test_exec_functorch 0.2161ms 0.1484ms 6.7399 KOps/s 6.6670 KOps/s $\color{#35bf28}+1.09\%$
test_exec_functional_call 0.1823ms 0.1388ms 7.2070 KOps/s 7.0630 KOps/s $\color{#35bf28}+2.04\%$
test_exec_td 0.1795ms 0.1356ms 7.3773 KOps/s 7.2721 KOps/s $\color{#35bf28}+1.45\%$
test_exec_td_decorator 0.8059ms 0.2075ms 4.8195 KOps/s 4.8168 KOps/s $\color{#35bf28}+0.05\%$
test_vmap_mlp_speed[True-True] 0.7223ms 0.5832ms 1.7146 KOps/s 1.7394 KOps/s $\color{#d91a1a}-1.43\%$
test_vmap_mlp_speed[True-False] 0.7714ms 0.5787ms 1.7281 KOps/s 1.7424 KOps/s $\color{#d91a1a}-0.82\%$
test_vmap_mlp_speed[False-True] 0.5687ms 0.5088ms 1.9654 KOps/s 1.9645 KOps/s $\color{#35bf28}+0.04\%$
test_vmap_mlp_speed[False-False] 0.6310ms 0.5150ms 1.9418 KOps/s 1.9704 KOps/s $\color{#d91a1a}-1.45\%$
test_vmap_mlp_speed_decorator[True-True] 94.2888ms 0.7317ms 1.3667 KOps/s 1.2638 KOps/s $\textbf{\color{#35bf28}+8.14\%}$
test_vmap_mlp_speed_decorator[True-False] 0.7599ms 0.6438ms 1.5533 KOps/s 1.5605 KOps/s $\color{#d91a1a}-0.46\%$
test_vmap_mlp_speed_decorator[False-True] 0.7370ms 0.5837ms 1.7132 KOps/s 1.7607 KOps/s $\color{#d91a1a}-2.70\%$
test_vmap_mlp_speed_decorator[False-False] 0.6958ms 0.5670ms 1.7637 KOps/s 1.7598 KOps/s $\color{#35bf28}+0.22\%$
test_vmap_transformer_speed[True-True] 8.1804ms 7.8094ms 128.0503 Ops/s 130.3549 Ops/s $\color{#d91a1a}-1.77\%$
test_vmap_transformer_speed[True-False] 8.0439ms 7.7512ms 129.0116 Ops/s 130.6674 Ops/s $\color{#d91a1a}-1.27\%$
test_vmap_transformer_speed[False-True] 7.7169ms 7.5896ms 131.7585 Ops/s 130.8485 Ops/s $\color{#35bf28}+0.70\%$
test_vmap_transformer_speed[False-False] 7.8981ms 7.5961ms 131.6469 Ops/s 130.5862 Ops/s $\color{#35bf28}+0.81\%$
test_vmap_transformer_speed_decorator[True-True] 19.3323ms 18.6887ms 53.5082 Ops/s 53.5670 Ops/s $\color{#d91a1a}-0.11\%$
test_vmap_transformer_speed_decorator[True-False] 19.3222ms 18.6363ms 53.6586 Ops/s 53.6039 Ops/s $\color{#35bf28}+0.10\%$
test_vmap_transformer_speed_decorator[False-True] 18.7107ms 18.5409ms 53.9350 Ops/s 53.7002 Ops/s $\color{#35bf28}+0.44\%$
test_vmap_transformer_speed_decorator[False-False] 19.2803ms 18.6162ms 53.7166 Ops/s 53.9660 Ops/s $\color{#d91a1a}-0.46\%$
test_to_module_speed[True] 2.4251ms 1.5036ms 665.0677 Ops/s 673.6730 Ops/s $\color{#d91a1a}-1.28\%$
test_to_module_speed[False] 1.5862ms 1.4755ms 677.7271 Ops/s 682.4586 Ops/s $\color{#d91a1a}-0.69\%$
test_tc_init 77.3040μs 55.9290μs 17.8798 KOps/s 21.1002 KOps/s $\textbf{\color{#d91a1a}-15.26\%}$
test_tc_init_nested 0.1387ms 0.1117ms 8.9551 KOps/s 10.4090 KOps/s $\textbf{\color{#d91a1a}-13.97\%}$
test_tc_first_layer_tensor 19.7410μs 3.6684μs 272.5952 KOps/s 269.2817 KOps/s $\color{#35bf28}+1.23\%$
test_tc_first_layer_nontensor 23.7010μs 3.7046μs 269.9375 KOps/s 265.5087 KOps/s $\color{#35bf28}+1.67\%$
test_tc_second_layer_tensor 16.2710μs 1.2680μs 788.6214 KOps/s 849.4965 KOps/s $\textbf{\color{#d91a1a}-7.17\%}$
test_tc_second_layer_nontensor 27.2610μs 4.2375μs 235.9909 KOps/s 234.5332 KOps/s $\color{#35bf28}+0.62\%$
test_unbind 0.1108s 13.8201ms 72.3584 Ops/s 66.8213 Ops/s $\textbf{\color{#35bf28}+8.29\%}$
test_full_like 14.4315ms 13.8134ms 72.3936 Ops/s 103.7252 Ops/s $\textbf{\color{#d91a1a}-30.21\%}$
test_zeros_like 8.6206ms 8.0063ms 124.9009 Ops/s 125.1378 Ops/s $\color{#d91a1a}-0.19\%$
test_ones_like 0.1034s 8.5027ms 117.6090 Ops/s 124.3366 Ops/s $\textbf{\color{#d91a1a}-5.41\%}$
test_clone 10.1076ms 9.6679ms 103.4347 Ops/s 103.1578 Ops/s $\color{#35bf28}+0.27\%$
test_squeeze 64.3030μs 10.7954μs 92.6321 KOps/s 92.9269 KOps/s $\color{#d91a1a}-0.32\%$
test_unsqueeze 0.1415ms 87.9811μs 11.3661 KOps/s 11.7469 KOps/s $\color{#d91a1a}-3.24\%$
test_split 3.4454ms 3.1632ms 316.1307 Ops/s 323.6583 Ops/s $\color{#d91a1a}-2.33\%$
test_permute 0.2630ms 0.2053ms 4.8717 KOps/s 4.9326 KOps/s $\color{#d91a1a}-1.23\%$
test_stack 28.8851ms 27.9357ms 35.7965 Ops/s 35.5650 Ops/s $\color{#35bf28}+0.65\%$
test_cat 28.4016ms 27.4755ms 36.3961 Ops/s 36.0620 Ops/s $\color{#35bf28}+0.93\%$

@vmoens added the CI label Jul 9, 2024
@vmoens merged commit 222e126 into main Jul 9, 2024
25 of 34 checks passed
@vmoens deleted the fix-nightly branch July 9, 2024 12:13
Labels: CI, CLA Signed
2 participants