[Refactor] Fix property handling in TC #1178

Merged: 2 commits, Jan 10, 2025

vmoens (Contributor) commented Jan 10, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Jan 10, 2025
ghstack-source-id: ad564f84401bb619d0a6163c352d3de43ea24437
Pull Request resolved: #1178
facebook-github-bot added the CLA Signed label Jan 10, 2025

github-actions bot commented Jan 10, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}15$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 55.7840μs 21.1371μs 47.3101 KOps/s 48.8291 KOps/s $\color{#d91a1a}-3.11\%$
test_plain_set_stack_nested 60.0010μs 21.1479μs 47.2860 KOps/s 48.7623 KOps/s $\color{#d91a1a}-3.03\%$
test_plain_set_nested_inplace 61.5840μs 22.8868μs 43.6934 KOps/s 45.2845 KOps/s $\color{#d91a1a}-3.51\%$
test_plain_set_stack_nested_inplace 86.2300μs 23.1803μs 43.1401 KOps/s 45.4033 KOps/s $\color{#d91a1a}-4.98\%$
test_items 23.6440μs 4.2871μs 233.2585 KOps/s 232.2122 KOps/s $\color{#35bf28}+0.45\%$
test_items_nested 0.5340ms 0.3937ms 2.5398 KOps/s 2.5352 KOps/s $\color{#35bf28}+0.18\%$
test_items_nested_locked 0.6050ms 0.3930ms 2.5443 KOps/s 2.5317 KOps/s $\color{#35bf28}+0.50\%$
test_items_nested_leaf 0.1748ms 76.8172μs 13.0179 KOps/s 13.0947 KOps/s $\color{#d91a1a}-0.59\%$
test_items_stack_nested 0.9122ms 0.3987ms 2.5083 KOps/s 2.5255 KOps/s $\color{#d91a1a}-0.68\%$
test_items_stack_nested_leaf 0.1499ms 77.9788μs 12.8240 KOps/s 13.2199 KOps/s $\color{#d91a1a}-2.99\%$
test_items_stack_nested_locked 0.7348ms 0.3950ms 2.5314 KOps/s 2.5532 KOps/s $\color{#d91a1a}-0.85\%$
test_keys 24.7160μs 3.4914μs 286.4220 KOps/s 284.0352 KOps/s $\color{#35bf28}+0.84\%$
test_keys_nested 0.2472ms 0.1623ms 6.1620 KOps/s 6.0388 KOps/s $\color{#35bf28}+2.04\%$
test_keys_nested_locked 0.8728ms 0.1686ms 5.9306 KOps/s 5.8010 KOps/s $\color{#35bf28}+2.23\%$
test_keys_nested_leaf 0.2241ms 0.1415ms 7.0653 KOps/s 6.9266 KOps/s $\color{#35bf28}+2.00\%$
test_keys_stack_nested 0.2576ms 0.1620ms 6.1713 KOps/s 6.0245 KOps/s $\color{#35bf28}+2.44\%$
test_keys_stack_nested_leaf 0.2291ms 0.1419ms 7.0471 KOps/s 6.9280 KOps/s $\color{#35bf28}+1.72\%$
test_keys_stack_nested_locked 0.2651ms 0.1692ms 5.9098 KOps/s 5.7807 KOps/s $\color{#35bf28}+2.23\%$
test_values 12.2182μs 1.0316μs 969.3534 KOps/s 968.7487 KOps/s $\color{#35bf28}+0.06\%$
test_values_nested 0.1141ms 62.0899μs 16.1057 KOps/s 15.7492 KOps/s $\color{#35bf28}+2.26\%$
test_values_nested_locked 0.1142ms 62.0401μs 16.1186 KOps/s 15.8751 KOps/s $\color{#35bf28}+1.53\%$
test_values_nested_leaf 0.1223ms 70.5280μs 14.1788 KOps/s 12.7146 KOps/s $\textbf{\color{#35bf28}+11.52\%}$
test_values_stack_nested 0.1165ms 62.7806μs 15.9285 KOps/s 15.8772 KOps/s $\color{#35bf28}+0.32\%$
test_values_stack_nested_leaf 0.1255ms 70.9429μs 14.0958 KOps/s 13.6993 KOps/s $\color{#35bf28}+2.89\%$
test_values_stack_nested_locked 0.1100ms 62.8878μs 15.9013 KOps/s 15.8677 KOps/s $\color{#35bf28}+0.21\%$
test_membership 18.4040μs 0.8740μs 1.1441 MOps/s 1.4018 MOps/s $\textbf{\color{#d91a1a}-18.38\%}$
test_membership_nested 44.1620μs 2.9951μs 333.8741 KOps/s 343.1624 KOps/s $\color{#d91a1a}-2.71\%$
test_membership_nested_leaf 42.1280μs 3.0282μs 330.2275 KOps/s 335.3976 KOps/s $\color{#d91a1a}-1.54\%$
test_membership_stacked_nested 30.2660μs 2.9683μs 336.8879 KOps/s 346.6637 KOps/s $\color{#d91a1a}-2.82\%$
test_membership_stacked_nested_leaf 24.7160μs 3.0253μs 330.5475 KOps/s 324.8417 KOps/s $\color{#35bf28}+1.76\%$
test_membership_nested_last 0.1015ms 4.5539μs 219.5938 KOps/s 231.2102 KOps/s $\textbf{\color{#d91a1a}-5.02\%}$
test_membership_nested_leaf_last 0.1343ms 4.5978μs 217.4943 KOps/s 232.0835 KOps/s $\textbf{\color{#d91a1a}-6.29\%}$
test_membership_stacked_nested_last 42.2080μs 4.3930μs 227.6336 KOps/s 232.2085 KOps/s $\color{#d91a1a}-1.97\%$
test_membership_stacked_nested_leaf_last 34.4840μs 4.3632μs 229.1902 KOps/s 231.3675 KOps/s $\color{#d91a1a}-0.94\%$
test_nested_getleaf 39.5030μs 10.6598μs 93.8107 KOps/s 95.1380 KOps/s $\color{#d91a1a}-1.40\%$
test_nested_get 54.9020μs 10.0339μs 99.6623 KOps/s 100.1976 KOps/s $\color{#d91a1a}-0.53\%$
test_stacked_getleaf 30.4370μs 10.5341μs 94.9298 KOps/s 94.5952 KOps/s $\color{#35bf28}+0.35\%$
test_stacked_get 33.2120μs 10.0953μs 99.0557 KOps/s 100.8129 KOps/s $\color{#d91a1a}-1.74\%$
test_nested_getitemleaf 0.1011ms 11.2825μs 88.6326 KOps/s 90.4922 KOps/s $\color{#d91a1a}-2.06\%$
test_nested_getitem 81.3710μs 10.6936μs 93.5140 KOps/s 95.8747 KOps/s $\color{#d91a1a}-2.46\%$
test_stacked_getitemleaf 37.0290μs 11.1551μs 89.6451 KOps/s 91.5681 KOps/s $\color{#d91a1a}-2.10\%$
test_stacked_getitem 41.4270μs 10.6618μs 93.7931 KOps/s 94.6362 KOps/s $\color{#d91a1a}-0.89\%$
test_lock_nested 6.8420ms 0.4598ms 2.1750 KOps/s 1.7820 KOps/s $\textbf{\color{#35bf28}+22.05\%}$
test_lock_stack_nested 0.6465ms 0.4231ms 2.3633 KOps/s 2.3782 KOps/s $\color{#d91a1a}-0.63\%$
test_unlock_nested 0.8342ms 0.3757ms 2.6619 KOps/s 2.6488 KOps/s $\color{#35bf28}+0.49\%$
test_unlock_stack_nested 0.7499ms 0.3429ms 2.9167 KOps/s 2.9190 KOps/s $\color{#d91a1a}-0.08\%$
test_flatten_speed 0.2066ms 0.1005ms 9.9544 KOps/s 9.9366 KOps/s $\color{#35bf28}+0.18\%$
test_unflatten_speed 0.7114ms 0.5218ms 1.9166 KOps/s 1.9457 KOps/s $\color{#d91a1a}-1.49\%$
test_common_ops 1.8049ms 0.8137ms 1.2290 KOps/s 1.2656 KOps/s $\color{#d91a1a}-2.89\%$
test_creation 26.8600μs 2.5068μs 398.9126 KOps/s 402.5116 KOps/s $\color{#d91a1a}-0.89\%$
test_creation_empty 46.9880μs 13.2077μs 75.7134 KOps/s 89.2207 KOps/s $\textbf{\color{#d91a1a}-15.14\%}$
test_creation_nested_1 55.1330μs 16.3487μs 61.1668 KOps/s 70.5575 KOps/s $\textbf{\color{#d91a1a}-13.31\%}$
test_creation_nested_2 63.0370μs 20.7858μs 48.1098 KOps/s 53.6683 KOps/s $\textbf{\color{#d91a1a}-10.36\%}$
test_clone 0.1170ms 13.2501μs 75.4709 KOps/s 74.7309 KOps/s $\color{#35bf28}+0.99\%$
test_getitem[int] 1.1895ms 13.1341μs 76.1379 KOps/s 76.8875 KOps/s $\color{#d91a1a}-0.97\%$
test_getitem[slice_int] 0.1413ms 25.1703μs 39.7293 KOps/s 39.4542 KOps/s $\color{#35bf28}+0.70\%$
test_getitem[range] 0.1997ms 49.0460μs 20.3890 KOps/s 20.0793 KOps/s $\color{#35bf28}+1.54\%$
test_getitem[tuple] 0.1347ms 20.7401μs 48.2158 KOps/s 47.5712 KOps/s $\color{#35bf28}+1.36\%$
test_getitem[list] 0.3456ms 43.9431μs 22.7567 KOps/s 22.1090 KOps/s $\color{#35bf28}+2.93\%$
test_setitem_dim[int] 51.2850μs 25.9501μs 38.5355 KOps/s 38.0744 KOps/s $\color{#35bf28}+1.21\%$
test_setitem_dim[slice_int] 84.4570μs 51.4600μs 19.4326 KOps/s 18.5864 KOps/s $\color{#35bf28}+4.55\%$
test_setitem_dim[range] 0.1312ms 73.8033μs 13.5495 KOps/s 13.2286 KOps/s $\color{#35bf28}+2.43\%$
test_setitem_dim[tuple] 86.7520μs 40.9283μs 24.4330 KOps/s 23.4690 KOps/s $\color{#35bf28}+4.11\%$
test_setitem 0.1333ms 21.1639μs 47.2503 KOps/s 49.2135 KOps/s $\color{#d91a1a}-3.99\%$
test_set 0.1333ms 20.8757μs 47.9025 KOps/s 51.3996 KOps/s $\textbf{\color{#d91a1a}-6.80\%}$
test_set_shared 3.6236ms 0.1704ms 5.8691 KOps/s 5.8490 KOps/s $\color{#35bf28}+0.34\%$
test_update 0.2640ms 24.2838μs 41.1797 KOps/s 44.3744 KOps/s $\textbf{\color{#d91a1a}-7.20\%}$
test_update_nested 0.1915ms 34.4934μs 28.9911 KOps/s 29.9096 KOps/s $\color{#d91a1a}-3.07\%$
test_update__nested 0.8270ms 34.0178μs 29.3963 KOps/s 29.4564 KOps/s $\color{#d91a1a}-0.20\%$
test_set_nested 0.1540ms 23.0841μs 43.3199 KOps/s 45.6106 KOps/s $\textbf{\color{#d91a1a}-5.02\%}$
test_set_nested_new 0.1625ms 27.4345μs 36.4505 KOps/s 37.6370 KOps/s $\color{#d91a1a}-3.15\%$
test_select 0.2161ms 43.7309μs 22.8671 KOps/s 23.5294 KOps/s $\color{#d91a1a}-2.81\%$
test_select_nested 0.1228ms 62.8386μs 15.9138 KOps/s 15.9912 KOps/s $\color{#d91a1a}-0.48\%$
test_exclude_nested 0.1702ms 81.0858μs 12.3326 KOps/s 12.3310 KOps/s $\color{#35bf28}+0.01\%$
test_empty[True] 0.6179ms 0.4025ms 2.4843 KOps/s 2.4781 KOps/s $\color{#35bf28}+0.25\%$
test_empty[False] 9.9210μs 1.3739μs 727.8511 KOps/s 692.3693 KOps/s $\textbf{\color{#35bf28}+5.12\%}$
test_unbind_speed 0.5158ms 0.2685ms 3.7251 KOps/s 3.7066 KOps/s $\color{#35bf28}+0.50\%$
test_unbind_speed_stack0 0.5231ms 0.2696ms 3.7093 KOps/s 3.7314 KOps/s $\color{#d91a1a}-0.59\%$
test_unbind_speed_stack1 0.1162s 0.8221ms 1.2165 KOps/s 1.3321 KOps/s $\textbf{\color{#d91a1a}-8.68\%}$
test_split 2.5365ms 1.6076ms 622.0416 Ops/s 560.1719 Ops/s $\textbf{\color{#35bf28}+11.04\%}$
test_chunk 0.1182s 1.9848ms 503.8217 Ops/s 561.2586 Ops/s $\textbf{\color{#d91a1a}-10.23\%}$
test_consolidate_njt[False-None] 11.3687ms 8.5047ms 117.5820 Ops/s 121.6690 Ops/s $\color{#d91a1a}-3.36\%$
test_creation[device0] 0.3029ms 92.4164μs 10.8206 KOps/s 10.6728 KOps/s $\color{#35bf28}+1.38\%$
test_creation_from_tensor 4.0045ms 97.0670μs 10.3022 KOps/s 10.3898 KOps/s $\color{#d91a1a}-0.84\%$
test_add_one[memmap_tensor0] 0.2668ms 5.0312μs 198.7604 KOps/s 200.9296 KOps/s $\color{#d91a1a}-1.08\%$
test_contiguous[memmap_tensor0] 10.7300μs 0.5084μs 1.9668 MOps/s 1.9072 MOps/s $\color{#35bf28}+3.13\%$
test_stack[memmap_tensor0] 50.5640μs 3.4284μs 291.6779 KOps/s 286.1609 KOps/s $\color{#35bf28}+1.93\%$
test_memmaptd_index 1.0349ms 0.2447ms 4.0867 KOps/s 4.2660 KOps/s $\color{#d91a1a}-4.20\%$
test_memmaptd_index_astensor 0.6668ms 0.3289ms 3.0408 KOps/s 3.1169 KOps/s $\color{#d91a1a}-2.44\%$
test_memmaptd_index_op 1.0589ms 0.6245ms 1.6013 KOps/s 1.6989 KOps/s $\textbf{\color{#d91a1a}-5.75\%}$
test_serialize_model 0.1273s 0.1196s 8.3615 Ops/s 8.6529 Ops/s $\color{#d91a1a}-3.37\%$
test_serialize_model_pickle 0.5352s 0.4091s 2.4441 Ops/s 2.4782 Ops/s $\color{#d91a1a}-1.37\%$
test_serialize_weights 0.1271s 0.1175s 8.5106 Ops/s 7.4557 Ops/s $\textbf{\color{#35bf28}+14.15\%}$
test_serialize_weights_returnearly 0.1616s 0.1561s 6.4044 Ops/s 6.2105 Ops/s $\color{#35bf28}+3.12\%$
test_serialize_weights_pickle 0.5546s 0.4289s 2.3314 Ops/s 2.5789 Ops/s $\textbf{\color{#d91a1a}-9.60\%}$
test_serialize_weights_filesystem 0.1513s 0.1466s 6.8231 Ops/s 6.9506 Ops/s $\color{#d91a1a}-1.84\%$
test_serialize_model_filesystem 0.1550s 0.1522s 6.5721 Ops/s 6.0367 Ops/s $\textbf{\color{#35bf28}+8.87\%}$
test_reshape_pytree 66.7940μs 26.4825μs 37.7608 KOps/s 37.4194 KOps/s $\color{#35bf28}+0.91\%$
test_reshape_td 79.9490μs 33.0669μs 30.2417 KOps/s 30.8028 KOps/s $\color{#d91a1a}-1.82\%$
test_view_pytree 65.2310μs 26.4720μs 37.7757 KOps/s 37.3416 KOps/s $\color{#35bf28}+1.16\%$
test_view_td 0.1034ms 39.8190μs 25.1136 KOps/s 26.8348 KOps/s $\textbf{\color{#d91a1a}-6.41\%}$
test_unbind_pytree 66.0630μs 29.5966μs 33.7877 KOps/s 33.1312 KOps/s $\color{#35bf28}+1.98\%$
test_unbind_td 0.3137ms 39.7948μs 25.1289 KOps/s 25.2762 KOps/s $\color{#d91a1a}-0.58\%$
test_split_pytree 66.2730μs 29.6247μs 33.7556 KOps/s 33.0873 KOps/s $\color{#35bf28}+2.02\%$
test_split_td 0.5800ms 45.2224μs 22.1129 KOps/s 21.9842 KOps/s $\color{#35bf28}+0.59\%$
test_add_pytree 0.1086ms 35.4491μs 28.2095 KOps/s 27.8984 KOps/s $\color{#35bf28}+1.12\%$
test_add_td 0.1376ms 57.9392μs 17.2595 KOps/s 17.9575 KOps/s $\color{#d91a1a}-3.89\%$
test_compile_add_one_nested[tensordict-compile] 0.1584ms 62.8514μs 15.9105 KOps/s 15.6504 KOps/s $\color{#35bf28}+1.66\%$
test_compile_add_one_nested[tensordict-eager] 0.3928ms 0.1705ms 5.8654 KOps/s 5.7759 KOps/s $\color{#35bf28}+1.55\%$
test_compile_add_one_nested[pytree-compile] 0.1251ms 45.8061μs 21.8311 KOps/s 22.0087 KOps/s $\color{#d91a1a}-0.81\%$
test_compile_add_one_nested[pytree-eager] 0.2219ms 0.1177ms 8.4946 KOps/s 8.4763 KOps/s $\color{#35bf28}+0.22\%$
test_compile_copy_nested[tensordict-compile] 71.1520μs 27.0849μs 36.9209 KOps/s 38.2010 KOps/s $\color{#d91a1a}-3.35\%$
test_compile_copy_nested[tensordict-eager] 0.1181ms 58.2441μs 17.1691 KOps/s 17.1418 KOps/s $\color{#35bf28}+0.16\%$
test_compile_copy_nested[pytree-compile] 0.1439ms 77.7006μs 12.8699 KOps/s 13.0183 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_copy_nested[pytree-eager] 0.1324ms 67.1035μs 14.9023 KOps/s 15.1301 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_add_one_flat[tensordict-compile] 0.1929ms 0.1060ms 9.4352 KOps/s 9.3606 KOps/s $\color{#35bf28}+0.80\%$
test_compile_add_one_flat[tensordict-eager] 0.4024ms 0.2120ms 4.7175 KOps/s 4.6700 KOps/s $\color{#35bf28}+1.02\%$
test_compile_add_one_flat[tensorclass-compile] 0.1429ms 46.1481μs 21.6693 KOps/s 21.2690 KOps/s $\color{#35bf28}+1.88\%$
test_compile_add_one_flat[tensorclass-eager] 0.4716ms 66.9093μs 14.9456 KOps/s 14.6299 KOps/s $\color{#35bf28}+2.16\%$
test_compile_add_one_flat[pytree-compile] 0.1751ms 0.1043ms 9.5877 KOps/s 9.7832 KOps/s $\color{#d91a1a}-2.00\%$
test_compile_add_one_flat[pytree-eager] 0.3388ms 0.2014ms 4.9657 KOps/s 4.9678 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_add_self_flat[tensordict-eager] 0.4281ms 0.2312ms 4.3247 KOps/s 4.2824 KOps/s $\color{#35bf28}+0.99\%$
test_compile_add_self_flat[tensordict-compile] 0.2352ms 0.1057ms 9.4566 KOps/s 9.4190 KOps/s $\color{#35bf28}+0.40\%$
test_compile_add_self_flat[tensorclass-eager] 0.2572ms 63.6587μs 15.7088 KOps/s 16.0125 KOps/s $\color{#d91a1a}-1.90\%$
test_compile_add_self_flat[tensorclass-compile] 0.2295ms 48.9226μs 20.4405 KOps/s 21.5057 KOps/s $\color{#d91a1a}-4.95\%$
test_compile_add_self_flat[pytree-eager] 0.3371ms 0.1587ms 6.3027 KOps/s 6.3759 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_add_self_flat[pytree-compile] 0.1812ms 0.1031ms 9.6987 KOps/s 9.7751 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_copy_flat[tensordict-compile] 70.8220μs 22.1259μs 45.1959 KOps/s 46.4960 KOps/s $\color{#d91a1a}-2.80\%$
test_compile_copy_flat[tensordict-eager] 0.1365ms 65.6000μs 15.2439 KOps/s 14.6291 KOps/s $\color{#35bf28}+4.20\%$
test_compile_copy_flat[pytree-compile] 0.1979ms 79.0183μs 12.6553 KOps/s 12.8094 KOps/s $\color{#d91a1a}-1.20\%$
test_compile_copy_flat[pytree-eager] 0.1358ms 67.8446μs 14.7396 KOps/s 14.9046 KOps/s $\color{#d91a1a}-1.11\%$
test_compile_assign_and_add[tensordict-compile] 0.3060ms 0.2048ms 4.8816 KOps/s 4.8116 KOps/s $\color{#35bf28}+1.45\%$
test_compile_assign_and_add[tensordict-eager] 1.4619ms 1.3154ms 760.2240 Ops/s 765.0792 Ops/s $\color{#d91a1a}-0.63\%$
test_compile_assign_and_add[pytree-compile] 0.3255ms 0.2034ms 4.9169 KOps/s 4.9896 KOps/s $\color{#d91a1a}-1.46\%$
test_compile_assign_and_add[pytree-eager] 1.6498ms 0.7862ms 1.2719 KOps/s 1.2905 KOps/s $\color{#d91a1a}-1.43\%$
test_compile_assign_and_add_stack[compile] 0.6239ms 0.4438ms 2.2532 KOps/s 2.2396 KOps/s $\color{#35bf28}+0.61\%$
test_compile_assign_and_add_stack[eager] 3.4945ms 2.7541ms 363.0943 Ops/s 372.6680 Ops/s $\color{#d91a1a}-2.57\%$
test_compile_indexing[tensor-tensordict-compile] 0.1046ms 36.5482μs 27.3612 KOps/s 27.6239 KOps/s $\color{#d91a1a}-0.95\%$
test_compile_indexing[tensor-tensordict-eager] 0.5396ms 33.1730μs 30.1450 KOps/s 30.3158 KOps/s $\color{#d91a1a}-0.56\%$
test_compile_indexing[tensor-tensorclass-compile] 93.2040μs 29.5056μs 33.8918 KOps/s 34.2202 KOps/s $\color{#d91a1a}-0.96\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1052ms 22.9329μs 43.6054 KOps/s 42.5862 KOps/s $\color{#35bf28}+2.39\%$
test_compile_indexing[tensor-pytree-compile] 77.0330μs 29.9021μs 33.4425 KOps/s 33.5314 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_indexing[tensor-pytree-eager] 0.1245ms 22.7395μs 43.9764 KOps/s 43.3300 KOps/s $\color{#35bf28}+1.49\%$
test_compile_indexing[slice-tensordict-compile] 0.1079ms 51.0421μs 19.5917 KOps/s 19.1593 KOps/s $\color{#35bf28}+2.26\%$
test_compile_indexing[slice-tensordict-eager] 0.6726ms 20.2053μs 49.4920 KOps/s 49.4389 KOps/s $\color{#35bf28}+0.11\%$
test_compile_indexing[slice-tensorclass-compile] 0.1002ms 44.2626μs 22.5924 KOps/s 22.1605 KOps/s $\color{#35bf28}+1.95\%$
test_compile_indexing[slice-tensorclass-eager] 52.5780μs 18.8981μs 52.9154 KOps/s 53.1178 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_indexing[slice-pytree-compile] 0.1093ms 45.1000μs 22.1729 KOps/s 21.6933 KOps/s $\color{#35bf28}+2.21\%$
test_compile_indexing[slice-pytree-eager] 53.6600μs 18.7678μs 53.2828 KOps/s 52.8962 KOps/s $\color{#35bf28}+0.73\%$
test_compile_indexing[int-tensordict-compile] 0.1173ms 51.8985μs 19.2684 KOps/s 18.5026 KOps/s $\color{#35bf28}+4.14\%$
test_compile_indexing[int-tensordict-eager] 1.1863ms 20.2174μs 49.4625 KOps/s 49.5503 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_indexing[int-tensorclass-compile] 0.1121ms 45.3905μs 22.0310 KOps/s 21.9091 KOps/s $\color{#35bf28}+0.56\%$
test_compile_indexing[int-tensorclass-eager] 78.0080μs 18.8864μs 52.9481 KOps/s 53.4157 KOps/s $\color{#d91a1a}-0.88\%$
test_compile_indexing[int-pytree-compile] 0.1020ms 44.4831μs 22.4804 KOps/s 21.8550 KOps/s $\color{#35bf28}+2.86\%$
test_compile_indexing[int-pytree-eager] 57.0560μs 19.0030μs 52.6233 KOps/s 53.2760 KOps/s $\color{#d91a1a}-1.23\%$
test_mod_add[eager] 87.2030μs 35.7222μs 27.9938 KOps/s 28.5550 KOps/s $\color{#d91a1a}-1.97\%$
test_mod_add[compile] 0.2230ms 49.6873μs 20.1259 KOps/s 20.3864 KOps/s $\color{#d91a1a}-1.28\%$
test_mod_add[compile-overhead] 0.1203ms 48.4703μs 20.6312 KOps/s 20.2302 KOps/s $\color{#35bf28}+1.98\%$
test_mod_wrap[eager] 0.3616ms 0.2210ms 4.5259 KOps/s 4.4111 KOps/s $\color{#35bf28}+2.60\%$
test_mod_wrap[compile] 0.3130ms 0.2063ms 4.8483 KOps/s 4.8380 KOps/s $\color{#35bf28}+0.21\%$
test_mod_wrap[compile-overhead] 0.3193ms 0.2040ms 4.9027 KOps/s 4.8167 KOps/s $\color{#35bf28}+1.78\%$
test_mod_wrap_and_backward[eager] 12.3277ms 11.1438ms 89.7361 Ops/s 81.3521 Ops/s $\textbf{\color{#35bf28}+10.31\%}$
test_mod_wrap_and_backward[compile] 12.8491ms 11.0829ms 90.2291 Ops/s 76.9347 Ops/s $\textbf{\color{#35bf28}+17.28\%}$
test_mod_wrap_and_backward[compile-overhead] 13.4415ms 11.1312ms 89.8375 Ops/s 75.6392 Ops/s $\textbf{\color{#35bf28}+18.77\%}$
test_seq_add[eager] 0.2285ms 0.1168ms 8.5589 KOps/s 8.5417 KOps/s $\color{#35bf28}+0.20\%$
test_seq_add[compile] 0.1520ms 62.5276μs 15.9929 KOps/s 15.6267 KOps/s $\color{#35bf28}+2.34\%$
test_seq_add[compile-overhead] 0.1299ms 60.4078μs 16.5541 KOps/s 16.1587 KOps/s $\color{#35bf28}+2.45\%$
test_seq_wrap[eager] 0.7518ms 0.4519ms 2.2131 KOps/s 2.1867 KOps/s $\color{#35bf28}+1.21\%$
test_seq_wrap[compile] 0.4180ms 0.2288ms 4.3709 KOps/s 4.2090 KOps/s $\color{#35bf28}+3.84\%$
test_seq_wrap[compile-overhead] 0.4133ms 0.2283ms 4.3808 KOps/s 4.3297 KOps/s $\color{#35bf28}+1.18\%$
test_func_call_runtime[False-eager] 0.6789ms 0.5345ms 1.8708 KOps/s 1.7945 KOps/s $\color{#35bf28}+4.25\%$
test_func_call_runtime[False-compile] 0.5661ms 0.4255ms 2.3504 KOps/s 2.3312 KOps/s $\color{#35bf28}+0.82\%$
test_func_call_runtime[False-compile-overhead] 0.5619ms 0.4277ms 2.3379 KOps/s 2.3131 KOps/s $\color{#35bf28}+1.07\%$
test_func_call_runtime[True-eager] 1.1544ms 0.7526ms 1.3287 KOps/s 1.2752 KOps/s $\color{#35bf28}+4.20\%$
test_func_call_runtime[True-compile] 0.8984ms 0.4744ms 2.1078 KOps/s 2.1469 KOps/s $\color{#d91a1a}-1.82\%$
test_func_call_runtime[True-compile-overhead] 0.8314ms 0.4718ms 2.1195 KOps/s 2.1437 KOps/s $\color{#d91a1a}-1.13\%$
test_func_call_cm_runtime[False-eager] 0.9291ms 0.5407ms 1.8493 KOps/s 1.7970 KOps/s $\color{#35bf28}+2.91\%$
test_func_call_cm_runtime[False-compile] 0.6082ms 0.4267ms 2.3438 KOps/s 2.3558 KOps/s $\color{#d91a1a}-0.51\%$
test_func_call_cm_runtime[False-compile-overhead] 0.8174ms 0.4273ms 2.3405 KOps/s 2.3642 KOps/s $\color{#d91a1a}-1.00\%$
test_func_call_cm_runtime[True-eager] 1.1130ms 0.8944ms 1.1181 KOps/s 1.0818 KOps/s $\color{#35bf28}+3.35\%$
test_func_call_cm_runtime[True-compile] 0.6508ms 0.4945ms 2.0224 KOps/s 2.0298 KOps/s $\color{#d91a1a}-0.36\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6100ms 0.4916ms 2.0342 KOps/s 2.0375 KOps/s $\color{#d91a1a}-0.16\%$
test_vmap_func_call_cm_runtime[eager] 2.6097ms 1.9130ms 522.7327 Ops/s 507.9864 Ops/s $\color{#35bf28}+2.90\%$
test_vmap_func_call_cm_runtime[compile] 0.7906ms 0.5264ms 1.8997 KOps/s 1.8048 KOps/s $\textbf{\color{#35bf28}+5.26\%}$
test_vmap_func_call_cm_runtime[compile-overhead] 0.6480ms 0.5235ms 1.9101 KOps/s 1.8646 KOps/s $\color{#35bf28}+2.44\%$
test_distributed 0.2687ms 0.1279ms 7.8202 KOps/s 7.4707 KOps/s $\color{#35bf28}+4.68\%$
test_tdmodule 45.0240μs 26.6981μs 37.4559 KOps/s 38.2915 KOps/s $\color{#d91a1a}-2.18\%$
test_tdmodule_dispatch 77.3640μs 49.2327μs 20.3117 KOps/s 21.0302 KOps/s $\color{#d91a1a}-3.42\%$
test_tdseq 60.2320μs 29.8015μs 33.5553 KOps/s 33.2147 KOps/s $\color{#35bf28}+1.03\%$
test_tdseq_dispatch 96.2600μs 55.6381μs 17.9733 KOps/s 18.2258 KOps/s $\color{#d91a1a}-1.39\%$
test_instantiation_functorch 1.6815ms 1.5330ms 652.2995 Ops/s 631.6163 Ops/s $\color{#35bf28}+3.27\%$
test_exec_functorch 0.3233ms 0.1782ms 5.6122 KOps/s 5.5374 KOps/s $\color{#35bf28}+1.35\%$
test_exec_functional_call 0.2610ms 0.1719ms 5.8162 KOps/s 5.7364 KOps/s $\color{#35bf28}+1.39\%$
test_exec_td_decorator 0.5197ms 0.2324ms 4.3021 KOps/s 4.1947 KOps/s $\color{#35bf28}+2.56\%$
test_vmap_mlp_speed_decorator[True-True] 0.9171ms 0.6568ms 1.5226 KOps/s 1.4583 KOps/s $\color{#35bf28}+4.41\%$
test_vmap_mlp_speed_decorator[True-False] 1.0423ms 0.6558ms 1.5249 KOps/s 1.4950 KOps/s $\color{#35bf28}+2.00\%$
test_vmap_mlp_speed_decorator[False-True] 0.8212ms 0.5289ms 1.8907 KOps/s 1.8518 KOps/s $\color{#35bf28}+2.10\%$
test_vmap_mlp_speed_decorator[False-False] 0.8321ms 0.5302ms 1.8862 KOps/s 1.8676 KOps/s $\color{#35bf28}+0.99\%$
test_to_module_speed[True] 1.7483ms 1.3210ms 757.0192 Ops/s 725.5831 Ops/s $\color{#35bf28}+4.33\%$
test_to_module_speed[False] 2.0885ms 1.2993ms 769.6181 Ops/s 741.0235 Ops/s $\color{#35bf28}+3.86\%$
test_tc_init 89.8680μs 47.0798μs 21.2405 KOps/s 20.4582 KOps/s $\color{#35bf28}+3.82\%$
test_tc_init_nested 0.1774ms 91.5030μs 10.9286 KOps/s 10.0887 KOps/s $\textbf{\color{#35bf28}+8.33\%}$
test_tc_first_layer_tensor 17.5530μs 1.5574μs 642.0989 KOps/s 659.0196 KOps/s $\color{#d91a1a}-2.57\%$
test_tc_first_layer_nontensor 41.7580μs 4.7942μs 208.5850 KOps/s 215.1429 KOps/s $\color{#d91a1a}-3.05\%$
test_tc_second_layer_tensor 40.9360μs 2.8697μs 348.4700 KOps/s 354.0390 KOps/s $\color{#d91a1a}-1.57\%$
test_tc_second_layer_nontensor 33.0410μs 6.1145μs 163.5460 KOps/s 167.1968 KOps/s $\color{#d91a1a}-2.18\%$
test_unbind 0.2350s 13.5328ms 73.8943 Ops/s 78.6957 Ops/s $\textbf{\color{#d91a1a}-6.10\%}$
test_full_like 8.5336ms 7.7915ms 128.3447 Ops/s 72.8023 Ops/s $\textbf{\color{#35bf28}+76.29\%}$
test_zeros_like 3.7197ms 3.0018ms 333.1373 Ops/s 125.5598 Ops/s $\textbf{\color{#35bf28}+165.32\%}$
test_ones_like 3.9409ms 3.5519ms 281.5402 Ops/s 125.3765 Ops/s $\textbf{\color{#35bf28}+124.56\%}$
test_clone 6.0259ms 5.2053ms 192.1122 Ops/s 102.5179 Ops/s $\textbf{\color{#35bf28}+87.39\%}$
test_squeeze 60.2620μs 11.8147μs 84.6402 KOps/s 82.8546 KOps/s $\color{#35bf28}+2.16\%$
test_unsqueeze 0.3032ms 90.3668μs 11.0660 KOps/s 11.0386 KOps/s $\color{#35bf28}+0.25\%$
test_split 0.3257ms 0.1951ms 5.1247 KOps/s 5.0139 KOps/s $\color{#35bf28}+2.21\%$
test_permute 0.2984ms 0.2004ms 4.9902 KOps/s 5.0495 KOps/s $\color{#d91a1a}-1.17\%$
test_stack 28.4580ms 24.8341ms 40.2673 Ops/s 38.7454 Ops/s $\color{#35bf28}+3.93\%$
test_cat 27.8976ms 24.6248ms 40.6094 Ops/s 38.9901 Ops/s $\color{#35bf28}+4.15\%$
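
How to read the table above: the Change column appears to be the relative difference between Ops (this PR) and Ops on Repo HEAD, and Ops appears to be the reciprocal of Mean. The bolded rows, which match the Improved and Worsened totals, look like the ones whose change is at least 5% in either direction. Both the formula and the 5% threshold are inferred from the reported numbers, not taken from the benchmark workflow itself, and the helper names below are hypothetical. A minimal sketch under those assumptions:

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change of PR throughput relative to repo HEAD (same units for both)."""
    return (ops_pr - ops_head) / ops_head * 100.0


def ops_from_mean(mean_seconds: float) -> float:
    """Operations per second implied by a mean runtime given in seconds."""
    return 1.0 / mean_seconds


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption inferred from the bolded rows."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "neutral"


# (Ops on this PR, Ops on Repo HEAD), both in KOps/s, copied from the CPU table.
rows = {
    "test_plain_set_nested": (47.3101, 48.8291),
    "test_membership": (1144.1, 1401.8),  # 1.1441 MOps/s vs 1.4018 MOps/s
    "test_lock_nested": (2.1750, 1.7820),
}

for name, (ops_pr, ops_head) in rows.items():
    change = relative_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Because single-run microbenchmarks are noisy, a row that crosses the threshold is better treated as a prompt to re-run than as evidence of a real regression or speedup.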


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}31$. Worsened: $\large\color{#d91a1a}10$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 26.0300μs 11.3641μs 87.9966 KOps/s 77.1256 KOps/s $\textbf{\color{#35bf28}+14.10\%}$
test_plain_set_stack_nested 28.2310μs 11.5727μs 86.4100 KOps/s 76.1548 KOps/s $\textbf{\color{#35bf28}+13.47\%}$
test_plain_set_nested_inplace 38.2500μs 12.4839μs 80.1034 KOps/s 71.2887 KOps/s $\textbf{\color{#35bf28}+12.36\%}$
test_plain_set_stack_nested_inplace 77.1420μs 12.4413μs 80.3773 KOps/s 70.8596 KOps/s $\textbf{\color{#35bf28}+13.43\%}$
test_items 25.6210μs 2.9116μs 343.4494 KOps/s 335.1050 KOps/s $\color{#35bf28}+2.49\%$
test_items_nested 0.4043ms 0.3737ms 2.6761 KOps/s 2.6301 KOps/s $\color{#35bf28}+1.75\%$
test_items_nested_locked 0.4394ms 0.3734ms 2.6778 KOps/s 2.6650 KOps/s $\color{#35bf28}+0.48\%$
test_items_nested_leaf 89.2420μs 62.2365μs 16.0677 KOps/s 16.1339 KOps/s $\color{#d91a1a}-0.41\%$
test_items_stack_nested 0.4035ms 0.3770ms 2.6527 KOps/s 2.6131 KOps/s $\color{#35bf28}+1.52\%$
test_items_stack_nested_leaf 94.9620μs 62.6572μs 15.9598 KOps/s 15.8717 KOps/s $\color{#35bf28}+0.56\%$
test_items_stack_nested_locked 0.4423ms 0.3749ms 2.6674 KOps/s 2.6234 KOps/s $\color{#35bf28}+1.68\%$
test_keys 28.8900μs 3.4758μs 287.7041 KOps/s 285.8228 KOps/s $\color{#35bf28}+0.66\%$
test_keys_nested 0.1264ms 88.1189μs 11.3483 KOps/s 11.2285 KOps/s $\color{#35bf28}+1.07\%$
test_keys_nested_locked 0.7792ms 93.8314μs 10.6574 KOps/s 10.5643 KOps/s $\color{#35bf28}+0.88\%$
test_keys_nested_leaf 0.1442ms 78.8028μs 12.6899 KOps/s 12.5549 KOps/s $\color{#35bf28}+1.08\%$
test_keys_stack_nested 0.1211ms 88.8791μs 11.2512 KOps/s 11.0161 KOps/s $\color{#35bf28}+2.13\%$
test_keys_stack_nested_leaf 0.1127ms 79.3545μs 12.6017 KOps/s 12.2561 KOps/s $\color{#35bf28}+2.82\%$
test_keys_stack_nested_locked 0.1423ms 94.4548μs 10.5871 KOps/s 10.3834 KOps/s $\color{#35bf28}+1.96\%$
test_values 6.2485μs 0.8617μs 1.1604 MOps/s 1.1520 MOps/s $\color{#35bf28}+0.73\%$
test_values_nested 97.9320μs 38.2996μs 26.1100 KOps/s 26.3218 KOps/s $\color{#d91a1a}-0.80\%$
test_values_nested_locked 68.9210μs 39.5914μs 25.2580 KOps/s 25.4562 KOps/s $\color{#d91a1a}-0.78\%$
test_values_nested_leaf 73.4410μs 42.4872μs 23.5365 KOps/s 23.6332 KOps/s $\color{#d91a1a}-0.41\%$
test_values_stack_nested 70.2310μs 38.3404μs 26.0822 KOps/s 25.6010 KOps/s $\color{#35bf28}+1.88\%$
test_values_stack_nested_leaf 70.2810μs 43.2597μs 23.1162 KOps/s 23.0461 KOps/s $\color{#35bf28}+0.30\%$
test_values_stack_nested_locked 0.1125ms 39.6488μs 25.2215 KOps/s 24.7858 KOps/s $\color{#35bf28}+1.76\%$
test_membership 1.6581μs 0.5581μs 1.7916 MOps/s 1.8141 MOps/s $\color{#d91a1a}-1.24\%$
test_membership_nested 15.7900μs 2.0522μs 487.2793 KOps/s 468.2853 KOps/s $\color{#35bf28}+4.06\%$
test_membership_nested_leaf 14.6805μs 2.0602μs 485.3793 KOps/s 488.9194 KOps/s $\color{#d91a1a}-0.72\%$
test_membership_stacked_nested 40.8510μs 2.1262μs 470.3157 KOps/s 468.8351 KOps/s $\color{#35bf28}+0.32\%$
test_membership_stacked_nested_leaf 26.4600μs 2.1258μs 470.4154 KOps/s 475.5069 KOps/s $\color{#d91a1a}-1.07\%$
test_membership_nested_last 38.2810μs 3.1552μs 316.9342 KOps/s 316.5543 KOps/s $\color{#35bf28}+0.12\%$
test_membership_nested_leaf_last 27.6000μs 3.1435μs 318.1214 KOps/s 316.0421 KOps/s $\color{#35bf28}+0.66\%$
test_membership_stacked_nested_last 28.0610μs 3.6420μs 274.5769 KOps/s 314.2446 KOps/s $\textbf{\color{#d91a1a}-12.62\%}$
test_membership_stacked_nested_leaf_last 31.7000μs 3.6705μs 272.4393 KOps/s 315.1670 KOps/s $\textbf{\color{#d91a1a}-13.56\%}$
test_nested_getleaf 35.2310μs 6.2066μs 161.1176 KOps/s 161.6268 KOps/s $\color{#d91a1a}-0.32\%$
test_nested_get 30.1010μs 5.8986μs 169.5329 KOps/s 169.6707 KOps/s $\color{#d91a1a}-0.08\%$
test_stacked_getleaf 30.3100μs 6.2076μs 161.0928 KOps/s 161.1698 KOps/s $\color{#d91a1a}-0.05\%$
test_stacked_get 28.7710μs 5.8836μs 169.9653 KOps/s 171.1834 KOps/s $\color{#d91a1a}-0.71\%$
test_nested_getitemleaf 30.6310μs 6.5194μs 153.3882 KOps/s 153.3021 KOps/s $\color{#35bf28}+0.06\%$
test_nested_getitem 26.8710μs 6.1313μs 163.0987 KOps/s 160.7462 KOps/s $\color{#35bf28}+1.46\%$
test_stacked_getitemleaf 33.0710μs 6.5152μs 153.4872 KOps/s 154.5264 KOps/s $\color{#d91a1a}-0.67\%$
test_stacked_getitem 38.8610μs 6.1262μs 163.2328 KOps/s 161.9371 KOps/s $\color{#35bf28}+0.80\%$
test_lock_nested 9.1671ms 0.3833ms 2.6090 KOps/s 2.5970 KOps/s $\color{#35bf28}+0.46\%$
test_lock_stack_nested 0.3907ms 0.3465ms 2.8857 KOps/s 2.8665 KOps/s $\color{#35bf28}+0.67\%$
test_unlock_nested 0.7718ms 0.3168ms 3.1567 KOps/s 3.1624 KOps/s $\color{#d91a1a}-0.18\%$
test_unlock_stack_nested 0.3235ms 0.2863ms 3.4923 KOps/s 3.4895 KOps/s $\color{#35bf28}+0.08\%$
test_flatten_speed 0.1200ms 78.6205μs 12.7193 KOps/s 12.7368 KOps/s $\color{#d91a1a}-0.14\%$
test_unflatten_speed 0.4696ms 0.3195ms 3.1302 KOps/s 3.0743 KOps/s $\color{#35bf28}+1.82\%$
test_common_ops 1.6760ms 0.5728ms 1.7459 KOps/s 1.5619 KOps/s $\textbf{\color{#35bf28}+11.78\%}$
test_creation 0.1006ms 1.7708μs 564.7243 KOps/s 558.3026 KOps/s $\color{#35bf28}+1.15\%$
test_creation_empty 44.4510μs 6.3891μs 156.5162 KOps/s 105.9219 KOps/s $\textbf{\color{#35bf28}+47.77\%}$
test_creation_nested_1 37.3510μs 8.1700μs 122.3991 KOps/s 89.4513 KOps/s $\textbf{\color{#35bf28}+36.83\%}$
test_creation_nested_2 33.1800μs 10.8349μs 92.2941 KOps/s 71.8689 KOps/s $\textbf{\color{#35bf28}+28.42\%}$
test_clone 81.6820μs 10.4893μs 95.3348 KOps/s 91.9503 KOps/s $\color{#35bf28}+3.68\%$
test_getitem[int] 1.8158ms 10.9730μs 91.1331 KOps/s 93.1328 KOps/s $\color{#d91a1a}-2.15\%$
test_getitem[slice_int] 0.1241ms 20.8775μs 47.8985 KOps/s 47.7077 KOps/s $\color{#35bf28}+0.40\%$
test_getitem[range] 0.2158ms 37.8165μs 26.4435 KOps/s 26.0945 KOps/s $\color{#35bf28}+1.34\%$
test_getitem[tuple] 0.1067ms 18.1901μs 54.9750 KOps/s 55.7531 KOps/s $\color{#d91a1a}-1.40\%$
test_getitem[list] 0.3195ms 33.4193μs 29.9228 KOps/s 29.9571 KOps/s $\color{#d91a1a}-0.11\%$
test_setitem_dim[int] 39.7410μs 20.0059μs 49.9854 KOps/s 50.1016 KOps/s $\color{#d91a1a}-0.23\%$
test_setitem_dim[slice_int] 74.8920μs 40.8287μs 24.4926 KOps/s 25.3427 KOps/s $\color{#d91a1a}-3.35\%$
test_setitem_dim[range] 94.1620μs 58.1118μs 17.2082 KOps/s 18.1563 KOps/s $\textbf{\color{#d91a1a}-5.22\%}$
test_setitem_dim[tuple] 69.5310μs 34.1936μs 29.2452 KOps/s 30.6284 KOps/s $\color{#d91a1a}-4.52\%$
test_setitem 97.6730μs 14.1572μs 70.6353 KOps/s 63.2857 KOps/s $\textbf{\color{#35bf28}+11.61\%}$
test_set 84.9220μs 13.6069μs 73.4920 KOps/s 64.8651 KOps/s $\textbf{\color{#35bf28}+13.30\%}$
test_set_shared 1.4044ms 0.1534ms 6.5198 KOps/s 6.4908 KOps/s $\color{#35bf28}+0.45\%$
test_update 1.0085ms 15.5192μs 64.4365 KOps/s 52.9927 KOps/s $\textbf{\color{#35bf28}+21.60\%}$
test_update_nested 88.1920μs 21.2258μs 47.1124 KOps/s 40.2530 KOps/s $\textbf{\color{#35bf28}+17.04\%}$
test_update__nested 0.1399ms 25.8606μs 38.6689 KOps/s 38.9582 KOps/s $\color{#d91a1a}-0.74\%$
test_set_nested 93.6130μs 14.9750μs 66.7781 KOps/s 59.6250 KOps/s $\textbf{\color{#35bf28}+12.00\%}$
test_set_nested_new 92.1020μs 16.8587μs 59.3167 KOps/s 52.5721 KOps/s $\textbf{\color{#35bf28}+12.83\%}$
test_select 92.4020μs 28.4939μs 35.0952 KOps/s 31.8688 KOps/s $\textbf{\color{#35bf28}+10.12\%}$
test_select_nested 70.9420μs 44.1469μs 22.6516 KOps/s 22.6664 KOps/s $\color{#d91a1a}-0.07\%$
test_exclude_nested 0.1206ms 61.8636μs 16.1646 KOps/s 15.5851 KOps/s $\color{#35bf28}+3.72\%$
test_empty[True] 0.3668ms 0.2952ms 3.3873 KOps/s 3.3680 KOps/s $\color{#35bf28}+0.57\%$
test_empty[False] 3.7701μs 0.8465μs 1.1813 MOps/s 1.1608 MOps/s $\color{#35bf28}+1.77\%$
test_to 0.1007ms 59.2911μs 16.8659 KOps/s 17.0748 KOps/s $\color{#d91a1a}-1.22\%$
test_to_nonblocking 89.4120μs 48.2730μs 20.7155 KOps/s 19.5697 KOps/s $\textbf{\color{#35bf28}+5.85\%}$
test_unbind_speed 0.2764ms 0.2353ms 4.2505 KOps/s 4.2369 KOps/s $\color{#35bf28}+0.32\%$
test_unbind_speed_stack0 0.2921ms 0.2355ms 4.2466 KOps/s 4.2381 KOps/s $\color{#35bf28}+0.20\%$
test_unbind_speed_stack1 0.6749ms 0.6210ms 1.6103 KOps/s 1.4906 KOps/s $\textbf{\color{#35bf28}+8.03\%}$
test_split 94.9583ms 1.5781ms 633.6613 Ops/s 578.2829 Ops/s $\textbf{\color{#35bf28}+9.58\%}$
test_chunk 96.3327ms 1.5814ms 632.3420 Ops/s 687.0053 Ops/s $\textbf{\color{#d91a1a}-7.96\%}$
test_consolidate[False-None] 96.4715ms 2.9193ms 342.5523 Ops/s 329.0016 Ops/s $\color{#35bf28}+4.12\%$
test_consolidate[default-None] 1.8395ms 1.7219ms 580.7424 Ops/s 583.2083 Ops/s $\color{#d91a1a}-0.42\%$
test_consolidate[reduce-overhead-None] 1.8269ms 1.7434ms 573.5915 Ops/s 571.9658 Ops/s $\color{#35bf28}+0.28\%$
test_consolidate_njt[False-None] 6.9130ms 6.5508ms 152.6539 Ops/s 149.9359 Ops/s $\color{#35bf28}+1.81\%$
test_to[False-False-None] 1.7979ms 1.7237ms 580.1554 Ops/s 570.3021 Ops/s $\color{#35bf28}+1.73\%$
test_to[True-False-None] 1.5628ms 1.3351ms 748.9909 Ops/s 751.7799 Ops/s $\color{#d91a1a}-0.37\%$
test_to[within-False-None] 4.2256ms 4.1079ms 243.4351 Ops/s 243.3795 Ops/s $\color{#35bf28}+0.02\%$
test_to[True-default-None] 5.5925ms 5.3773ms 185.9657 Ops/s 190.5674 Ops/s $\color{#d91a1a}-2.41\%$
test_to_njt[False-False-None] 7.1325ms 6.9774ms 143.3194 Ops/s 143.9295 Ops/s $\color{#d91a1a}-0.42\%$
test_to_njt[True-False-None] 5.6810ms 5.4662ms 182.9417 Ops/s 181.7383 Ops/s $\color{#35bf28}+0.66\%$
test_to_njt[within-False-None] 12.4365ms 12.3020ms 81.2877 Ops/s 81.6354 Ops/s $\color{#d91a1a}-0.43\%$
test_creation[device0] 0.4585ms 81.0971μs 12.3309 KOps/s 12.3085 KOps/s $\color{#35bf28}+0.18\%$
test_creation_from_tensor 0.5061ms 84.5742μs 11.8239 KOps/s 11.5942 KOps/s $\color{#35bf28}+1.98\%$
test_add_one[memmap_tensor0] 0.4124ms 6.8996μs 144.9367 KOps/s 144.4252 KOps/s $\color{#35bf28}+0.35\%$
test_contiguous[memmap_tensor0] 1.9001μs 0.4424μs 2.2606 MOps/s 2.3038 MOps/s $\color{#d91a1a}-1.88\%$
test_stack[memmap_tensor0] 39.1900μs 4.4224μs 226.1218 KOps/s 232.9507 KOps/s $\color{#d91a1a}-2.93\%$
test_memmaptd_index 1.6586ms 0.2499ms 4.0018 KOps/s 3.9507 KOps/s $\color{#35bf28}+1.29\%$
test_memmaptd_index_astensor 0.5784ms 0.3140ms 3.1851 KOps/s 3.1765 KOps/s $\color{#35bf28}+0.27\%$
test_memmaptd_index_op 0.9974ms 0.5608ms 1.7830 KOps/s 1.6278 KOps/s $\textbf{\color{#35bf28}+9.53\%}$
test_serialize_model 0.1317s 0.1309s 7.6421 Ops/s 7.6770 Ops/s $\color{#d91a1a}-0.45\%$
test_serialize_model_pickle 1.3499s 1.1868s 0.8426 Ops/s 0.8251 Ops/s $\color{#35bf28}+2.13\%$
test_serialize_weights 0.1309s 0.1303s 7.6758 Ops/s 7.6935 Ops/s $\color{#d91a1a}-0.23\%$
test_serialize_weights_returnearly 0.3342s 53.5536ms 18.6729 Ops/s 14.0273 Ops/s $\textbf{\color{#35bf28}+33.12\%}$
test_serialize_weights_pickle 1.3823s 1.2229s 0.8177 Ops/s 0.8257 Ops/s $\color{#d91a1a}-0.96\%$
test_reshape_pytree 58.4810μs 22.3891μs 44.6646 KOps/s 44.4645 KOps/s $\color{#35bf28}+0.45\%$
test_reshape_td 47.6410μs 26.5591μs 37.6519 KOps/s 35.5816 KOps/s $\textbf{\color{#35bf28}+5.82\%}$
test_view_pytree 80.9320μs 22.3097μs 44.8235 KOps/s 45.2810 KOps/s $\color{#d91a1a}-1.01\%$
test_view_td 71.4510μs 30.4880μs 32.7998 KOps/s 31.7535 KOps/s $\color{#35bf28}+3.30\%$
test_unbind_pytree 66.6420μs 28.1505μs 35.5233 KOps/s 35.7436 KOps/s $\color{#d91a1a}-0.62\%$
test_unbind_td 0.7617ms 36.1326μs 27.6758 KOps/s 27.5141 KOps/s $\color{#35bf28}+0.59\%$
test_split_pytree 68.3710μs 29.2609μs 34.1752 KOps/s 33.2899 KOps/s $\color{#35bf28}+2.66\%$
test_split_td 0.9281ms 37.4469μs 26.7044 KOps/s 25.7265 KOps/s $\color{#35bf28}+3.80\%$
test_add_pytree 72.9820μs 34.4318μs 29.0429 KOps/s 28.9657 KOps/s $\color{#35bf28}+0.27\%$
test_add_td 0.4467ms 44.3079μs 22.5694 KOps/s 20.1804 KOps/s $\textbf{\color{#35bf28}+11.84\%}$
test_compile_add_one_nested[tensordict-compile] 0.2314ms 0.1222ms 8.1849 KOps/s 7.8258 KOps/s $\color{#35bf28}+4.59\%$
test_compile_add_one_nested[tensordict-eager] 0.2381ms 0.1320ms 7.5759 KOps/s 7.4102 KOps/s $\color{#35bf28}+2.24\%$
test_compile_add_one_nested[pytree-compile] 0.4968ms 95.9723μs 10.4197 KOps/s 10.2983 KOps/s $\color{#35bf28}+1.18\%$
test_compile_add_one_nested[pytree-eager] 0.5898ms 0.1522ms 6.5710 KOps/s 6.5802 KOps/s $\color{#d91a1a}-0.14\%$
test_compile_copy_nested[tensordict-compile] 0.4161ms 21.9925μs 45.4700 KOps/s 45.4362 KOps/s $\color{#35bf28}+0.07\%$
test_compile_copy_nested[tensordict-eager] 0.4358ms 29.6271μs 33.7529 KOps/s 33.3782 KOps/s $\color{#35bf28}+1.12\%$
test_compile_copy_nested[pytree-compile] 0.4695ms 65.3212μs 15.3090 KOps/s 14.8338 KOps/s $\color{#35bf28}+3.20\%$
test_compile_copy_nested[pytree-eager] 0.4549ms 50.3204μs 19.8727 KOps/s 19.3531 KOps/s $\color{#35bf28}+2.68\%$
test_compile_add_one_flat[tensordict-compile] 0.1878ms 0.1431ms 6.9899 KOps/s 6.9680 KOps/s $\color{#35bf28}+0.31\%$
test_compile_add_one_flat[tensordict-eager] 0.6249ms 0.2204ms 4.5378 KOps/s 4.5711 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_add_one_flat[tensorclass-compile] 0.1696ms 98.4834μs 10.1540 KOps/s 10.1226 KOps/s $\color{#35bf28}+0.31\%$
test_compile_add_one_flat[tensorclass-eager] 0.1147ms 56.6085μs 17.6652 KOps/s 17.3973 KOps/s $\color{#35bf28}+1.54\%$
test_compile_add_one_flat[pytree-compile] 0.2721ms 0.1364ms 7.3299 KOps/s 7.2328 KOps/s $\color{#35bf28}+1.34\%$
test_compile_add_one_flat[pytree-eager] 0.5906ms 0.4786ms 2.0895 KOps/s 2.0629 KOps/s $\color{#35bf28}+1.29\%$
test_compile_add_self_flat[tensordict-eager] 0.4047ms 0.2620ms 3.8162 KOps/s 3.8143 KOps/s $\color{#35bf28}+0.05\%$
test_compile_add_self_flat[tensordict-compile] 0.1830ms 0.1437ms 6.9578 KOps/s 6.7877 KOps/s $\color{#35bf28}+2.51\%$
test_compile_add_self_flat[tensorclass-eager] 0.1555ms 68.3829μs 14.6235 KOps/s 14.6266 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_add_self_flat[tensorclass-compile] 0.1552ms 0.1041ms 9.6060 KOps/s 9.6783 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_add_self_flat[pytree-eager] 0.4447ms 0.4004ms 2.4973 KOps/s 2.4336 KOps/s $\color{#35bf28}+2.62\%$
test_compile_add_self_flat[pytree-compile] 0.1880ms 0.1427ms 7.0067 KOps/s 7.4415 KOps/s $\textbf{\color{#d91a1a}-5.84\%}$
test_compile_copy_flat[tensordict-compile] 55.7510μs 19.8763μs 50.3112 KOps/s 55.8602 KOps/s $\textbf{\color{#d91a1a}-9.93\%}$
test_compile_copy_flat[tensordict-eager] 66.9510μs 31.2038μs 32.0474 KOps/s 31.4124 KOps/s $\color{#35bf28}+2.02\%$
test_compile_copy_flat[pytree-compile] 0.1256ms 69.6438μs 14.3588 KOps/s 14.1180 KOps/s $\color{#35bf28}+1.71\%$
test_compile_copy_flat[pytree-eager] 0.1805ms 51.3463μs 19.4756 KOps/s 19.0294 KOps/s $\color{#35bf28}+2.34\%$
test_compile_assign_and_add[tensordict-compile] 1.6377ms 0.3927ms 2.5463 KOps/s 2.1990 KOps/s $\textbf{\color{#35bf28}+15.80\%}$
test_compile_assign_and_add[tensordict-eager] 3.1067ms 2.7243ms 367.0651 Ops/s 373.2142 Ops/s $\color{#d91a1a}-1.65\%$
test_compile_assign_and_add[pytree-compile] 1.6068ms 0.4340ms 2.3040 KOps/s 2.2508 KOps/s $\color{#35bf28}+2.36\%$
test_compile_assign_and_add[pytree-eager] 3.0955ms 2.6923ms 371.4352 Ops/s 374.9626 Ops/s $\color{#d91a1a}-0.94\%$
test_compile_indexing[tensor-tensordict-compile] 0.6159ms 0.1216ms 8.2203 KOps/s 8.5609 KOps/s $\color{#d91a1a}-3.98\%$
test_compile_indexing[tensor-tensordict-eager] 0.5988ms 84.1106μs 11.8891 KOps/s 12.4891 KOps/s $\color{#d91a1a}-4.80\%$
test_compile_indexing[tensor-tensorclass-compile] 0.6540ms 0.1095ms 9.1346 KOps/s 9.2453 KOps/s $\color{#d91a1a}-1.20\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1183ms 72.6196μs 13.7704 KOps/s 14.3578 KOps/s $\color{#d91a1a}-4.09\%$
test_compile_indexing[tensor-pytree-compile] 0.1618ms 0.1124ms 8.8956 KOps/s 8.7190 KOps/s $\color{#35bf28}+2.03\%$
test_compile_indexing[tensor-pytree-eager] 0.1352ms 71.6051μs 13.9655 KOps/s 13.7117 KOps/s $\color{#35bf28}+1.85\%$
test_compile_indexing[slice-tensordict-compile] 0.2025ms 0.1027ms 9.7374 KOps/s 9.8407 KOps/s $\color{#d91a1a}-1.05\%$
test_compile_indexing[slice-tensordict-eager] 0.1699ms 17.0187μs 58.7590 KOps/s 56.6883 KOps/s $\color{#35bf28}+3.65\%$
test_compile_indexing[slice-tensorclass-compile] 0.1402ms 0.1000ms 9.9982 KOps/s 10.2561 KOps/s $\color{#d91a1a}-2.51\%$
test_compile_indexing[slice-tensorclass-eager] 59.3620μs 15.7547μs 63.4732 KOps/s 62.6360 KOps/s $\color{#35bf28}+1.34\%$
test_compile_indexing[slice-pytree-compile] 0.1425ms 98.2368μs 10.1795 KOps/s 10.1320 KOps/s $\color{#35bf28}+0.47\%$
test_compile_indexing[slice-pytree-eager] 67.9220μs 16.0005μs 62.4979 KOps/s 63.2279 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_indexing[int-tensordict-compile] 0.1447ms 0.1017ms 9.8349 KOps/s 9.8613 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_indexing[int-tensordict-eager] 0.5798ms 16.9805μs 58.8912 KOps/s 57.5203 KOps/s $\color{#35bf28}+2.38\%$
test_compile_indexing[int-tensorclass-compile] 0.1512ms 97.7632μs 10.2288 KOps/s 10.1682 KOps/s $\color{#35bf28}+0.60\%$
test_compile_indexing[int-tensorclass-eager] 47.5010μs 15.9737μs 62.6027 KOps/s 62.2293 KOps/s $\color{#35bf28}+0.60\%$
test_compile_indexing[int-pytree-compile] 0.1475ms 99.8387μs 10.0162 KOps/s 9.8261 KOps/s $\color{#35bf28}+1.93\%$
test_compile_indexing[int-pytree-eager] 49.4510μs 15.9860μs 62.5547 KOps/s 63.1826 KOps/s $\color{#d91a1a}-0.99\%$
test_mod_add[eager] 0.1075ms 36.6067μs 27.3174 KOps/s 25.7108 KOps/s $\textbf{\color{#35bf28}+6.25\%}$
test_mod_add[compile] 0.3828ms 81.5653μs 12.2601 KOps/s 12.2654 KOps/s $\color{#d91a1a}-0.04\%$
test_mod_add[compile-overhead] 0.3283ms 0.1671ms 5.9832 KOps/s 5.7135 KOps/s $\color{#35bf28}+4.72\%$
test_mod_wrap[eager] 0.3332ms 0.2498ms 4.0026 KOps/s 3.8970 KOps/s $\color{#35bf28}+2.71\%$
test_mod_wrap[compile] 0.3967ms 0.2847ms 3.5129 KOps/s 3.3958 KOps/s $\color{#35bf28}+3.45\%$
test_mod_wrap[compile-overhead] 7.0233ms 3.7179ms 268.9676 Ops/s 282.9737 Ops/s $\color{#d91a1a}-4.95\%$
test_mod_wrap_and_backward[eager] 1.5906ms 1.4516ms 688.8792 Ops/s 682.7755 Ops/s $\color{#35bf28}+0.89\%$
test_mod_wrap_and_backward[compile] 1.4906ms 1.3765ms 726.4935 Ops/s 722.7884 Ops/s $\color{#35bf28}+0.51\%$
test_mod_wrap_and_backward[compile-overhead] 1.5124ms 1.0434ms 958.3984 Ops/s 950.3611 Ops/s $\color{#35bf28}+0.85\%$
test_seq_add[eager] 0.1609ms 0.1140ms 8.7709 KOps/s 8.3536 KOps/s $\color{#35bf28}+5.00\%$
test_seq_add[compile] 0.1573ms 87.9066μs 11.3757 KOps/s 11.3467 KOps/s $\color{#35bf28}+0.26\%$
test_seq_add[compile-overhead] 0.1695ms 0.1292ms 7.7408 KOps/s 7.7014 KOps/s $\color{#35bf28}+0.51\%$
test_seq_wrap[eager] 0.4754ms 0.4180ms 2.3923 KOps/s 2.3136 KOps/s $\color{#35bf28}+3.40\%$
test_seq_wrap[compile] 0.3491ms 0.3010ms 3.3223 KOps/s 3.2634 KOps/s $\color{#35bf28}+1.81\%$
test_seq_wrap[compile-overhead] 0.2856ms 0.2254ms 4.4361 KOps/s 4.3601 KOps/s $\color{#35bf28}+1.74\%$
test_func_call_runtime[False-eager] 0.8045ms 0.7456ms 1.3411 KOps/s 1.3212 KOps/s $\color{#35bf28}+1.51\%$
test_func_call_runtime[False-compile] 0.7979ms 0.7458ms 1.3408 KOps/s 1.3200 KOps/s $\color{#35bf28}+1.58\%$
test_func_call_runtime[False-compile-overhead] 0.4152ms 0.3668ms 2.7262 KOps/s 2.7182 KOps/s $\color{#35bf28}+0.30\%$
test_func_call_runtime[True-eager] 0.9835ms 0.9131ms 1.0952 KOps/s 1.0778 KOps/s $\color{#35bf28}+1.61\%$
test_func_call_runtime[True-compile] 0.8251ms 0.7667ms 1.3042 KOps/s 1.3015 KOps/s $\color{#35bf28}+0.21\%$
test_func_call_runtime[True-compile-overhead] 0.4317ms 0.3872ms 2.5827 KOps/s 2.5803 KOps/s $\color{#35bf28}+0.09\%$
test_func_call_cm_runtime[False-eager] 0.8826ms 0.7726ms 1.2943 KOps/s 1.3336 KOps/s $\color{#d91a1a}-2.95\%$
test_func_call_cm_runtime[False-compile] 0.8302ms 0.7524ms 1.3291 KOps/s 1.3296 KOps/s $\color{#d91a1a}-0.04\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4347ms 0.3687ms 2.7125 KOps/s 2.6943 KOps/s $\color{#35bf28}+0.68\%$
test_func_call_cm_runtime[True-eager] 1.1779ms 1.0235ms 977.0415 Ops/s 973.3739 Ops/s $\color{#35bf28}+0.38\%$
test_func_call_cm_runtime[True-compile] 0.8742ms 0.7950ms 1.2578 KOps/s 1.2435 KOps/s $\color{#35bf28}+1.15\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4587ms 0.4120ms 2.4269 KOps/s 2.4133 KOps/s $\color{#35bf28}+0.57\%$
test_vmap_func_call_cm_runtime[eager] 2.6047ms 2.1164ms 472.5114 Ops/s 469.0104 Ops/s $\color{#35bf28}+0.75\%$
test_vmap_func_call_cm_runtime[compile] 0.8880ms 0.8099ms 1.2348 KOps/s 1.2351 KOps/s $\color{#d91a1a}-0.03\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4719ms 0.4161ms 2.4031 KOps/s 2.3887 KOps/s $\color{#35bf28}+0.60\%$
test_distributed 11.4659ms 0.2140ms 4.6738 KOps/s 8.4312 KOps/s $\textbf{\color{#d91a1a}-44.57\%}$
test_tdmodule 52.7210μs 19.6687μs 50.8421 KOps/s 46.9097 KOps/s $\textbf{\color{#35bf28}+8.38\%}$
test_tdmodule_dispatch 55.0110μs 34.4916μs 28.9926 KOps/s 26.3862 KOps/s $\textbf{\color{#35bf28}+9.88\%}$
test_tdseq 39.9710μs 19.9304μs 50.1746 KOps/s 46.3891 KOps/s $\textbf{\color{#35bf28}+8.16\%}$
test_tdseq_dispatch 57.0710μs 36.7432μs 27.2159 KOps/s 24.8817 KOps/s $\textbf{\color{#35bf28}+9.38\%}$
test_instantiation_functorch 1.6538ms 1.5887ms 629.4291 Ops/s 632.9240 Ops/s $\color{#d91a1a}-0.55\%$
test_exec_functorch 0.1909ms 0.1473ms 6.7873 KOps/s 6.8588 KOps/s $\color{#d91a1a}-1.04\%$
test_exec_functional_call 0.1692ms 0.1386ms 7.2128 KOps/s 7.2631 KOps/s $\color{#d91a1a}-0.69\%$
test_exec_td_decorator 0.3941ms 0.1880ms 5.3189 KOps/s 5.2810 KOps/s $\color{#35bf28}+0.72\%$
test_vmap_mlp_speed_decorator[True-True] 0.8019ms 0.6931ms 1.4428 KOps/s 1.4224 KOps/s $\color{#35bf28}+1.43\%$
test_vmap_mlp_speed_decorator[True-False] 0.8192ms 0.6953ms 1.4382 KOps/s 1.4353 KOps/s $\color{#35bf28}+0.20\%$
test_vmap_mlp_speed_decorator[False-True] 0.7491ms 0.6171ms 1.6205 KOps/s 1.6329 KOps/s $\color{#d91a1a}-0.76\%$
test_vmap_mlp_speed_decorator[False-False] 0.7310ms 0.6065ms 1.6487 KOps/s 1.6462 KOps/s $\color{#35bf28}+0.16\%$
test_vmap_transformer_speed_decorator[True-True] 20.0681ms 19.5369ms 51.1853 Ops/s 51.2133 Ops/s $\color{#d91a1a}-0.05\%$
test_vmap_transformer_speed_decorator[True-False] 19.6527ms 19.5592ms 51.1269 Ops/s 51.5123 Ops/s $\color{#d91a1a}-0.75\%$
test_vmap_transformer_speed_decorator[False-True] 19.5581ms 19.4260ms 51.4773 Ops/s 51.7797 Ops/s $\color{#d91a1a}-0.58\%$
test_vmap_transformer_speed_decorator[False-False] 20.5728ms 19.4054ms 51.5319 Ops/s 51.8730 Ops/s $\color{#d91a1a}-0.66\%$
test_to_module_speed[True] 1.0863ms 1.0027ms 997.2638 Ops/s 1.0212 KOps/s $\color{#d91a1a}-2.34\%$
test_to_module_speed[False] 1.5572ms 0.9816ms 1.0188 KOps/s 1.0364 KOps/s $\color{#d91a1a}-1.71\%$
test_tc_init 66.0920μs 34.2689μs 29.1810 KOps/s 26.3929 KOps/s $\textbf{\color{#35bf28}+10.56\%}$
test_tc_init_nested 0.1028ms 69.0956μs 14.4727 KOps/s 13.1032 KOps/s $\textbf{\color{#35bf28}+10.45\%}$
test_tc_first_layer_tensor 29.8100μs 0.8656μs 1.1552 MOps/s 1.1829 MOps/s $\color{#d91a1a}-2.33\%$
test_tc_first_layer_nontensor 82.6820μs 2.3287μs 429.4226 KOps/s 440.8304 KOps/s $\color{#d91a1a}-2.59\%$
test_tc_second_layer_tensor 19.5380μs 1.4784μs 676.3931 KOps/s 684.1954 KOps/s $\color{#d91a1a}-1.14\%$
test_tc_second_layer_nontensor 30.7210μs 3.0907μs 323.5519 KOps/s 327.6794 KOps/s $\color{#d91a1a}-1.26\%$
test_unbind 0.2258s 10.0812ms 99.1944 Ops/s 140.8735 Ops/s $\textbf{\color{#d91a1a}-29.59\%}$
test_full_like 10.4500ms 9.2584ms 108.0104 Ops/s 106.7937 Ops/s $\color{#35bf28}+1.14\%$
test_zeros_like 11.5894ms 8.7757ms 113.9510 Ops/s 233.5947 Ops/s $\textbf{\color{#d91a1a}-51.22\%}$
test_ones_like 9.1336ms 7.1164ms 140.5196 Ops/s 236.4205 Ops/s $\textbf{\color{#d91a1a}-40.56\%}$
test_clone 7.1458ms 6.5454ms 152.7788 Ops/s 107.9270 Ops/s $\textbf{\color{#35bf28}+41.56\%}$
test_squeeze 59.3120μs 9.7554μs 102.5078 KOps/s 102.0103 KOps/s $\color{#35bf28}+0.49\%$
test_unsqueeze 0.1225ms 71.4169μs 14.0023 KOps/s 13.8372 KOps/s $\color{#35bf28}+1.19\%$
test_split 0.4119ms 0.1593ms 6.2763 KOps/s 6.2334 KOps/s $\color{#35bf28}+0.69\%$
test_permute 0.2332ms 0.1777ms 5.6263 KOps/s 5.7244 KOps/s $\color{#d91a1a}-1.71\%$
test_stack 51.4888ms 51.0898ms 19.5734 Ops/s 19.7533 Ops/s $\color{#d91a1a}-0.91\%$
test_cat 51.6159ms 51.0431ms 19.5913 Ops/s 19.7154 Ops/s $\color{#d91a1a}-0.63\%$

@vmoens vmoens added the Refactor Refactoring code - not a new feature label Jan 10, 2025
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Jan 10, 2025
ghstack-source-id: 8be779a7a85fdf45000181a9ea0f830822f19e37
Pull Request resolved: #1178
@vmoens vmoens merged commit 8cf88d1 into gh/vmoens/45/base Jan 10, 2025
20 of 28 checks passed
@vmoens vmoens deleted the gh/vmoens/45/head branch January 10, 2025 13:16