[Refactor] Add missing functions in tensorclass register #1153

Merged: 2 commits, Dec 20, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 20, 2024
ghstack-source-id: 9f959be7b04c915596bdd67e149dc241cb8b4635
Pull Request resolved: #1153
@facebook-github-bot added the CLA Signed label Dec 20, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 20, 2024
ghstack-source-id: 48311d7a98a9895b10e5552e5b4a4f13764607e0
Pull Request resolved: #1153

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}28$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 42.4800μs 20.8839μs 47.8837 KOps/s 51.4393 KOps/s $\textbf{\color{#d91a1a}-6.91\%}$
test_plain_set_stack_nested 40.9160μs 21.0786μs 47.4414 KOps/s 50.1796 KOps/s $\textbf{\color{#d91a1a}-5.46\%}$
test_plain_set_nested_inplace 83.8690μs 22.5210μs 44.4031 KOps/s 46.8970 KOps/s $\textbf{\color{#d91a1a}-5.32\%}$
test_plain_set_stack_nested_inplace 56.0150μs 22.6759μs 44.0996 KOps/s 46.8564 KOps/s $\textbf{\color{#d91a1a}-5.88\%}$
test_items 30.9290μs 4.2114μs 237.4510 KOps/s 245.3104 KOps/s $\color{#d91a1a}-3.20\%$
test_items_nested 0.8172ms 0.4021ms 2.4871 KOps/s 2.4937 KOps/s $\color{#d91a1a}-0.26\%$
test_items_nested_locked 0.7242ms 0.4017ms 2.4895 KOps/s 2.4863 KOps/s $\color{#35bf28}+0.13\%$
test_items_nested_leaf 0.1763ms 76.1161μs 13.1378 KOps/s 12.9160 KOps/s $\color{#35bf28}+1.72\%$
test_items_stack_nested 0.4766ms 0.4038ms 2.4763 KOps/s 2.4746 KOps/s $\color{#35bf28}+0.07\%$
test_items_stack_nested_leaf 0.1545ms 79.1589μs 12.6328 KOps/s 12.4136 KOps/s $\color{#35bf28}+1.77\%$
test_items_stack_nested_locked 0.5024ms 0.4090ms 2.4451 KOps/s 2.4664 KOps/s $\color{#d91a1a}-0.86\%$
test_keys 22.2910μs 3.5270μs 283.5251 KOps/s 285.3914 KOps/s $\color{#d91a1a}-0.65\%$
test_keys_nested 0.2635ms 0.1680ms 5.9527 KOps/s 6.0114 KOps/s $\color{#d91a1a}-0.98\%$
test_keys_nested_locked 0.7546ms 0.1732ms 5.7723 KOps/s 5.8222 KOps/s $\color{#d91a1a}-0.86\%$
test_keys_nested_leaf 0.2750ms 0.1466ms 6.8194 KOps/s 6.9076 KOps/s $\color{#d91a1a}-1.28\%$
test_keys_stack_nested 0.3034ms 0.1673ms 5.9764 KOps/s 6.1445 KOps/s $\color{#d91a1a}-2.74\%$
test_keys_stack_nested_leaf 0.2498ms 0.1444ms 6.9244 KOps/s 7.1383 KOps/s $\color{#d91a1a}-3.00\%$
test_keys_stack_nested_locked 0.2814ms 0.1727ms 5.7892 KOps/s 5.8882 KOps/s $\color{#d91a1a}-1.68\%$
test_values 8.2756μs 1.0341μs 967.0530 KOps/s 944.2112 KOps/s $\color{#35bf28}+2.42\%$
test_values_nested 0.1119ms 61.9390μs 16.1449 KOps/s 16.4331 KOps/s $\color{#d91a1a}-1.75\%$
test_values_nested_locked 0.1181ms 61.5558μs 16.2454 KOps/s 16.0020 KOps/s $\color{#35bf28}+1.52\%$
test_values_nested_leaf 0.1296ms 71.9219μs 13.9040 KOps/s 13.9136 KOps/s $\color{#d91a1a}-0.07\%$
test_values_stack_nested 0.1179ms 63.5828μs 15.7275 KOps/s 16.0386 KOps/s $\color{#d91a1a}-1.94\%$
test_values_stack_nested_leaf 0.1422ms 73.1976μs 13.6616 KOps/s 14.1810 KOps/s $\color{#d91a1a}-3.66\%$
test_values_stack_nested_locked 0.1282ms 64.0188μs 15.6204 KOps/s 15.8001 KOps/s $\color{#d91a1a}-1.14\%$
test_membership 5.7461μs 0.7555μs 1.3236 MOps/s 1.1310 MOps/s $\textbf{\color{#35bf28}+17.03\%}$
test_membership_nested 22.0310μs 3.0276μs 330.2992 KOps/s 340.1407 KOps/s $\color{#d91a1a}-2.89\%$
test_membership_nested_leaf 46.6800μs 3.0406μs 328.8837 KOps/s 331.2892 KOps/s $\color{#d91a1a}-0.73\%$
test_membership_stacked_nested 20.9690μs 3.0009μs 333.2338 KOps/s 338.7493 KOps/s $\color{#d91a1a}-1.63\%$
test_membership_stacked_nested_leaf 20.8690μs 3.0221μs 330.8910 KOps/s 330.6080 KOps/s $\color{#35bf28}+0.09\%$
test_membership_nested_last 24.1150μs 4.4205μs 226.2206 KOps/s 227.7503 KOps/s $\color{#d91a1a}-0.67\%$
test_membership_nested_leaf_last 28.9240μs 4.4611μs 224.1604 KOps/s 227.3148 KOps/s $\color{#d91a1a}-1.39\%$
test_membership_stacked_nested_last 30.2270μs 4.4373μs 225.3626 KOps/s 74.5823 KOps/s $\textbf{\color{#35bf28}+202.17\%}$
test_membership_stacked_nested_leaf_last 22.0010μs 4.4276μs 225.8585 KOps/s 74.4194 KOps/s $\textbf{\color{#35bf28}+203.49\%}$
test_nested_getleaf 35.6470μs 10.7682μs 92.8661 KOps/s 94.6968 KOps/s $\color{#d91a1a}-1.93\%$
test_nested_get 37.4900μs 10.2457μs 97.6021 KOps/s 94.9969 KOps/s $\color{#35bf28}+2.74\%$
test_stacked_getleaf 31.4280μs 10.7971μs 92.6174 KOps/s 93.6384 KOps/s $\color{#d91a1a}-1.09\%$
test_stacked_get 34.1740μs 10.3772μs 96.3656 KOps/s 98.3964 KOps/s $\color{#d91a1a}-2.06\%$
test_nested_getitemleaf 36.0680μs 11.2737μs 88.7021 KOps/s 90.0195 KOps/s $\color{#d91a1a}-1.46\%$
test_nested_getitem 50.7240μs 10.4179μs 95.9886 KOps/s 97.0579 KOps/s $\color{#d91a1a}-1.10\%$
test_stacked_getitemleaf 42.8300μs 11.1203μs 89.9253 KOps/s 90.6124 KOps/s $\color{#d91a1a}-0.76\%$
test_stacked_getitem 48.6340μs 10.3130μs 96.9654 KOps/s 100.6830 KOps/s $\color{#d91a1a}-3.69\%$
test_lock_nested 7.4235ms 0.4656ms 2.1479 KOps/s 2.2004 KOps/s $\color{#d91a1a}-2.39\%$
test_lock_stack_nested 0.8572ms 0.4284ms 2.3342 KOps/s 2.4063 KOps/s $\color{#d91a1a}-3.00\%$
test_unlock_nested 0.7023ms 0.3737ms 2.6759 KOps/s 2.6573 KOps/s $\color{#35bf28}+0.70\%$
test_unlock_stack_nested 0.7732ms 0.3490ms 2.8654 KOps/s 2.9733 KOps/s $\color{#d91a1a}-3.63\%$
test_flatten_speed 0.2378ms 99.8328μs 10.0168 KOps/s 9.9103 KOps/s $\color{#35bf28}+1.07\%$
test_unflatten_speed 0.7046ms 0.5301ms 1.8865 KOps/s 1.9047 KOps/s $\color{#d91a1a}-0.96\%$
test_common_ops 7.3586ms 0.7853ms 1.2734 KOps/s 1.3457 KOps/s $\textbf{\color{#d91a1a}-5.37\%}$
test_creation 18.8450μs 2.4621μs 406.1571 KOps/s 399.7069 KOps/s $\color{#35bf28}+1.61\%$
test_creation_empty 33.2220μs 11.4621μs 87.2437 KOps/s 105.2938 KOps/s $\textbf{\color{#d91a1a}-17.14\%}$
test_creation_nested_1 70.6820μs 14.7831μs 67.6450 KOps/s 80.2342 KOps/s $\textbf{\color{#d91a1a}-15.69\%}$
test_creation_nested_2 53.9710μs 19.1349μs 52.2605 KOps/s 58.2061 KOps/s $\textbf{\color{#d91a1a}-10.21\%}$
test_clone 53.0390μs 13.6071μs 73.4908 KOps/s 73.9001 KOps/s $\color{#d91a1a}-0.55\%$
test_getitem[int] 1.3275ms 13.4932μs 74.1114 KOps/s 77.0488 KOps/s $\color{#d91a1a}-3.81\%$
test_getitem[slice_int] 0.1407ms 24.8894μs 40.1777 KOps/s 40.5283 KOps/s $\color{#d91a1a}-0.87\%$
test_getitem[range] 0.2965ms 51.6861μs 19.3475 KOps/s 19.8824 KOps/s $\color{#d91a1a}-2.69\%$
test_getitem[tuple] 0.1348ms 20.7408μs 48.2140 KOps/s 48.8710 KOps/s $\color{#d91a1a}-1.34\%$
test_getitem[list] 0.1587ms 43.7539μs 22.8551 KOps/s 22.1371 KOps/s $\color{#35bf28}+3.24\%$
test_setitem_dim[int] 50.0830μs 25.5671μs 39.1128 KOps/s 39.2149 KOps/s $\color{#d91a1a}-0.26\%$
test_setitem_dim[slice_int] 0.1043ms 51.9969μs 19.2319 KOps/s 18.9919 KOps/s $\color{#35bf28}+1.26\%$
test_setitem_dim[range] 0.1113ms 73.3763μs 13.6284 KOps/s 13.3456 KOps/s $\color{#35bf28}+2.12\%$
test_setitem_dim[tuple] 0.1005ms 40.5727μs 24.6471 KOps/s 24.4229 KOps/s $\color{#35bf28}+0.92\%$
test_setitem 64.0420μs 20.5280μs 48.7140 KOps/s 51.9164 KOps/s $\textbf{\color{#d91a1a}-6.17\%}$
test_set 73.4980μs 20.2955μs 49.2721 KOps/s 52.7755 KOps/s $\textbf{\color{#d91a1a}-6.64\%}$
test_set_shared 3.6174ms 0.1709ms 5.8518 KOps/s 5.8261 KOps/s $\color{#35bf28}+0.44\%$
test_update 0.1183ms 22.7648μs 43.9275 KOps/s 48.8001 KOps/s $\textbf{\color{#d91a1a}-9.98\%}$
test_update_nested 91.7210μs 33.2859μs 30.0427 KOps/s 32.5178 KOps/s $\textbf{\color{#d91a1a}-7.61\%}$
test_update__nested 0.3914ms 33.4133μs 29.9282 KOps/s 29.5916 KOps/s $\color{#35bf28}+1.14\%$
test_set_nested 68.0460μs 22.7687μs 43.9200 KOps/s 47.5976 KOps/s $\textbf{\color{#d91a1a}-7.73\%}$
test_set_nested_new 0.2242ms 27.2638μs 36.6786 KOps/s 38.8131 KOps/s $\textbf{\color{#d91a1a}-5.50\%}$
test_select 0.2311ms 44.2747μs 22.5863 KOps/s 24.1737 KOps/s $\textbf{\color{#d91a1a}-6.57\%}$
test_select_nested 0.1367ms 62.9610μs 15.8829 KOps/s 15.9212 KOps/s $\color{#d91a1a}-0.24\%$
test_exclude_nested 0.1657ms 82.9718μs 12.0523 KOps/s 12.0034 KOps/s $\color{#35bf28}+0.41\%$
test_empty[True] 0.8702ms 0.4174ms 2.3960 KOps/s 2.4103 KOps/s $\color{#d91a1a}-0.59\%$
test_empty[False] 11.8045μs 1.4400μs 694.4594 KOps/s 726.7631 KOps/s $\color{#d91a1a}-4.44\%$
test_unbind_speed 0.3656ms 0.2741ms 3.6482 KOps/s 3.6401 KOps/s $\color{#35bf28}+0.22\%$
test_unbind_speed_stack0 0.4130ms 0.2717ms 3.6802 KOps/s 3.8486 KOps/s $\color{#d91a1a}-4.38\%$
test_unbind_speed_stack1 94.5940ms 0.7962ms 1.2560 KOps/s 1.4135 KOps/s $\textbf{\color{#d91a1a}-11.14\%}$
test_split 1.8288ms 1.6099ms 621.1500 Ops/s 568.1169 Ops/s $\textbf{\color{#35bf28}+9.33\%}$
test_chunk 0.1001s 1.9206ms 520.6708 Ops/s 563.6132 Ops/s $\textbf{\color{#d91a1a}-7.62\%}$
test_consolidate_njt[False-None] 10.9857ms 8.0860ms 123.6701 Ops/s 122.9251 Ops/s $\color{#35bf28}+0.61\%$
test_creation[device0] 4.3612ms 90.9403μs 10.9962 KOps/s 11.0290 KOps/s $\color{#d91a1a}-0.30\%$
test_creation_from_tensor 0.2457ms 93.0437μs 10.7476 KOps/s 10.4908 KOps/s $\color{#35bf28}+2.45\%$
test_add_one[memmap_tensor0] 0.2130ms 4.8739μs 205.1761 KOps/s 199.8058 KOps/s $\color{#35bf28}+2.69\%$
test_contiguous[memmap_tensor0] 16.6800μs 0.5189μs 1.9273 MOps/s 1.9193 MOps/s $\color{#35bf28}+0.42\%$
test_stack[memmap_tensor0] 36.9690μs 3.3150μs 301.6564 KOps/s 298.9510 KOps/s $\color{#35bf28}+0.90\%$
test_memmaptd_index 1.0286ms 0.2406ms 4.1569 KOps/s 4.1276 KOps/s $\color{#35bf28}+0.71\%$
test_memmaptd_index_astensor 0.7358ms 0.3264ms 3.0633 KOps/s 3.0504 KOps/s $\color{#35bf28}+0.42\%$
test_memmaptd_index_op 0.9655ms 0.5897ms 1.6959 KOps/s 1.7638 KOps/s $\color{#d91a1a}-3.85\%$
test_serialize_model 0.1268s 0.1174s 8.5153 Ops/s 8.7844 Ops/s $\color{#d91a1a}-3.06\%$
test_serialize_model_pickle 0.4456s 0.3877s 2.5793 Ops/s 2.5745 Ops/s $\color{#35bf28}+0.18\%$
test_serialize_weights 0.1227s 0.1128s 8.8641 Ops/s 8.7842 Ops/s $\color{#35bf28}+0.91\%$
test_serialize_weights_returnearly 0.2417s 0.1698s 5.8901 Ops/s 6.4549 Ops/s $\textbf{\color{#d91a1a}-8.75\%}$
test_serialize_weights_pickle 0.5569s 0.4408s 2.2686 Ops/s 2.5402 Ops/s $\textbf{\color{#d91a1a}-10.69\%}$
test_serialize_weights_filesystem 0.1459s 0.1393s 7.1791 Ops/s 7.1087 Ops/s $\color{#35bf28}+0.99\%$
test_serialize_model_filesystem 0.1586s 0.1439s 6.9474 Ops/s 6.1430 Ops/s $\textbf{\color{#35bf28}+13.10\%}$
test_reshape_pytree 68.5680μs 26.9159μs 37.1527 KOps/s 37.5079 KOps/s $\color{#d91a1a}-0.95\%$
test_reshape_td 89.1170μs 34.3163μs 29.1407 KOps/s 29.6401 KOps/s $\color{#d91a1a}-1.68\%$
test_view_pytree 74.0980μs 26.4275μs 37.8394 KOps/s 37.5182 KOps/s $\color{#35bf28}+0.86\%$
test_view_td 0.1465ms 38.5744μs 25.9239 KOps/s 25.9836 KOps/s $\color{#d91a1a}-0.23\%$
test_unbind_pytree 65.3620μs 30.4185μs 32.8747 KOps/s 33.8781 KOps/s $\color{#d91a1a}-2.96\%$
test_unbind_td 0.3778ms 40.6486μs 24.6011 KOps/s 25.0544 KOps/s $\color{#d91a1a}-1.81\%$
test_split_pytree 75.6910μs 30.3655μs 32.9322 KOps/s 34.0014 KOps/s $\color{#d91a1a}-3.14\%$
test_split_td 0.5444ms 46.0398μs 21.7203 KOps/s 21.8153 KOps/s $\color{#d91a1a}-0.44\%$
test_add_pytree 78.4470μs 36.2243μs 27.6058 KOps/s 27.5301 KOps/s $\color{#35bf28}+0.27\%$
test_add_td 0.1335ms 61.7553μs 16.1930 KOps/s 17.8086 KOps/s $\textbf{\color{#d91a1a}-9.07\%}$
test_compile_add_one_nested[tensordict-compile] 0.1476ms 61.9405μs 16.1445 KOps/s 15.8624 KOps/s $\color{#35bf28}+1.78\%$
test_compile_add_one_nested[tensordict-eager] 0.3717ms 0.1718ms 5.8223 KOps/s 5.9132 KOps/s $\color{#d91a1a}-1.54\%$
test_compile_add_one_nested[pytree-compile] 0.1170ms 45.1431μs 22.1518 KOps/s 21.5362 KOps/s $\color{#35bf28}+2.86\%$
test_compile_add_one_nested[pytree-eager] 0.2930ms 0.1189ms 8.4118 KOps/s 8.2629 KOps/s $\color{#35bf28}+1.80\%$
test_compile_copy_nested[tensordict-compile] 76.7130μs 26.2547μs 38.0884 KOps/s 38.6363 KOps/s $\color{#d91a1a}-1.42\%$
test_compile_copy_nested[tensordict-eager] 0.1211ms 59.9486μs 16.6810 KOps/s 16.8736 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_copy_nested[pytree-compile] 0.1557ms 79.7130μs 12.5450 KOps/s 12.6517 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_copy_nested[pytree-eager] 0.1226ms 69.0017μs 14.4924 KOps/s 14.8130 KOps/s $\color{#d91a1a}-2.16\%$
test_compile_add_one_flat[tensordict-compile] 0.1758ms 0.1034ms 9.6690 KOps/s 9.4078 KOps/s $\color{#35bf28}+2.78\%$
test_compile_add_one_flat[tensordict-eager] 1.3129ms 0.2146ms 4.6600 KOps/s 4.6403 KOps/s $\color{#35bf28}+0.42\%$
test_compile_add_one_flat[tensorclass-compile] 88.1050μs 44.1643μs 22.6427 KOps/s 21.5324 KOps/s $\textbf{\color{#35bf28}+5.16\%}$
test_compile_add_one_flat[tensorclass-eager] 1.2316ms 64.7078μs 15.4541 KOps/s 15.4900 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_add_one_flat[pytree-compile] 0.2027ms 0.1022ms 9.7830 KOps/s 9.4850 KOps/s $\color{#35bf28}+3.14\%$
test_compile_add_one_flat[pytree-eager] 0.2725ms 0.2021ms 4.9482 KOps/s 4.9513 KOps/s $\color{#d91a1a}-0.06\%$
test_compile_add_self_flat[tensordict-eager] 0.4238ms 0.2330ms 4.2916 KOps/s 4.2449 KOps/s $\color{#35bf28}+1.10\%$
test_compile_add_self_flat[tensordict-compile] 0.2041ms 0.1043ms 9.5884 KOps/s 9.4507 KOps/s $\color{#35bf28}+1.46\%$
test_compile_add_self_flat[tensorclass-eager] 0.1327ms 59.2030μs 16.8910 KOps/s 17.0595 KOps/s $\color{#d91a1a}-0.99\%$
test_compile_add_self_flat[tensorclass-compile] 0.1268ms 45.2034μs 22.1222 KOps/s 21.5027 KOps/s $\color{#35bf28}+2.88\%$
test_compile_add_self_flat[pytree-eager] 0.6174ms 0.1588ms 6.2973 KOps/s 6.2865 KOps/s $\color{#35bf28}+0.17\%$
test_compile_add_self_flat[pytree-compile] 0.2196ms 0.1036ms 9.6565 KOps/s 9.5979 KOps/s $\color{#35bf28}+0.61\%$
test_compile_copy_flat[tensordict-compile] 69.6100μs 20.3752μs 49.0792 KOps/s 45.0692 KOps/s $\textbf{\color{#35bf28}+8.90\%}$
test_compile_copy_flat[tensordict-eager] 0.1633ms 65.8349μs 15.1895 KOps/s 15.1337 KOps/s $\color{#35bf28}+0.37\%$
test_compile_copy_flat[pytree-compile] 0.1824ms 82.7556μs 12.0838 KOps/s 12.4132 KOps/s $\color{#d91a1a}-2.65\%$
test_compile_copy_flat[pytree-eager] 0.1341ms 70.0071μs 14.2843 KOps/s 14.5070 KOps/s $\color{#d91a1a}-1.54\%$
test_compile_assign_and_add[tensordict-compile] 0.3294ms 0.2078ms 4.8135 KOps/s 4.8645 KOps/s $\color{#d91a1a}-1.05\%$
test_compile_assign_and_add[tensordict-eager] 1.5501ms 1.3437ms 744.2195 Ops/s 770.3418 Ops/s $\color{#d91a1a}-3.39\%$
test_compile_assign_and_add[pytree-compile] 0.2942ms 0.2047ms 4.8842 KOps/s 5.0185 KOps/s $\color{#d91a1a}-2.68\%$
test_compile_assign_and_add[pytree-eager] 1.5833ms 0.7766ms 1.2877 KOps/s 1.3019 KOps/s $\color{#d91a1a}-1.09\%$
test_compile_assign_and_add_stack[compile] 0.7769ms 0.4615ms 2.1669 KOps/s 2.2370 KOps/s $\color{#d91a1a}-3.14\%$
test_compile_assign_and_add_stack[eager] 2.9524ms 2.6529ms 376.9470 Ops/s 391.6455 Ops/s $\color{#d91a1a}-3.75\%$
test_compile_indexing[tensor-tensordict-compile] 0.1045ms 35.2091μs 28.4018 KOps/s 27.1550 KOps/s $\color{#35bf28}+4.59\%$
test_compile_indexing[tensor-tensordict-eager] 0.5394ms 33.6464μs 29.7208 KOps/s 29.6036 KOps/s $\color{#35bf28}+0.40\%$
test_compile_indexing[tensor-tensorclass-compile] 80.5900μs 28.6116μs 34.9509 KOps/s 33.5144 KOps/s $\color{#35bf28}+4.29\%$
test_compile_indexing[tensor-tensorclass-eager] 70.0010μs 23.3011μs 42.9164 KOps/s 41.8707 KOps/s $\color{#35bf28}+2.50\%$
test_compile_indexing[tensor-pytree-compile] 93.5940μs 29.3688μs 34.0497 KOps/s 32.6132 KOps/s $\color{#35bf28}+4.40\%$
test_compile_indexing[tensor-pytree-eager] 87.2730μs 23.3267μs 42.8693 KOps/s 42.6801 KOps/s $\color{#35bf28}+0.44\%$
test_compile_indexing[slice-tensordict-compile] 0.1142ms 51.3852μs 19.4609 KOps/s 19.3500 KOps/s $\color{#35bf28}+0.57\%$
test_compile_indexing[slice-tensordict-eager] 0.5945ms 20.7605μs 48.1685 KOps/s 49.1646 KOps/s $\color{#d91a1a}-2.03\%$
test_compile_indexing[slice-tensorclass-compile] 0.1140ms 44.0273μs 22.7132 KOps/s 22.6394 KOps/s $\color{#35bf28}+0.33\%$
test_compile_indexing[slice-tensorclass-eager] 77.7350μs 19.0036μs 52.6216 KOps/s 52.8946 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_indexing[slice-pytree-compile] 0.1024ms 45.0441μs 22.2004 KOps/s 22.1046 KOps/s $\color{#35bf28}+0.43\%$
test_compile_indexing[slice-pytree-eager] 57.4370μs 19.0646μs 52.4534 KOps/s 52.3673 KOps/s $\color{#35bf28}+0.16\%$
test_compile_indexing[int-tensordict-compile] 0.1434ms 52.6648μs 18.9880 KOps/s 19.0652 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_indexing[int-tensordict-eager] 1.0450ms 20.6609μs 48.4006 KOps/s 50.1287 KOps/s $\color{#d91a1a}-3.45\%$
test_compile_indexing[int-tensorclass-compile] 86.5320μs 45.1027μs 22.1716 KOps/s 22.1419 KOps/s $\color{#35bf28}+0.13\%$
test_compile_indexing[int-tensorclass-eager] 60.4630μs 18.9339μs 52.8153 KOps/s 53.5839 KOps/s $\color{#d91a1a}-1.43\%$
test_compile_indexing[int-pytree-compile] 97.8640μs 44.9851μs 22.2296 KOps/s 22.2901 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_indexing[int-pytree-eager] 0.5403ms 18.8816μs 52.9616 KOps/s 52.8868 KOps/s $\color{#35bf28}+0.14\%$
test_mod_add[eager] 87.8550μs 34.2987μs 29.1556 KOps/s 30.4331 KOps/s $\color{#d91a1a}-4.20\%$
test_mod_add[compile] 0.1234ms 46.4588μs 21.5245 KOps/s 20.9593 KOps/s $\color{#35bf28}+2.70\%$
test_mod_add[compile-overhead] 0.1663ms 46.4414μs 21.5325 KOps/s 21.2842 KOps/s $\color{#35bf28}+1.17\%$
test_mod_wrap[eager] 0.3637ms 0.2203ms 4.5392 KOps/s 4.5772 KOps/s $\color{#d91a1a}-0.83\%$
test_mod_wrap[compile] 0.4396ms 0.2032ms 4.9204 KOps/s 4.8460 KOps/s $\color{#35bf28}+1.54\%$
test_mod_wrap[compile-overhead] 0.3071ms 0.2002ms 4.9953 KOps/s 4.8922 KOps/s $\color{#35bf28}+2.11\%$
test_mod_wrap_and_backward[eager] 11.7884ms 10.7196ms 93.2867 Ops/s 88.5406 Ops/s $\textbf{\color{#35bf28}+5.36\%}$
test_mod_wrap_and_backward[compile] 12.6274ms 10.8309ms 92.3282 Ops/s 84.9819 Ops/s $\textbf{\color{#35bf28}+8.64\%}$
test_mod_wrap_and_backward[compile-overhead] 12.4927ms 10.8493ms 92.1715 Ops/s 81.8080 Ops/s $\textbf{\color{#35bf28}+12.67\%}$
test_seq_add[eager] 0.2345ms 0.1149ms 8.7003 KOps/s 8.7608 KOps/s $\color{#d91a1a}-0.69\%$
test_seq_add[compile] 0.1251ms 61.8769μs 16.1611 KOps/s 16.2269 KOps/s $\color{#d91a1a}-0.41\%$
test_seq_add[compile-overhead] 0.1174ms 58.9674μs 16.9585 KOps/s 16.4749 KOps/s $\color{#35bf28}+2.94\%$
test_seq_wrap[eager] 0.7334ms 0.4445ms 2.2496 KOps/s 2.2944 KOps/s $\color{#d91a1a}-1.95\%$
test_seq_wrap[compile] 0.3326ms 0.2228ms 4.4889 KOps/s 4.3984 KOps/s $\color{#35bf28}+2.06\%$
test_seq_wrap[compile-overhead] 0.3342ms 0.2240ms 4.4634 KOps/s 4.2627 KOps/s $\color{#35bf28}+4.71\%$
test_func_call_runtime[False-eager] 0.9148ms 0.5489ms 1.8217 KOps/s 1.8665 KOps/s $\color{#d91a1a}-2.40\%$
test_func_call_runtime[False-compile] 0.5343ms 0.4206ms 2.3776 KOps/s 2.3602 KOps/s $\color{#35bf28}+0.74\%$
test_func_call_runtime[False-compile-overhead] 0.5778ms 0.4201ms 2.3805 KOps/s 2.3438 KOps/s $\color{#35bf28}+1.57\%$
test_func_call_runtime[True-eager] 0.9079ms 0.7545ms 1.3254 KOps/s 1.3284 KOps/s $\color{#d91a1a}-0.23\%$
test_func_call_runtime[True-compile] 0.5675ms 0.4574ms 2.1863 KOps/s 2.1474 KOps/s $\color{#35bf28}+1.81\%$
test_func_call_runtime[True-compile-overhead] 0.8889ms 0.4652ms 2.1498 KOps/s 2.1435 KOps/s $\color{#35bf28}+0.30\%$
test_func_call_cm_runtime[False-eager] 0.7744ms 0.5514ms 1.8137 KOps/s 1.8573 KOps/s $\color{#d91a1a}-2.35\%$
test_func_call_cm_runtime[False-compile] 0.7852ms 0.4212ms 2.3739 KOps/s 2.3619 KOps/s $\color{#35bf28}+0.51\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5468ms 0.4207ms 2.3772 KOps/s 2.3646 KOps/s $\color{#35bf28}+0.53\%$
test_func_call_cm_runtime[True-eager] 1.0047ms 0.9023ms 1.1082 KOps/s 1.1073 KOps/s $\color{#35bf28}+0.09\%$
test_func_call_cm_runtime[True-compile] 0.6519ms 0.4871ms 2.0532 KOps/s 2.0166 KOps/s $\color{#35bf28}+1.81\%$
test_func_call_cm_runtime[True-compile-overhead] 0.9806ms 0.4881ms 2.0490 KOps/s 2.0353 KOps/s $\color{#35bf28}+0.67\%$
test_vmap_func_call_cm_runtime[eager] 2.6827ms 1.9559ms 511.2685 Ops/s 522.0183 Ops/s $\color{#d91a1a}-2.06\%$
test_vmap_func_call_cm_runtime[compile] 0.6298ms 0.5193ms 1.9257 KOps/s 1.9286 KOps/s $\color{#d91a1a}-0.15\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8998ms 0.5260ms 1.9013 KOps/s 1.9221 KOps/s $\color{#d91a1a}-1.08\%$
test_distributed 0.2568ms 0.1214ms 8.2396 KOps/s 7.8801 KOps/s $\color{#35bf28}+4.56\%$
test_tdmodule 82.5240μs 25.7350μs 38.8576 KOps/s 40.5513 KOps/s $\color{#d91a1a}-4.18\%$
test_tdmodule_dispatch 74.7000μs 46.6417μs 21.4400 KOps/s 22.2458 KOps/s $\color{#d91a1a}-3.62\%$
test_tdseq 53.9100μs 29.3698μs 34.0486 KOps/s 36.2995 KOps/s $\textbf{\color{#d91a1a}-6.20\%}$
test_tdseq_dispatch 87.4230μs 54.8256μs 18.2397 KOps/s 19.4561 KOps/s $\textbf{\color{#d91a1a}-6.25\%}$
test_instantiation_functorch 2.5300ms 1.5410ms 648.9420 Ops/s 648.8297 Ops/s $\color{#35bf28}+0.02\%$
test_exec_functorch 0.3384ms 0.1778ms 5.6232 KOps/s 5.5926 KOps/s $\color{#35bf28}+0.55\%$
test_exec_functional_call 0.4147ms 0.1689ms 5.9222 KOps/s 5.7700 KOps/s $\color{#35bf28}+2.64\%$
test_exec_td_decorator 0.4348ms 0.2289ms 4.3691 KOps/s 4.3297 KOps/s $\color{#35bf28}+0.91\%$
test_vmap_mlp_speed_decorator[True-True] 0.8889ms 0.6619ms 1.5108 KOps/s 1.5423 KOps/s $\color{#d91a1a}-2.04\%$
test_vmap_mlp_speed_decorator[True-False] 1.0403ms 0.6607ms 1.5136 KOps/s 1.4941 KOps/s $\color{#35bf28}+1.30\%$
test_vmap_mlp_speed_decorator[False-True] 0.7829ms 0.5356ms 1.8672 KOps/s 1.8866 KOps/s $\color{#d91a1a}-1.03\%$
test_vmap_mlp_speed_decorator[False-False] 0.7668ms 0.5367ms 1.8632 KOps/s 1.8808 KOps/s $\color{#d91a1a}-0.94\%$
test_to_module_speed[True] 1.7139ms 1.3512ms 740.0633 Ops/s 746.6755 Ops/s $\color{#d91a1a}-0.89\%$
test_to_module_speed[False] 1.8828ms 1.3127ms 761.7984 Ops/s 758.8019 Ops/s $\color{#35bf28}+0.39\%$
test_tc_init 99.4260μs 48.7439μs 20.5154 KOps/s 22.6061 KOps/s $\textbf{\color{#d91a1a}-9.25\%}$
test_tc_init_nested 0.1817ms 97.2907μs 10.2785 KOps/s 11.2608 KOps/s $\textbf{\color{#d91a1a}-8.72\%}$
test_tc_first_layer_tensor 21.3490μs 1.5729μs 635.7675 KOps/s 671.8665 KOps/s $\textbf{\color{#d91a1a}-5.37\%}$
test_tc_first_layer_nontensor 25.2580μs 4.8544μs 205.9994 KOps/s 214.6420 KOps/s $\color{#d91a1a}-4.03\%$
test_tc_second_layer_tensor 38.6020μs 2.9092μs 343.7385 KOps/s 368.1902 KOps/s $\textbf{\color{#d91a1a}-6.64\%}$
test_tc_second_layer_nontensor 26.9000μs 6.2388μs 160.2867 KOps/s 166.9336 KOps/s $\color{#d91a1a}-3.98\%$
test_unbind 0.2168s 13.1147ms 76.2500 Ops/s 79.0726 Ops/s $\color{#d91a1a}-3.57\%$
test_full_like 9.0714ms 7.0981ms 140.8831 Ops/s 142.8273 Ops/s $\color{#d91a1a}-1.36\%$
test_zeros_like 3.1526ms 2.7295ms 366.3610 Ops/s 142.6210 Ops/s $\textbf{\color{#35bf28}+156.88\%}$
test_ones_like 3.9064ms 3.2075ms 311.7670 Ops/s 134.0009 Ops/s $\textbf{\color{#35bf28}+132.66\%}$
test_clone 5.4503ms 4.9329ms 202.7202 Ops/s 108.4997 Ops/s $\textbf{\color{#35bf28}+86.84\%}$
test_squeeze 58.9900μs 11.9759μs 83.5008 KOps/s 83.2202 KOps/s $\color{#35bf28}+0.34\%$
test_unsqueeze 0.2816ms 92.2472μs 10.8404 KOps/s 10.8424 KOps/s $\color{#d91a1a}-0.02\%$
test_split 0.3960ms 0.1971ms 5.0739 KOps/s 5.0950 KOps/s $\color{#d91a1a}-0.41\%$
test_permute 0.2788ms 0.2027ms 4.9339 KOps/s 4.7464 KOps/s $\color{#35bf28}+3.95\%$
test_stack 29.6767ms 25.3196ms 39.4951 Ops/s 42.6734 Ops/s $\textbf{\color{#d91a1a}-7.45\%}$
test_cat 27.3773ms 24.8880ms 40.1801 Ops/s 42.6412 Ops/s $\textbf{\color{#d91a1a}-5.77\%}$
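
For readers checking the table above, the Change column is consistent with the relative difference between the Ops column and the Ops on Repo HEAD column. The sketch below reproduces a few rows under that assumption. The function names and the 5% cutoff are not taken from the benchmark bot: the cutoff is inferred from which rows are rendered in bold, and it also happens to match the Improved and Worsened totals in the summary line.

```python
def percent_change(ops_pr: float, ops_head: float) -> float:
    """Relative throughput change of the PR against repo HEAD, in percent.

    Both arguments must use the same unit (for example, both in KOps/s).
    """
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a row. The 5% threshold is an assumption inferred from the bold rows."""
    if change_pct >= threshold_pct:
        return "improved"
    if change_pct <= -threshold_pct:
        return "worsened"
    return "within noise"


# Rows copied from the CPU table: (name, Ops, Ops on Repo HEAD), all in KOps/s.
rows = [
    ("test_plain_set_nested", 47.8837, 51.4393),
    ("test_items", 237.4510, 245.3104),
    ("test_membership_stacked_nested_last", 225.3626, 74.5823),
]

for name, ops_pr, ops_head in rows:
    change = percent_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Under this reading, small negative entries such as `test_items` fall inside the noise band and do not count toward the Worsened total.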

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}24$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.3939ms 13.3656μs 74.8188 KOps/s 78.8742 KOps/s $\textbf{\color{#d91a1a}-5.14\%}$
test_plain_set_stack_nested 43.1610μs 13.3909μs 74.6776 KOps/s 77.6048 KOps/s $\color{#d91a1a}-3.77\%$
test_plain_set_nested_inplace 0.3974ms 14.4055μs 69.4179 KOps/s 71.7484 KOps/s $\color{#d91a1a}-3.25\%$
test_plain_set_stack_nested_inplace 48.2000μs 14.4714μs 69.1016 KOps/s 71.9805 KOps/s $\color{#d91a1a}-4.00\%$
test_items 0.3856ms 2.8976μs 345.1132 KOps/s 342.8856 KOps/s $\color{#35bf28}+0.65\%$
test_items_nested 0.7363ms 0.3549ms 2.8180 KOps/s 2.8216 KOps/s $\color{#d91a1a}-0.13\%$
test_items_nested_locked 0.7377ms 0.3584ms 2.7898 KOps/s 2.8020 KOps/s $\color{#d91a1a}-0.43\%$
test_items_nested_leaf 85.0120μs 58.6758μs 17.0428 KOps/s 17.2407 KOps/s $\color{#d91a1a}-1.15\%$
test_items_stack_nested 0.7480ms 0.3590ms 2.7852 KOps/s 2.7767 KOps/s $\color{#35bf28}+0.31\%$
test_items_stack_nested_leaf 0.4473ms 59.3778μs 16.8413 KOps/s 16.8276 KOps/s $\color{#35bf28}+0.08\%$
test_items_stack_nested_locked 0.7323ms 0.3595ms 2.7820 KOps/s 2.7926 KOps/s $\color{#d91a1a}-0.38\%$
test_keys 0.3873ms 3.4403μs 290.6718 KOps/s 289.7814 KOps/s $\color{#35bf28}+0.31\%$
test_keys_nested 0.4542ms 80.7142μs 12.3894 KOps/s 12.2588 KOps/s $\color{#35bf28}+1.07\%$
test_keys_nested_locked 0.7901ms 87.0691μs 11.4851 KOps/s 11.4485 KOps/s $\color{#35bf28}+0.32\%$
test_keys_nested_leaf 0.4520ms 72.4407μs 13.8044 KOps/s 13.9510 KOps/s $\color{#d91a1a}-1.05\%$
test_keys_stack_nested 0.4546ms 82.0864μs 12.1823 KOps/s 11.9763 KOps/s $\color{#35bf28}+1.72\%$
test_keys_stack_nested_leaf 0.4450ms 72.6642μs 13.7619 KOps/s 13.5470 KOps/s $\color{#35bf28}+1.59\%$
test_keys_stack_nested_locked 0.4639ms 87.0890μs 11.4825 KOps/s 11.2272 KOps/s $\color{#35bf28}+2.27\%$
test_values 62.9560μs 0.8527μs 1.1728 MOps/s 1.1690 MOps/s $\color{#35bf28}+0.32\%$
test_values_nested 0.4123ms 34.8724μs 28.6760 KOps/s 28.9965 KOps/s $\color{#d91a1a}-1.11\%$
test_values_nested_locked 0.4137ms 36.3029μs 27.5460 KOps/s 27.6289 KOps/s $\color{#d91a1a}-0.30\%$
test_values_nested_leaf 64.6810μs 39.6816μs 25.2006 KOps/s 25.6536 KOps/s $\color{#d91a1a}-1.77\%$
test_values_stack_nested 0.4391ms 34.9217μs 28.6355 KOps/s 28.6253 KOps/s $\color{#35bf28}+0.04\%$
test_values_stack_nested_leaf 0.4140ms 39.9701μs 25.0187 KOps/s 25.3061 KOps/s $\color{#d91a1a}-1.14\%$
test_values_stack_nested_locked 0.4130ms 36.6897μs 27.2556 KOps/s 27.0163 KOps/s $\color{#35bf28}+0.89\%$
test_membership 19.1884μs 0.5110μs 1.9570 MOps/s 1.9845 MOps/s $\color{#d91a1a}-1.39\%$
test_membership_nested 0.1905ms 2.0964μs 477.0088 KOps/s 497.0160 KOps/s $\color{#d91a1a}-4.03\%$
test_membership_nested_leaf 0.1917ms 2.1002μs 476.1540 KOps/s 514.6824 KOps/s $\textbf{\color{#d91a1a}-7.49\%}$
test_membership_stacked_nested 28.6810μs 2.1911μs 456.3823 KOps/s 489.1851 KOps/s $\textbf{\color{#d91a1a}-6.71\%}$
test_membership_stacked_nested_leaf 25.3210μs 2.1952μs 455.5355 KOps/s 495.2048 KOps/s $\textbf{\color{#d91a1a}-8.01\%}$
test_membership_nested_last 0.3925ms 3.1962μs 312.8706 KOps/s 328.9318 KOps/s $\color{#d91a1a}-4.88\%$
test_membership_nested_leaf_last 27.8500μs 3.1989μs 312.6115 KOps/s 331.4205 KOps/s $\textbf{\color{#d91a1a}-5.68\%}$
test_membership_stacked_nested_last 0.3965ms 3.2088μs 311.6462 KOps/s 327.8174 KOps/s $\color{#d91a1a}-4.93\%$
test_membership_stacked_nested_leaf_last 27.6010μs 3.1569μs 316.7696 KOps/s 323.1960 KOps/s $\color{#d91a1a}-1.99\%$
test_nested_getleaf 0.3901ms 6.1719μs 162.0245 KOps/s 162.3205 KOps/s $\color{#d91a1a}-0.18\%$
test_nested_get 31.1410μs 5.8018μs 172.3593 KOps/s 172.9459 KOps/s $\color{#d91a1a}-0.34\%$
test_stacked_getleaf 0.3888ms 6.2062μs 161.1297 KOps/s 162.3034 KOps/s $\color{#d91a1a}-0.72\%$
test_stacked_get 35.8910μs 5.8272μs 171.6103 KOps/s 171.4799 KOps/s $\color{#35bf28}+0.08\%$
test_nested_getitemleaf 0.3968ms 6.2081μs 161.0811 KOps/s 160.8207 KOps/s $\color{#35bf28}+0.16\%$
test_nested_getitem 0.3926ms 5.8712μs 170.3227 KOps/s 168.2525 KOps/s $\color{#35bf28}+1.23\%$
test_stacked_getitemleaf 27.4200μs 6.1900μs 161.5504 KOps/s 160.2474 KOps/s $\color{#35bf28}+0.81\%$
test_stacked_getitem 30.8900μs 5.8976μs 169.5609 KOps/s 169.8012 KOps/s $\color{#d91a1a}-0.14\%$
test_lock_nested 0.8276ms 0.3726ms 2.6837 KOps/s 2.6378 KOps/s $\color{#35bf28}+1.74\%$
test_lock_stack_nested 0.3867ms 0.3422ms 2.9224 KOps/s 2.9202 KOps/s $\color{#35bf28}+0.07\%$
test_unlock_nested 0.6474ms 0.3121ms 3.2037 KOps/s 3.2371 KOps/s $\color{#d91a1a}-1.03\%$
test_unlock_stack_nested 0.3863ms 0.2774ms 3.6044 KOps/s 3.5687 KOps/s $\color{#35bf28}+1.00\%$
test_flatten_speed 0.1224ms 76.0870μs 13.1428 KOps/s 13.2687 KOps/s $\color{#d91a1a}-0.95\%$
test_unflatten_speed 0.4551ms 0.3242ms 3.0849 KOps/s 3.1588 KOps/s $\color{#d91a1a}-2.34\%$
test_common_ops 1.6890ms 0.6350ms 1.5748 KOps/s 1.6235 KOps/s $\color{#d91a1a}-3.00\%$
test_creation 0.1136ms 1.6897μs 591.8221 KOps/s 568.2145 KOps/s $\color{#35bf28}+4.15\%$
test_creation_empty 33.9600μs 10.3061μs 97.0300 KOps/s 108.2153 KOps/s $\textbf{\color{#d91a1a}-10.34\%}$
test_creation_nested_1 0.3856ms 11.8595μs 84.3207 KOps/s 91.2945 KOps/s $\textbf{\color{#d91a1a}-7.64\%}$
test_creation_nested_2 37.7710μs 14.7295μs 67.8909 KOps/s 72.4448 KOps/s $\textbf{\color{#d91a1a}-6.29\%}$
test_clone 0.3942ms 10.1828μs 98.2049 KOps/s 97.2449 KOps/s $\color{#35bf28}+0.99\%$
test_getitem[int] 1.9372ms 10.4501μs 95.6927 KOps/s 95.1307 KOps/s $\color{#35bf28}+0.59\%$
test_getitem[slice_int] 0.1069ms 20.2703μs 49.3332 KOps/s 48.3911 KOps/s $\color{#35bf28}+1.95\%$
test_getitem[range] 0.1250ms 34.8985μs 28.6545 KOps/s 28.1903 KOps/s $\color{#35bf28}+1.65\%$
test_getitem[tuple] 0.3955ms 17.6150μs 56.7697 KOps/s 55.8126 KOps/s $\color{#35bf28}+1.71\%$
test_getitem[list] 0.2695ms 30.9403μs 32.3203 KOps/s 31.4715 KOps/s $\color{#35bf28}+2.70\%$
test_setitem_dim[int] 46.4110μs 17.8498μs 56.0229 KOps/s 56.1249 KOps/s $\color{#d91a1a}-0.18\%$
test_setitem_dim[slice_int] 66.2010μs 36.9557μs 27.0594 KOps/s 27.0922 KOps/s $\color{#d91a1a}-0.12\%$
test_setitem_dim[range] 90.4320μs 49.1437μs 20.3485 KOps/s 19.8782 KOps/s $\color{#35bf28}+2.37\%$
test_setitem_dim[tuple] 51.9210μs 30.1657μs 33.1502 KOps/s 32.0972 KOps/s $\color{#35bf28}+3.28\%$
test_setitem 0.4061ms 15.6973μs 63.7050 KOps/s 66.9879 KOps/s $\color{#d91a1a}-4.90\%$
test_set 0.1030ms 14.9403μs 66.9329 KOps/s 67.6557 KOps/s $\color{#d91a1a}-1.07\%$
test_set_shared 1.4934ms 0.1480ms 6.7584 KOps/s 6.7811 KOps/s $\color{#d91a1a}-0.33\%$
test_update 0.3553ms 19.1578μs 52.1981 KOps/s 55.4493 KOps/s $\textbf{\color{#d91a1a}-5.86\%}$
test_update_nested 0.4261ms 25.2120μs 39.6637 KOps/s 42.7054 KOps/s $\textbf{\color{#d91a1a}-7.12\%}$
test_update__nested 0.2846ms 24.6285μs 40.6034 KOps/s 40.6977 KOps/s $\color{#d91a1a}-0.23\%$
test_set_nested 0.4038ms 16.5811μs 60.3096 KOps/s 62.9080 KOps/s $\color{#d91a1a}-4.13\%$
test_set_nested_new 90.6810μs 18.8066μs 53.1729 KOps/s 54.3290 KOps/s $\color{#d91a1a}-2.13\%$
test_select 0.4347ms 31.3178μs 31.9307 KOps/s 33.2710 KOps/s $\color{#d91a1a}-4.03\%$
test_select_nested 0.1015ms 42.2987μs 23.6414 KOps/s 23.0799 KOps/s $\color{#35bf28}+2.43\%$
test_exclude_nested 0.1382ms 61.2087μs 16.3375 KOps/s 16.2737 KOps/s $\color{#35bf28}+0.39\%$
test_empty[True] 0.4046ms 0.2861ms 3.4955 KOps/s 3.5108 KOps/s $\color{#d91a1a}-0.43\%$
test_empty[False] 3.5010μs 0.8248μs 1.2125 MOps/s 1.2235 MOps/s $\color{#d91a1a}-0.90\%$
test_to 87.1120μs 55.3435μs 18.0690 KOps/s 17.8068 KOps/s $\color{#35bf28}+1.47\%$
test_to_nonblocking 89.8010μs 46.9334μs 21.3068 KOps/s 21.1336 KOps/s $\color{#35bf28}+0.82\%$
test_unbind_speed 1.7026ms 0.2350ms 4.2559 KOps/s 4.2582 KOps/s $\color{#d91a1a}-0.05\%$
test_unbind_speed_stack0 0.3275ms 0.2338ms 4.2775 KOps/s 4.2618 KOps/s $\color{#35bf28}+0.37\%$
test_unbind_speed_stack1 94.3405ms 0.6658ms 1.5019 KOps/s 1.4946 KOps/s $\color{#35bf28}+0.49\%$
test_split 95.7939ms 1.7090ms 585.1417 Ops/s 637.6674 Ops/s $\textbf{\color{#d91a1a}-8.24\%}$
test_chunk 1.5329ms 1.4257ms 701.3921 Ops/s 581.6947 Ops/s $\textbf{\color{#35bf28}+20.58\%}$
test_consolidate[False-None] 98.2467ms 2.9006ms 344.7504 Ops/s 380.7737 Ops/s $\textbf{\color{#d91a1a}-9.46\%}$
test_consolidate[default-None] 1.6872ms 1.6259ms 615.0586 Ops/s 612.3963 Ops/s $\color{#35bf28}+0.43\%$
test_consolidate[reduce-overhead-None] 1.7848ms 1.7164ms 582.6126 Ops/s 597.9262 Ops/s $\color{#d91a1a}-2.56\%$
test_consolidate_njt[False-None] 6.5359ms 6.2948ms 158.8612 Ops/s 156.1006 Ops/s $\color{#35bf28}+1.77\%$
test_to[False-False-None] 1.8402ms 1.7324ms 577.2377 Ops/s 584.5831 Ops/s $\color{#d91a1a}-1.26\%$
test_to[True-False-None] 1.5040ms 1.2775ms 782.7596 Ops/s 772.2573 Ops/s $\color{#35bf28}+1.36\%$
test_to[within-False-None] 4.2416ms 4.0455ms 247.1876 Ops/s 250.1933 Ops/s $\color{#d91a1a}-1.20\%$
test_to[True-default-None] 5.4517ms 5.2025ms 192.2161 Ops/s 191.1417 Ops/s $\color{#35bf28}+0.56\%$
test_to_njt[False-False-None] 6.9315ms 6.7374ms 148.4251 Ops/s 144.3035 Ops/s $\color{#35bf28}+2.86\%$
test_to_njt[True-False-None] 5.5771ms 5.3004ms 188.6657 Ops/s 188.2343 Ops/s $\color{#35bf28}+0.23\%$
test_to_njt[within-False-None] 12.0260ms 11.8829ms 84.1545 Ops/s 85.4667 Ops/s $\color{#d91a1a}-1.54\%$
test_creation[device0] 0.4584ms 77.3193μs 12.9334 KOps/s 12.2090 KOps/s $\textbf{\color{#35bf28}+5.93\%}$
test_creation_from_tensor 0.4459ms 81.4529μs 12.2770 KOps/s 11.9703 KOps/s $\color{#35bf28}+2.56\%$
test_add_one[memmap_tensor0] 0.4478ms 7.0272μs 142.3047 KOps/s 155.3833 KOps/s $\textbf{\color{#d91a1a}-8.42\%}$
test_contiguous[memmap_tensor0] 1.8611μs 0.4051μs 2.4683 MOps/s 2.5096 MOps/s $\color{#d91a1a}-1.64\%$
test_stack[memmap_tensor0] 28.5210μs 4.1476μs 241.1061 KOps/s 240.1238 KOps/s $\color{#35bf28}+0.41\%$
test_memmaptd_index 1.8598ms 0.2430ms 4.1147 KOps/s 4.0873 KOps/s $\color{#35bf28}+0.67\%$
test_memmaptd_index_astensor 0.8028ms 0.3012ms 3.3200 KOps/s 3.2833 KOps/s $\color{#35bf28}+1.12\%$
test_memmaptd_index_op 1.0347ms 0.5965ms 1.6766 KOps/s 1.7266 KOps/s $\color{#d91a1a}-2.90\%$
test_serialize_model 0.1314s 0.1301s 7.6875 Ops/s 7.6445 Ops/s $\color{#35bf28}+0.56\%$
test_serialize_model_pickle 1.3512s 1.1921s 0.8388 Ops/s 0.8063 Ops/s $\color{#35bf28}+4.04\%$
test_serialize_weights 0.4189s 0.1708s 5.8561 Ops/s 7.6864 Ops/s $\textbf{\color{#d91a1a}-23.81\%}$
test_serialize_weights_returnearly 0.3454s 53.2736ms 18.7710 Ops/s 14.5072 Ops/s $\textbf{\color{#35bf28}+29.39\%}$
test_serialize_weights_pickle 1.3842s 1.1951s 0.8367 Ops/s 0.8174 Ops/s $\color{#35bf28}+2.36\%$
test_reshape_pytree 53.3610μs 21.6860μs 46.1127 KOps/s 45.5722 KOps/s $\color{#35bf28}+1.19\%$
test_reshape_td 63.0910μs 27.2149μs 36.7446 KOps/s 36.4216 KOps/s $\color{#35bf28}+0.89\%$
test_view_pytree 48.5300μs 21.5252μs 46.4572 KOps/s 46.4260 KOps/s $\color{#35bf28}+0.07\%$
test_view_td 68.1710μs 29.5883μs 33.7972 KOps/s 33.0528 KOps/s $\color{#35bf28}+2.25\%$
test_unbind_pytree 60.6810μs 27.7576μs 36.0262 KOps/s 36.3385 KOps/s $\color{#d91a1a}-0.86\%$
test_unbind_td 0.8356ms 36.5301μs 27.3747 KOps/s 28.3102 KOps/s $\color{#d91a1a}-3.30\%$
test_split_pytree 61.4110μs 29.5678μs 33.8205 KOps/s 33.7351 KOps/s $\color{#35bf28}+0.25\%$
test_split_td 1.0887ms 38.6411μs 25.8792 KOps/s 26.0185 KOps/s $\color{#d91a1a}-0.54\%$
test_add_pytree 67.8810μs 34.3747μs 29.0912 KOps/s 28.8700 KOps/s $\color{#35bf28}+0.77\%$
test_add_td 94.0520μs 51.9943μs 19.2329 KOps/s 20.7396 KOps/s $\textbf{\color{#d91a1a}-7.26\%}$
test_compile_add_one_nested[tensordict-compile] 0.1695ms 0.1182ms 8.4595 KOps/s 8.3170 KOps/s $\color{#35bf28}+1.71\%$
test_compile_add_one_nested[tensordict-eager] 0.2185ms 0.1303ms 7.6735 KOps/s 7.7171 KOps/s $\color{#d91a1a}-0.57\%$
test_compile_add_one_nested[pytree-compile] 0.1588ms 94.5884μs 10.5721 KOps/s 10.2935 KOps/s $\color{#35bf28}+2.71\%$
test_compile_add_one_nested[pytree-eager] 2.7617ms 0.1510ms 6.6204 KOps/s 6.7595 KOps/s $\color{#d91a1a}-2.06\%$
test_compile_copy_nested[tensordict-compile] 0.1515ms 22.7164μs 44.0211 KOps/s 47.5781 KOps/s $\textbf{\color{#d91a1a}-7.48\%}$
test_compile_copy_nested[tensordict-eager] 77.8310μs 29.3068μs 34.1218 KOps/s 34.6085 KOps/s $\color{#d91a1a}-1.41\%$
test_compile_copy_nested[pytree-compile] 0.1064ms 64.2360μs 15.5676 KOps/s 15.5619 KOps/s $\color{#35bf28}+0.04\%$
test_compile_copy_nested[pytree-eager] 89.7110μs 48.8204μs 20.4832 KOps/s 20.3306 KOps/s $\color{#35bf28}+0.75\%$
test_compile_add_one_flat[tensordict-compile] 0.1801ms 0.1408ms 7.1045 KOps/s 7.0879 KOps/s $\color{#35bf28}+0.23\%$
test_compile_add_one_flat[tensordict-eager] 0.3176ms 0.2155ms 4.6398 KOps/s 4.6389 KOps/s $\color{#35bf28}+0.02\%$
test_compile_add_one_flat[tensorclass-compile] 0.1374ms 97.0694μs 10.3019 KOps/s 10.4225 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_add_one_flat[tensorclass-eager] 0.1130ms 53.5615μs 18.6701 KOps/s 18.7145 KOps/s $\color{#d91a1a}-0.24\%$
test_compile_add_one_flat[pytree-compile] 0.1771ms 0.1380ms 7.2454 KOps/s 7.4739 KOps/s $\color{#d91a1a}-3.06\%$
test_compile_add_one_flat[pytree-eager] 0.5442ms 0.4832ms 2.0694 KOps/s 2.0857 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_add_self_flat[tensordict-eager] 0.4057ms 0.2588ms 3.8643 KOps/s 3.8550 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_self_flat[tensordict-compile] 0.2782ms 0.1468ms 6.8106 KOps/s 7.0971 KOps/s $\color{#d91a1a}-4.04\%$
test_compile_add_self_flat[tensorclass-eager] 0.1624ms 66.4273μs 15.0540 KOps/s 15.0739 KOps/s $\color{#d91a1a}-0.13\%$
test_compile_add_self_flat[tensorclass-compile] 0.1459ms 0.1021ms 9.7901 KOps/s 10.3683 KOps/s $\textbf{\color{#d91a1a}-5.58\%}$
test_compile_add_self_flat[pytree-eager] 0.4597ms 0.4076ms 2.4534 KOps/s 2.4679 KOps/s $\color{#d91a1a}-0.59\%$
test_compile_add_self_flat[pytree-compile] 0.1812ms 0.1362ms 7.3448 KOps/s 7.5489 KOps/s $\color{#d91a1a}-2.70\%$
test_compile_copy_flat[tensordict-compile] 55.9810μs 18.9979μs 52.6374 KOps/s 58.1463 KOps/s $\textbf{\color{#d91a1a}-9.47\%}$
test_compile_copy_flat[tensordict-eager] 62.8120μs 31.3133μs 31.9353 KOps/s 32.7470 KOps/s $\color{#d91a1a}-2.48\%$
test_compile_copy_flat[pytree-compile] 0.1048ms 69.3601μs 14.4175 KOps/s 14.3738 KOps/s $\color{#35bf28}+0.30\%$
test_compile_copy_flat[pytree-eager] 0.1757ms 51.5638μs 19.3935 KOps/s 19.5680 KOps/s $\color{#d91a1a}-0.89\%$
test_compile_assign_and_add[tensordict-compile] 1.5972ms 0.3842ms 2.6026 KOps/s 2.2726 KOps/s $\textbf{\color{#35bf28}+14.52\%}$
test_compile_assign_and_add[tensordict-eager] 2.9054ms 2.6449ms 378.0829 Ops/s 387.3894 Ops/s $\color{#d91a1a}-2.40\%$
test_compile_assign_and_add[pytree-compile] 1.5905ms 0.4321ms 2.3141 KOps/s 2.2990 KOps/s $\color{#35bf28}+0.66\%$
test_compile_assign_and_add[pytree-eager] 2.7744ms 2.6807ms 373.0331 Ops/s 379.4252 Ops/s $\color{#d91a1a}-1.68\%$
test_compile_indexing[tensor-tensordict-compile] 0.2654ms 0.1156ms 8.6475 KOps/s 8.7849 KOps/s $\color{#d91a1a}-1.56\%$
test_compile_indexing[tensor-tensordict-eager] 0.5641ms 82.9199μs 12.0598 KOps/s 11.7801 KOps/s $\color{#35bf28}+2.37\%$
test_compile_indexing[tensor-tensorclass-compile] 0.5302ms 0.1100ms 9.0941 KOps/s 9.0615 KOps/s $\color{#35bf28}+0.36\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2228ms 71.3963μs 14.0063 KOps/s 13.6012 KOps/s $\color{#35bf28}+2.98\%$
test_compile_indexing[tensor-pytree-compile] 0.1850ms 0.1088ms 9.1888 KOps/s 8.9686 KOps/s $\color{#35bf28}+2.46\%$
test_compile_indexing[tensor-pytree-eager] 0.1143ms 70.3731μs 14.2100 KOps/s 13.6162 KOps/s $\color{#35bf28}+4.36\%$
test_compile_indexing[slice-tensordict-compile] 0.2471ms 0.1032ms 9.6853 KOps/s 10.0434 KOps/s $\color{#d91a1a}-3.57\%$
test_compile_indexing[slice-tensordict-eager] 0.1445ms 16.8045μs 59.5078 KOps/s 57.5834 KOps/s $\color{#35bf28}+3.34\%$
test_compile_indexing[slice-tensorclass-compile] 0.1441ms 97.9321μs 10.2112 KOps/s 10.5993 KOps/s $\color{#d91a1a}-3.66\%$
test_compile_indexing[slice-tensorclass-eager] 55.5710μs 15.7624μs 63.4422 KOps/s 50.9103 KOps/s $\textbf{\color{#35bf28}+24.62\%}$
test_compile_indexing[slice-pytree-compile] 0.1738ms 98.0328μs 10.2007 KOps/s 10.5503 KOps/s $\color{#d91a1a}-3.31\%$
test_compile_indexing[slice-pytree-eager] 0.1059ms 15.3067μs 65.3307 KOps/s 64.4480 KOps/s $\color{#35bf28}+1.37\%$
test_compile_indexing[int-tensordict-compile] 0.1470ms 0.1032ms 9.6920 KOps/s 10.0897 KOps/s $\color{#d91a1a}-3.94\%$
test_compile_indexing[int-tensordict-eager] 0.6157ms 16.6919μs 59.9093 KOps/s 58.8217 KOps/s $\color{#35bf28}+1.85\%$
test_compile_indexing[int-tensorclass-compile] 0.1381ms 99.4047μs 10.0599 KOps/s 10.5568 KOps/s $\color{#d91a1a}-4.71\%$
test_compile_indexing[int-tensorclass-eager] 58.5610μs 15.8189μs 63.2154 KOps/s 65.2958 KOps/s $\color{#d91a1a}-3.19\%$
test_compile_indexing[int-pytree-compile] 0.1501ms 99.4297μs 10.0574 KOps/s 10.4797 KOps/s $\color{#d91a1a}-4.03\%$
test_compile_indexing[int-pytree-eager] 47.0310μs 15.6248μs 64.0007 KOps/s 65.3317 KOps/s $\color{#d91a1a}-2.04\%$
test_mod_add[eager] 86.8210μs 38.8715μs 25.7258 KOps/s 26.7702 KOps/s $\color{#d91a1a}-3.90\%$
test_mod_add[compile] 0.3846ms 79.1847μs 12.6287 KOps/s 12.2004 KOps/s $\color{#35bf28}+3.51\%$
test_mod_add[compile-overhead] 0.3252ms 0.1669ms 5.9913 KOps/s 5.6891 KOps/s $\textbf{\color{#35bf28}+5.31\%}$
test_mod_wrap[eager] 0.3304ms 0.2596ms 3.8514 KOps/s 3.8262 KOps/s $\color{#35bf28}+0.66\%$
test_mod_wrap[compile] 0.3723ms 0.2861ms 3.4947 KOps/s 3.5969 KOps/s $\color{#d91a1a}-2.84\%$
test_mod_wrap[compile-overhead] 7.1035ms 3.7667ms 265.4814 Ops/s 277.5916 Ops/s $\color{#d91a1a}-4.36\%$
test_mod_wrap_and_backward[eager] 1.6514ms 1.3661ms 732.0336 Ops/s 690.4860 Ops/s $\textbf{\color{#35bf28}+6.02\%}$
test_mod_wrap_and_backward[compile] 1.3405ms 1.2502ms 799.8534 Ops/s 741.7167 Ops/s $\textbf{\color{#35bf28}+7.84\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3948ms 0.9227ms 1.0838 KOps/s 961.6564 Ops/s $\textbf{\color{#35bf28}+12.70\%}$
test_seq_add[eager] 0.1826ms 0.1204ms 8.3051 KOps/s 8.7244 KOps/s $\color{#d91a1a}-4.81\%$
test_seq_add[compile] 0.1366ms 88.5925μs 11.2876 KOps/s 11.6546 KOps/s $\color{#d91a1a}-3.15\%$
test_seq_add[compile-overhead] 0.2588ms 0.1274ms 7.8502 KOps/s 7.8369 KOps/s $\color{#35bf28}+0.17\%$
test_seq_wrap[eager] 0.4979ms 0.4168ms 2.3991 KOps/s 2.3756 KOps/s $\color{#35bf28}+0.99\%$
test_seq_wrap[compile] 0.3700ms 0.2959ms 3.3798 KOps/s 3.3923 KOps/s $\color{#d91a1a}-0.37\%$
test_seq_wrap[compile-overhead] 0.3278ms 0.2213ms 4.5194 KOps/s 4.5229 KOps/s $\color{#d91a1a}-0.08\%$
test_func_call_runtime[False-eager] 0.8724ms 0.7325ms 1.3653 KOps/s 1.3663 KOps/s $\color{#d91a1a}-0.07\%$
test_func_call_runtime[False-compile] 0.8823ms 0.7218ms 1.3855 KOps/s 1.3731 KOps/s $\color{#35bf28}+0.90\%$
test_func_call_runtime[False-compile-overhead] 0.4039ms 0.3599ms 2.7782 KOps/s 2.8018 KOps/s $\color{#d91a1a}-0.84\%$
test_func_call_runtime[True-eager] 0.9828ms 0.8868ms 1.1276 KOps/s 1.1146 KOps/s $\color{#35bf28}+1.16\%$
test_func_call_runtime[True-compile] 0.7953ms 0.7429ms 1.3461 KOps/s 1.3355 KOps/s $\color{#35bf28}+0.79\%$
test_func_call_runtime[True-compile-overhead] 0.4997ms 0.3790ms 2.6388 KOps/s 2.5825 KOps/s $\color{#35bf28}+2.18\%$
test_func_call_cm_runtime[False-eager] 0.7987ms 0.7252ms 1.3789 KOps/s 1.3064 KOps/s $\textbf{\color{#35bf28}+5.55\%}$
test_func_call_cm_runtime[False-compile] 0.8020ms 0.7309ms 1.3681 KOps/s 1.3188 KOps/s $\color{#35bf28}+3.74\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4011ms 0.3597ms 2.7800 KOps/s 2.7971 KOps/s $\color{#d91a1a}-0.61\%$
test_func_call_cm_runtime[True-eager] 1.0921ms 0.9901ms 1.0100 KOps/s 995.1820 Ops/s $\color{#35bf28}+1.49\%$
test_func_call_cm_runtime[True-compile] 0.8397ms 0.7780ms 1.2854 KOps/s 1.2917 KOps/s $\color{#d91a1a}-0.49\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4502ms 0.4055ms 2.4661 KOps/s 2.4705 KOps/s $\color{#d91a1a}-0.18\%$
test_vmap_func_call_cm_runtime[eager] 2.5500ms 2.0734ms 482.3012 Ops/s 478.1972 Ops/s $\color{#35bf28}+0.86\%$
test_vmap_func_call_cm_runtime[compile] 0.9542ms 0.7922ms 1.2623 KOps/s 1.2635 KOps/s $\color{#d91a1a}-0.09\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5509ms 0.4101ms 2.4387 KOps/s 2.4501 KOps/s $\color{#d91a1a}-0.47\%$
test_distributed 3.9673ms 0.1906ms 5.2473 KOps/s 8.6352 KOps/s $\textbf{\color{#d91a1a}-39.23\%}$
test_tdmodule 64.9610μs 21.0847μs 47.4278 KOps/s 49.8220 KOps/s $\color{#d91a1a}-4.81\%$
test_tdmodule_dispatch 84.7820μs 38.5522μs 25.9389 KOps/s 27.4361 KOps/s $\textbf{\color{#d91a1a}-5.46\%}$
test_tdseq 42.9300μs 22.2571μs 44.9295 KOps/s 47.4589 KOps/s $\textbf{\color{#d91a1a}-5.33\%}$
test_tdseq_dispatch 60.5810μs 40.3654μs 24.7737 KOps/s 25.3321 KOps/s $\color{#d91a1a}-2.20\%$
test_instantiation_functorch 1.6313ms 1.5479ms 646.0338 Ops/s 662.7182 Ops/s $\color{#d91a1a}-2.52\%$
test_exec_functorch 0.2142ms 0.1462ms 6.8421 KOps/s 7.2215 KOps/s $\textbf{\color{#d91a1a}-5.25\%}$
test_exec_functional_call 0.1749ms 0.1362ms 7.3404 KOps/s 7.7121 KOps/s $\color{#d91a1a}-4.82\%$
test_exec_td_decorator 0.3879ms 0.1856ms 5.3892 KOps/s 5.5858 KOps/s $\color{#d91a1a}-3.52\%$
test_vmap_mlp_speed_decorator[True-True] 0.8297ms 0.6825ms 1.4653 KOps/s 1.4657 KOps/s $\color{#d91a1a}-0.03\%$
test_vmap_mlp_speed_decorator[True-False] 0.7917ms 0.6807ms 1.4691 KOps/s 1.4651 KOps/s $\color{#35bf28}+0.27\%$
test_vmap_mlp_speed_decorator[False-True] 0.7033ms 0.5890ms 1.6978 KOps/s 1.6935 KOps/s $\color{#35bf28}+0.25\%$
test_vmap_mlp_speed_decorator[False-False] 0.7309ms 0.5907ms 1.6928 KOps/s 1.6930 KOps/s $-0.01\%$
test_vmap_transformer_speed_decorator[True-True] 19.4051ms 19.1844ms 52.1257 Ops/s 52.2399 Ops/s $\color{#d91a1a}-0.22\%$
test_vmap_transformer_speed_decorator[True-False] 19.9357ms 19.3117ms 51.7821 Ops/s 52.1430 Ops/s $\color{#d91a1a}-0.69\%$
test_vmap_transformer_speed_decorator[False-True] 19.5048ms 19.0820ms 52.4053 Ops/s 52.8270 Ops/s $\color{#d91a1a}-0.80\%$
test_vmap_transformer_speed_decorator[False-False] 19.1506ms 19.0146ms 52.5913 Ops/s 52.5573 Ops/s $\color{#35bf28}+0.06\%$
test_to_module_speed[True] 1.0739ms 0.9730ms 1.0278 KOps/s 1.0246 KOps/s $\color{#35bf28}+0.31\%$
test_to_module_speed[False] 1.0368ms 0.9507ms 1.0518 KOps/s 1.0274 KOps/s $\color{#35bf28}+2.38\%$
test_tc_init 73.3410μs 38.2171μs 26.1663 KOps/s 25.7245 KOps/s $\color{#35bf28}+1.72\%$
test_tc_init_nested 0.1590ms 76.0705μs 13.1457 KOps/s 12.6595 KOps/s $\color{#35bf28}+3.84\%$
test_tc_first_layer_tensor 4.3771μs 0.7279μs 1.3738 MOps/s 1.4245 MOps/s $\color{#d91a1a}-3.56\%$
test_tc_first_layer_nontensor 22.9410μs 2.3776μs 420.5912 KOps/s 425.1506 KOps/s $\color{#d91a1a}-1.07\%$
test_tc_second_layer_tensor 30.7107μs 1.4600μs 684.9270 KOps/s 691.8700 KOps/s $\color{#d91a1a}-1.00\%$
test_tc_second_layer_nontensor 36.9500μs 3.1334μs 319.1453 KOps/s 324.1446 KOps/s $\color{#d91a1a}-1.54\%$
test_unbind 0.2285s 13.3588ms 74.8573 Ops/s 142.1178 Ops/s $\textbf{\color{#d91a1a}-47.33\%}$
test_full_like 12.4294ms 9.4684ms 105.6150 Ops/s 104.7369 Ops/s $\color{#35bf28}+0.84\%$
test_zeros_like 9.3961ms 7.2017ms 138.8557 Ops/s 228.5277 Ops/s $\textbf{\color{#d91a1a}-39.24\%}$
test_ones_like 5.4071ms 4.3718ms 228.7400 Ops/s 229.1588 Ops/s $\color{#d91a1a}-0.18\%$
test_clone 7.3631ms 6.6630ms 150.0825 Ops/s 101.3250 Ops/s $\textbf{\color{#35bf28}+48.12\%}$
test_squeeze 81.1810μs 9.6498μs 103.6287 KOps/s 106.2483 KOps/s $\color{#d91a1a}-2.47\%$
test_unsqueeze 0.1267ms 71.5902μs 13.9684 KOps/s 13.8398 KOps/s $\color{#35bf28}+0.93\%$
test_split 0.2889ms 0.1598ms 6.2561 KOps/s 6.3687 KOps/s $\color{#d91a1a}-1.77\%$
test_permute 0.2519ms 0.1785ms 5.6016 KOps/s 5.5137 KOps/s $\color{#35bf28}+1.59\%$
test_stack 53.5196ms 52.0889ms 19.1980 Ops/s 17.1623 Ops/s $\textbf{\color{#35bf28}+11.86\%}$
test_cat 52.6881ms 51.8927ms 19.2705 Ops/s 17.1705 Ops/s $\textbf{\color{#35bf28}+12.23\%}$
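
For anyone reading the table above, the Change column appears to be the relative difference between the "Ops" and "Ops on Repo HEAD" columns. The sketch below reproduces two of the rows under that assumption. The function names and the 5% bolding threshold are hypothetical: they are inferred from the rendered values, not taken from the benchmark tooling.

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percentage change of this PR's throughput relative to repo HEAD.

    Assumed formula (inferred from the table, not from the benchmark bot's source).
    """
    return (ops / ops_head - 1.0) * 100.0


def is_highlighted(change_pct: float, threshold: float = 5.0) -> bool:
    """Guess at the bolding rule: rows beyond +/-5% appear in bold in the table."""
    return abs(change_pct) > threshold


# Values copied from the test_unbind and test_clone rows (Ops, Ops on Repo HEAD).
unbind = relative_change(74.8573, 142.1178)
clone = relative_change(150.0825, 101.3250)

print(f"test_unbind: {unbind:+.2f}%  highlighted={is_highlighted(unbind)}")
print(f"test_clone:  {clone:+.2f}%  highlighted={is_highlighted(clone)}")
```

Both results match the reported figures (-47.33% and +48.12%). Swings of this size in opposite directions on unrelated tests suggest that run-to-run noise on the CI machine accounts for at least part of the variation.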

@vmoens vmoens added the Refactor Refactoring code - not a new feature label Dec 20, 2024
@vmoens vmoens merged commit b3bb6cc into gh/vmoens/39/base Dec 20, 2024
49 of 55 checks passed
vmoens added a commit that referenced this pull request Dec 20, 2024
ghstack-source-id: 48311d7a98a9895b10e5552e5b4a4f13764607e0
Pull Request resolved: #1153
@vmoens vmoens deleted the gh/vmoens/39/head branch December 20, 2024 17:58
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Refactor Refactoring code - not a new feature
2 participants