-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Refactor] Add missing functions in tensorclass register #1153
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Dec 20, 2024
ghstack-source-id: 9f959be7b04c915596bdd67e149dc241cb8b4635 Pull Request resolved: #1153
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Dec 20, 2024
vmoens
added a commit
that referenced
this pull request
Dec 20, 2024
ghstack-source-id: 48311d7a98a9895b10e5552e5b4a4f13764607e0 Pull Request resolved: #1153
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 42.4800μs | 20.8839μs | 47.8837 KOps/s | 51.4393 KOps/s | |
test_plain_set_stack_nested | 40.9160μs | 21.0786μs | 47.4414 KOps/s | 50.1796 KOps/s | |
test_plain_set_nested_inplace | 83.8690μs | 22.5210μs | 44.4031 KOps/s | 46.8970 KOps/s | |
test_plain_set_stack_nested_inplace | 56.0150μs | 22.6759μs | 44.0996 KOps/s | 46.8564 KOps/s | |
test_items | 30.9290μs | 4.2114μs | 237.4510 KOps/s | 245.3104 KOps/s | |
test_items_nested | 0.8172ms | 0.4021ms | 2.4871 KOps/s | 2.4937 KOps/s | |
test_items_nested_locked | 0.7242ms | 0.4017ms | 2.4895 KOps/s | 2.4863 KOps/s | |
test_items_nested_leaf | 0.1763ms | 76.1161μs | 13.1378 KOps/s | 12.9160 KOps/s | |
test_items_stack_nested | 0.4766ms | 0.4038ms | 2.4763 KOps/s | 2.4746 KOps/s | |
test_items_stack_nested_leaf | 0.1545ms | 79.1589μs | 12.6328 KOps/s | 12.4136 KOps/s | |
test_items_stack_nested_locked | 0.5024ms | 0.4090ms | 2.4451 KOps/s | 2.4664 KOps/s | |
test_keys | 22.2910μs | 3.5270μs | 283.5251 KOps/s | 285.3914 KOps/s | |
test_keys_nested | 0.2635ms | 0.1680ms | 5.9527 KOps/s | 6.0114 KOps/s | |
test_keys_nested_locked | 0.7546ms | 0.1732ms | 5.7723 KOps/s | 5.8222 KOps/s | |
test_keys_nested_leaf | 0.2750ms | 0.1466ms | 6.8194 KOps/s | 6.9076 KOps/s | |
test_keys_stack_nested | 0.3034ms | 0.1673ms | 5.9764 KOps/s | 6.1445 KOps/s | |
test_keys_stack_nested_leaf | 0.2498ms | 0.1444ms | 6.9244 KOps/s | 7.1383 KOps/s | |
test_keys_stack_nested_locked | 0.2814ms | 0.1727ms | 5.7892 KOps/s | 5.8882 KOps/s | |
test_values | 8.2756μs | 1.0341μs | 967.0530 KOps/s | 944.2112 KOps/s | |
test_values_nested | 0.1119ms | 61.9390μs | 16.1449 KOps/s | 16.4331 KOps/s | |
test_values_nested_locked | 0.1181ms | 61.5558μs | 16.2454 KOps/s | 16.0020 KOps/s | |
test_values_nested_leaf | 0.1296ms | 71.9219μs | 13.9040 KOps/s | 13.9136 KOps/s | |
test_values_stack_nested | 0.1179ms | 63.5828μs | 15.7275 KOps/s | 16.0386 KOps/s | |
test_values_stack_nested_leaf | 0.1422ms | 73.1976μs | 13.6616 KOps/s | 14.1810 KOps/s | |
test_values_stack_nested_locked | 0.1282ms | 64.0188μs | 15.6204 KOps/s | 15.8001 KOps/s | |
test_membership | 5.7461μs | 0.7555μs | 1.3236 MOps/s | 1.1310 MOps/s | |
test_membership_nested | 22.0310μs | 3.0276μs | 330.2992 KOps/s | 340.1407 KOps/s | |
test_membership_nested_leaf | 46.6800μs | 3.0406μs | 328.8837 KOps/s | 331.2892 KOps/s | |
test_membership_stacked_nested | 20.9690μs | 3.0009μs | 333.2338 KOps/s | 338.7493 KOps/s | |
test_membership_stacked_nested_leaf | 20.8690μs | 3.0221μs | 330.8910 KOps/s | 330.6080 KOps/s | |
test_membership_nested_last | 24.1150μs | 4.4205μs | 226.2206 KOps/s | 227.7503 KOps/s | |
test_membership_nested_leaf_last | 28.9240μs | 4.4611μs | 224.1604 KOps/s | 227.3148 KOps/s | |
test_membership_stacked_nested_last | 30.2270μs | 4.4373μs | 225.3626 KOps/s | 74.5823 KOps/s | |
test_membership_stacked_nested_leaf_last | 22.0010μs | 4.4276μs | 225.8585 KOps/s | 74.4194 KOps/s | |
test_nested_getleaf | 35.6470μs | 10.7682μs | 92.8661 KOps/s | 94.6968 KOps/s | |
test_nested_get | 37.4900μs | 10.2457μs | 97.6021 KOps/s | 94.9969 KOps/s | |
test_stacked_getleaf | 31.4280μs | 10.7971μs | 92.6174 KOps/s | 93.6384 KOps/s | |
test_stacked_get | 34.1740μs | 10.3772μs | 96.3656 KOps/s | 98.3964 KOps/s | |
test_nested_getitemleaf | 36.0680μs | 11.2737μs | 88.7021 KOps/s | 90.0195 KOps/s | |
test_nested_getitem | 50.7240μs | 10.4179μs | 95.9886 KOps/s | 97.0579 KOps/s | |
test_stacked_getitemleaf | 42.8300μs | 11.1203μs | 89.9253 KOps/s | 90.6124 KOps/s | |
test_stacked_getitem | 48.6340μs | 10.3130μs | 96.9654 KOps/s | 100.6830 KOps/s | |
test_lock_nested | 7.4235ms | 0.4656ms | 2.1479 KOps/s | 2.2004 KOps/s | |
test_lock_stack_nested | 0.8572ms | 0.4284ms | 2.3342 KOps/s | 2.4063 KOps/s | |
test_unlock_nested | 0.7023ms | 0.3737ms | 2.6759 KOps/s | 2.6573 KOps/s | |
test_unlock_stack_nested | 0.7732ms | 0.3490ms | 2.8654 KOps/s | 2.9733 KOps/s | |
test_flatten_speed | 0.2378ms | 99.8328μs | 10.0168 KOps/s | 9.9103 KOps/s | |
test_unflatten_speed | 0.7046ms | 0.5301ms | 1.8865 KOps/s | 1.9047 KOps/s | |
test_common_ops | 7.3586ms | 0.7853ms | 1.2734 KOps/s | 1.3457 KOps/s | |
test_creation | 18.8450μs | 2.4621μs | 406.1571 KOps/s | 399.7069 KOps/s | |
test_creation_empty | 33.2220μs | 11.4621μs | 87.2437 KOps/s | 105.2938 KOps/s | |
test_creation_nested_1 | 70.6820μs | 14.7831μs | 67.6450 KOps/s | 80.2342 KOps/s | |
test_creation_nested_2 | 53.9710μs | 19.1349μs | 52.2605 KOps/s | 58.2061 KOps/s | |
test_clone | 53.0390μs | 13.6071μs | 73.4908 KOps/s | 73.9001 KOps/s | |
test_getitem[int] | 1.3275ms | 13.4932μs | 74.1114 KOps/s | 77.0488 KOps/s | |
test_getitem[slice_int] | 0.1407ms | 24.8894μs | 40.1777 KOps/s | 40.5283 KOps/s | |
test_getitem[range] | 0.2965ms | 51.6861μs | 19.3475 KOps/s | 19.8824 KOps/s | |
test_getitem[tuple] | 0.1348ms | 20.7408μs | 48.2140 KOps/s | 48.8710 KOps/s | |
test_getitem[list] | 0.1587ms | 43.7539μs | 22.8551 KOps/s | 22.1371 KOps/s | |
test_setitem_dim[int] | 50.0830μs | 25.5671μs | 39.1128 KOps/s | 39.2149 KOps/s | |
test_setitem_dim[slice_int] | 0.1043ms | 51.9969μs | 19.2319 KOps/s | 18.9919 KOps/s | |
test_setitem_dim[range] | 0.1113ms | 73.3763μs | 13.6284 KOps/s | 13.3456 KOps/s | |
test_setitem_dim[tuple] | 0.1005ms | 40.5727μs | 24.6471 KOps/s | 24.4229 KOps/s | |
test_setitem | 64.0420μs | 20.5280μs | 48.7140 KOps/s | 51.9164 KOps/s | |
test_set | 73.4980μs | 20.2955μs | 49.2721 KOps/s | 52.7755 KOps/s | |
test_set_shared | 3.6174ms | 0.1709ms | 5.8518 KOps/s | 5.8261 KOps/s | |
test_update | 0.1183ms | 22.7648μs | 43.9275 KOps/s | 48.8001 KOps/s | |
test_update_nested | 91.7210μs | 33.2859μs | 30.0427 KOps/s | 32.5178 KOps/s | |
test_update__nested | 0.3914ms | 33.4133μs | 29.9282 KOps/s | 29.5916 KOps/s | |
test_set_nested | 68.0460μs | 22.7687μs | 43.9200 KOps/s | 47.5976 KOps/s | |
test_set_nested_new | 0.2242ms | 27.2638μs | 36.6786 KOps/s | 38.8131 KOps/s | |
test_select | 0.2311ms | 44.2747μs | 22.5863 KOps/s | 24.1737 KOps/s | |
test_select_nested | 0.1367ms | 62.9610μs | 15.8829 KOps/s | 15.9212 KOps/s | |
test_exclude_nested | 0.1657ms | 82.9718μs | 12.0523 KOps/s | 12.0034 KOps/s | |
test_empty[True] | 0.8702ms | 0.4174ms | 2.3960 KOps/s | 2.4103 KOps/s | |
test_empty[False] | 11.8045μs | 1.4400μs | 694.4594 KOps/s | 726.7631 KOps/s | |
test_unbind_speed | 0.3656ms | 0.2741ms | 3.6482 KOps/s | 3.6401 KOps/s | |
test_unbind_speed_stack0 | 0.4130ms | 0.2717ms | 3.6802 KOps/s | 3.8486 KOps/s | |
test_unbind_speed_stack1 | 94.5940ms | 0.7962ms | 1.2560 KOps/s | 1.4135 KOps/s | |
test_split | 1.8288ms | 1.6099ms | 621.1500 Ops/s | 568.1169 Ops/s | |
test_chunk | 0.1001s | 1.9206ms | 520.6708 Ops/s | 563.6132 Ops/s | |
test_consolidate_njt[False-None] | 10.9857ms | 8.0860ms | 123.6701 Ops/s | 122.9251 Ops/s | |
test_creation[device0] | 4.3612ms | 90.9403μs | 10.9962 KOps/s | 11.0290 KOps/s | |
test_creation_from_tensor | 0.2457ms | 93.0437μs | 10.7476 KOps/s | 10.4908 KOps/s | |
test_add_one[memmap_tensor0] | 0.2130ms | 4.8739μs | 205.1761 KOps/s | 199.8058 KOps/s | |
test_contiguous[memmap_tensor0] | 16.6800μs | 0.5189μs | 1.9273 MOps/s | 1.9193 MOps/s | |
test_stack[memmap_tensor0] | 36.9690μs | 3.3150μs | 301.6564 KOps/s | 298.9510 KOps/s | |
test_memmaptd_index | 1.0286ms | 0.2406ms | 4.1569 KOps/s | 4.1276 KOps/s | |
test_memmaptd_index_astensor | 0.7358ms | 0.3264ms | 3.0633 KOps/s | 3.0504 KOps/s | |
test_memmaptd_index_op | 0.9655ms | 0.5897ms | 1.6959 KOps/s | 1.7638 KOps/s | |
test_serialize_model | 0.1268s | 0.1174s | 8.5153 Ops/s | 8.7844 Ops/s | |
test_serialize_model_pickle | 0.4456s | 0.3877s | 2.5793 Ops/s | 2.5745 Ops/s | |
test_serialize_weights | 0.1227s | 0.1128s | 8.8641 Ops/s | 8.7842 Ops/s | |
test_serialize_weights_returnearly | 0.2417s | 0.1698s | 5.8901 Ops/s | 6.4549 Ops/s | |
test_serialize_weights_pickle | 0.5569s | 0.4408s | 2.2686 Ops/s | 2.5402 Ops/s | |
test_serialize_weights_filesystem | 0.1459s | 0.1393s | 7.1791 Ops/s | 7.1087 Ops/s | |
test_serialize_model_filesystem | 0.1586s | 0.1439s | 6.9474 Ops/s | 6.1430 Ops/s | |
test_reshape_pytree | 68.5680μs | 26.9159μs | 37.1527 KOps/s | 37.5079 KOps/s | |
test_reshape_td | 89.1170μs | 34.3163μs | 29.1407 KOps/s | 29.6401 KOps/s | |
test_view_pytree | 74.0980μs | 26.4275μs | 37.8394 KOps/s | 37.5182 KOps/s | |
test_view_td | 0.1465ms | 38.5744μs | 25.9239 KOps/s | 25.9836 KOps/s | |
test_unbind_pytree | 65.3620μs | 30.4185μs | 32.8747 KOps/s | 33.8781 KOps/s | |
test_unbind_td | 0.3778ms | 40.6486μs | 24.6011 KOps/s | 25.0544 KOps/s | |
test_split_pytree | 75.6910μs | 30.3655μs | 32.9322 KOps/s | 34.0014 KOps/s | |
test_split_td | 0.5444ms | 46.0398μs | 21.7203 KOps/s | 21.8153 KOps/s | |
test_add_pytree | 78.4470μs | 36.2243μs | 27.6058 KOps/s | 27.5301 KOps/s | |
test_add_td | 0.1335ms | 61.7553μs | 16.1930 KOps/s | 17.8086 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1476ms | 61.9405μs | 16.1445 KOps/s | 15.8624 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3717ms | 0.1718ms | 5.8223 KOps/s | 5.9132 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1170ms | 45.1431μs | 22.1518 KOps/s | 21.5362 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2930ms | 0.1189ms | 8.4118 KOps/s | 8.2629 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 76.7130μs | 26.2547μs | 38.0884 KOps/s | 38.6363 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1211ms | 59.9486μs | 16.6810 KOps/s | 16.8736 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1557ms | 79.7130μs | 12.5450 KOps/s | 12.6517 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1226ms | 69.0017μs | 14.4924 KOps/s | 14.8130 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1758ms | 0.1034ms | 9.6690 KOps/s | 9.4078 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 1.3129ms | 0.2146ms | 4.6600 KOps/s | 4.6403 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 88.1050μs | 44.1643μs | 22.6427 KOps/s | 21.5324 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 1.2316ms | 64.7078μs | 15.4541 KOps/s | 15.4900 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2027ms | 0.1022ms | 9.7830 KOps/s | 9.4850 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.2725ms | 0.2021ms | 4.9482 KOps/s | 4.9513 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4238ms | 0.2330ms | 4.2916 KOps/s | 4.2449 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2041ms | 0.1043ms | 9.5884 KOps/s | 9.4507 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1327ms | 59.2030μs | 16.8910 KOps/s | 17.0595 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1268ms | 45.2034μs | 22.1222 KOps/s | 21.5027 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6174ms | 0.1588ms | 6.2973 KOps/s | 6.2865 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2196ms | 0.1036ms | 9.6565 KOps/s | 9.5979 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 69.6100μs | 20.3752μs | 49.0792 KOps/s | 45.0692 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1633ms | 65.8349μs | 15.1895 KOps/s | 15.1337 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1824ms | 82.7556μs | 12.0838 KOps/s | 12.4132 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1341ms | 70.0071μs | 14.2843 KOps/s | 14.5070 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3294ms | 0.2078ms | 4.8135 KOps/s | 4.8645 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5501ms | 1.3437ms | 744.2195 Ops/s | 770.3418 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2942ms | 0.2047ms | 4.8842 KOps/s | 5.0185 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.5833ms | 0.7766ms | 1.2877 KOps/s | 1.3019 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.7769ms | 0.4615ms | 2.1669 KOps/s | 2.2370 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9524ms | 2.6529ms | 376.9470 Ops/s | 391.6455 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1045ms | 35.2091μs | 28.4018 KOps/s | 27.1550 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5394ms | 33.6464μs | 29.7208 KOps/s | 29.6036 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 80.5900μs | 28.6116μs | 34.9509 KOps/s | 33.5144 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 70.0010μs | 23.3011μs | 42.9164 KOps/s | 41.8707 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 93.5940μs | 29.3688μs | 34.0497 KOps/s | 32.6132 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 87.2730μs | 23.3267μs | 42.8693 KOps/s | 42.6801 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1142ms | 51.3852μs | 19.4609 KOps/s | 19.3500 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5945ms | 20.7605μs | 48.1685 KOps/s | 49.1646 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1140ms | 44.0273μs | 22.7132 KOps/s | 22.6394 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 77.7350μs | 19.0036μs | 52.6216 KOps/s | 52.8946 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1024ms | 45.0441μs | 22.2004 KOps/s | 22.1046 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 57.4370μs | 19.0646μs | 52.4534 KOps/s | 52.3673 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1434ms | 52.6648μs | 18.9880 KOps/s | 19.0652 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0450ms | 20.6609μs | 48.4006 KOps/s | 50.1287 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 86.5320μs | 45.1027μs | 22.1716 KOps/s | 22.1419 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 60.4630μs | 18.9339μs | 52.8153 KOps/s | 53.5839 KOps/s | |
test_compile_indexing[int-pytree-compile] | 97.8640μs | 44.9851μs | 22.2296 KOps/s | 22.2901 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.5403ms | 18.8816μs | 52.9616 KOps/s | 52.8868 KOps/s | |
test_mod_add[eager] | 87.8550μs | 34.2987μs | 29.1556 KOps/s | 30.4331 KOps/s | |
test_mod_add[compile] | 0.1234ms | 46.4588μs | 21.5245 KOps/s | 20.9593 KOps/s | |
test_mod_add[compile-overhead] | 0.1663ms | 46.4414μs | 21.5325 KOps/s | 21.2842 KOps/s | |
test_mod_wrap[eager] | 0.3637ms | 0.2203ms | 4.5392 KOps/s | 4.5772 KOps/s | |
test_mod_wrap[compile] | 0.4396ms | 0.2032ms | 4.9204 KOps/s | 4.8460 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3071ms | 0.2002ms | 4.9953 KOps/s | 4.8922 KOps/s | |
test_mod_wrap_and_backward[eager] | 11.7884ms | 10.7196ms | 93.2867 Ops/s | 88.5406 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.6274ms | 10.8309ms | 92.3282 Ops/s | 84.9819 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.4927ms | 10.8493ms | 92.1715 Ops/s | 81.8080 Ops/s | |
test_seq_add[eager] | 0.2345ms | 0.1149ms | 8.7003 KOps/s | 8.7608 KOps/s | |
test_seq_add[compile] | 0.1251ms | 61.8769μs | 16.1611 KOps/s | 16.2269 KOps/s | |
test_seq_add[compile-overhead] | 0.1174ms | 58.9674μs | 16.9585 KOps/s | 16.4749 KOps/s | |
test_seq_wrap[eager] | 0.7334ms | 0.4445ms | 2.2496 KOps/s | 2.2944 KOps/s | |
test_seq_wrap[compile] | 0.3326ms | 0.2228ms | 4.4889 KOps/s | 4.3984 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3342ms | 0.2240ms | 4.4634 KOps/s | 4.2627 KOps/s | |
test_func_call_runtime[False-eager] | 0.9148ms | 0.5489ms | 1.8217 KOps/s | 1.8665 KOps/s | |
test_func_call_runtime[False-compile] | 0.5343ms | 0.4206ms | 2.3776 KOps/s | 2.3602 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5778ms | 0.4201ms | 2.3805 KOps/s | 2.3438 KOps/s | |
test_func_call_runtime[True-eager] | 0.9079ms | 0.7545ms | 1.3254 KOps/s | 1.3284 KOps/s | |
test_func_call_runtime[True-compile] | 0.5675ms | 0.4574ms | 2.1863 KOps/s | 2.1474 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8889ms | 0.4652ms | 2.1498 KOps/s | 2.1435 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7744ms | 0.5514ms | 1.8137 KOps/s | 1.8573 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7852ms | 0.4212ms | 2.3739 KOps/s | 2.3619 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5468ms | 0.4207ms | 2.3772 KOps/s | 2.3646 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0047ms | 0.9023ms | 1.1082 KOps/s | 1.1073 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6519ms | 0.4871ms | 2.0532 KOps/s | 2.0166 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9806ms | 0.4881ms | 2.0490 KOps/s | 2.0353 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6827ms | 1.9559ms | 511.2685 Ops/s | 522.0183 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.6298ms | 0.5193ms | 1.9257 KOps/s | 1.9286 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8998ms | 0.5260ms | 1.9013 KOps/s | 1.9221 KOps/s | |
test_distributed | 0.2568ms | 0.1214ms | 8.2396 KOps/s | 7.8801 KOps/s | |
test_tdmodule | 82.5240μs | 25.7350μs | 38.8576 KOps/s | 40.5513 KOps/s | |
test_tdmodule_dispatch | 74.7000μs | 46.6417μs | 21.4400 KOps/s | 22.2458 KOps/s | |
test_tdseq | 53.9100μs | 29.3698μs | 34.0486 KOps/s | 36.2995 KOps/s | |
test_tdseq_dispatch | 87.4230μs | 54.8256μs | 18.2397 KOps/s | 19.4561 KOps/s | |
test_instantiation_functorch | 2.5300ms | 1.5410ms | 648.9420 Ops/s | 648.8297 Ops/s | |
test_exec_functorch | 0.3384ms | 0.1778ms | 5.6232 KOps/s | 5.5926 KOps/s | |
test_exec_functional_call | 0.4147ms | 0.1689ms | 5.9222 KOps/s | 5.7700 KOps/s | |
test_exec_td_decorator | 0.4348ms | 0.2289ms | 4.3691 KOps/s | 4.3297 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8889ms | 0.6619ms | 1.5108 KOps/s | 1.5423 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0403ms | 0.6607ms | 1.5136 KOps/s | 1.4941 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7829ms | 0.5356ms | 1.8672 KOps/s | 1.8866 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7668ms | 0.5367ms | 1.8632 KOps/s | 1.8808 KOps/s | |
test_to_module_speed[True] | 1.7139ms | 1.3512ms | 740.0633 Ops/s | 746.6755 Ops/s | |
test_to_module_speed[False] | 1.8828ms | 1.3127ms | 761.7984 Ops/s | 758.8019 Ops/s | |
test_tc_init | 99.4260μs | 48.7439μs | 20.5154 KOps/s | 22.6061 KOps/s | |
test_tc_init_nested | 0.1817ms | 97.2907μs | 10.2785 KOps/s | 11.2608 KOps/s | |
test_tc_first_layer_tensor | 21.3490μs | 1.5729μs | 635.7675 KOps/s | 671.8665 KOps/s | |
test_tc_first_layer_nontensor | 25.2580μs | 4.8544μs | 205.9994 KOps/s | 214.6420 KOps/s | |
test_tc_second_layer_tensor | 38.6020μs | 2.9092μs | 343.7385 KOps/s | 368.1902 KOps/s | |
test_tc_second_layer_nontensor | 26.9000μs | 6.2388μs | 160.2867 KOps/s | 166.9336 KOps/s | |
test_unbind | 0.2168s | 13.1147ms | 76.2500 Ops/s | 79.0726 Ops/s | |
test_full_like | 9.0714ms | 7.0981ms | 140.8831 Ops/s | 142.8273 Ops/s | |
test_zeros_like | 3.1526ms | 2.7295ms | 366.3610 Ops/s | 142.6210 Ops/s | |
test_ones_like | 3.9064ms | 3.2075ms | 311.7670 Ops/s | 134.0009 Ops/s | |
test_clone | 5.4503ms | 4.9329ms | 202.7202 Ops/s | 108.4997 Ops/s | |
test_squeeze | 58.9900μs | 11.9759μs | 83.5008 KOps/s | 83.2202 KOps/s | |
test_unsqueeze | 0.2816ms | 92.2472μs | 10.8404 KOps/s | 10.8424 KOps/s | |
test_split | 0.3960ms | 0.1971ms | 5.0739 KOps/s | 5.0950 KOps/s | |
test_permute | 0.2788ms | 0.2027ms | 4.9339 KOps/s | 4.7464 KOps/s | |
test_stack | 29.6767ms | 25.3196ms | 39.4951 Ops/s | 42.6734 Ops/s | |
test_cat | 27.3773ms | 24.8880ms | 40.1801 Ops/s | 42.6412 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.3939ms | 13.3656μs | 74.8188 KOps/s | 78.8742 KOps/s | |
test_plain_set_stack_nested | 43.1610μs | 13.3909μs | 74.6776 KOps/s | 77.6048 KOps/s | |
test_plain_set_nested_inplace | 0.3974ms | 14.4055μs | 69.4179 KOps/s | 71.7484 KOps/s | |
test_plain_set_stack_nested_inplace | 48.2000μs | 14.4714μs | 69.1016 KOps/s | 71.9805 KOps/s | |
test_items | 0.3856ms | 2.8976μs | 345.1132 KOps/s | 342.8856 KOps/s | |
test_items_nested | 0.7363ms | 0.3549ms | 2.8180 KOps/s | 2.8216 KOps/s | |
test_items_nested_locked | 0.7377ms | 0.3584ms | 2.7898 KOps/s | 2.8020 KOps/s | |
test_items_nested_leaf | 85.0120μs | 58.6758μs | 17.0428 KOps/s | 17.2407 KOps/s | |
test_items_stack_nested | 0.7480ms | 0.3590ms | 2.7852 KOps/s | 2.7767 KOps/s | |
test_items_stack_nested_leaf | 0.4473ms | 59.3778μs | 16.8413 KOps/s | 16.8276 KOps/s | |
test_items_stack_nested_locked | 0.7323ms | 0.3595ms | 2.7820 KOps/s | 2.7926 KOps/s | |
test_keys | 0.3873ms | 3.4403μs | 290.6718 KOps/s | 289.7814 KOps/s | |
test_keys_nested | 0.4542ms | 80.7142μs | 12.3894 KOps/s | 12.2588 KOps/s | |
test_keys_nested_locked | 0.7901ms | 87.0691μs | 11.4851 KOps/s | 11.4485 KOps/s | |
test_keys_nested_leaf | 0.4520ms | 72.4407μs | 13.8044 KOps/s | 13.9510 KOps/s | |
test_keys_stack_nested | 0.4546ms | 82.0864μs | 12.1823 KOps/s | 11.9763 KOps/s | |
test_keys_stack_nested_leaf | 0.4450ms | 72.6642μs | 13.7619 KOps/s | 13.5470 KOps/s | |
test_keys_stack_nested_locked | 0.4639ms | 87.0890μs | 11.4825 KOps/s | 11.2272 KOps/s | |
test_values | 62.9560μs | 0.8527μs | 1.1728 MOps/s | 1.1690 MOps/s | |
test_values_nested | 0.4123ms | 34.8724μs | 28.6760 KOps/s | 28.9965 KOps/s | |
test_values_nested_locked | 0.4137ms | 36.3029μs | 27.5460 KOps/s | 27.6289 KOps/s | |
test_values_nested_leaf | 64.6810μs | 39.6816μs | 25.2006 KOps/s | 25.6536 KOps/s | |
test_values_stack_nested | 0.4391ms | 34.9217μs | 28.6355 KOps/s | 28.6253 KOps/s | |
test_values_stack_nested_leaf | 0.4140ms | 39.9701μs | 25.0187 KOps/s | 25.3061 KOps/s | |
test_values_stack_nested_locked | 0.4130ms | 36.6897μs | 27.2556 KOps/s | 27.0163 KOps/s | |
test_membership | 19.1884μs | 0.5110μs | 1.9570 MOps/s | 1.9845 MOps/s | |
test_membership_nested | 0.1905ms | 2.0964μs | 477.0088 KOps/s | 497.0160 KOps/s | |
test_membership_nested_leaf | 0.1917ms | 2.1002μs | 476.1540 KOps/s | 514.6824 KOps/s | |
test_membership_stacked_nested | 28.6810μs | 2.1911μs | 456.3823 KOps/s | 489.1851 KOps/s | |
test_membership_stacked_nested_leaf | 25.3210μs | 2.1952μs | 455.5355 KOps/s | 495.2048 KOps/s | |
test_membership_nested_last | 0.3925ms | 3.1962μs | 312.8706 KOps/s | 328.9318 KOps/s | |
test_membership_nested_leaf_last | 27.8500μs | 3.1989μs | 312.6115 KOps/s | 331.4205 KOps/s | |
test_membership_stacked_nested_last | 0.3965ms | 3.2088μs | 311.6462 KOps/s | 327.8174 KOps/s | |
test_membership_stacked_nested_leaf_last | 27.6010μs | 3.1569μs | 316.7696 KOps/s | 323.1960 KOps/s | |
test_nested_getleaf | 0.3901ms | 6.1719μs | 162.0245 KOps/s | 162.3205 KOps/s | |
test_nested_get | 31.1410μs | 5.8018μs | 172.3593 KOps/s | 172.9459 KOps/s | |
test_stacked_getleaf | 0.3888ms | 6.2062μs | 161.1297 KOps/s | 162.3034 KOps/s | |
test_stacked_get | 35.8910μs | 5.8272μs | 171.6103 KOps/s | 171.4799 KOps/s | |
test_nested_getitemleaf | 0.3968ms | 6.2081μs | 161.0811 KOps/s | 160.8207 KOps/s | |
test_nested_getitem | 0.3926ms | 5.8712μs | 170.3227 KOps/s | 168.2525 KOps/s | |
test_stacked_getitemleaf | 27.4200μs | 6.1900μs | 161.5504 KOps/s | 160.2474 KOps/s | |
test_stacked_getitem | 30.8900μs | 5.8976μs | 169.5609 KOps/s | 169.8012 KOps/s | |
test_lock_nested | 0.8276ms | 0.3726ms | 2.6837 KOps/s | 2.6378 KOps/s | |
test_lock_stack_nested | 0.3867ms | 0.3422ms | 2.9224 KOps/s | 2.9202 KOps/s | |
test_unlock_nested | 0.6474ms | 0.3121ms | 3.2037 KOps/s | 3.2371 KOps/s | |
test_unlock_stack_nested | 0.3863ms | 0.2774ms | 3.6044 KOps/s | 3.5687 KOps/s | |
test_flatten_speed | 0.1224ms | 76.0870μs | 13.1428 KOps/s | 13.2687 KOps/s | |
test_unflatten_speed | 0.4551ms | 0.3242ms | 3.0849 KOps/s | 3.1588 KOps/s | |
test_common_ops | 1.6890ms | 0.6350ms | 1.5748 KOps/s | 1.6235 KOps/s | |
test_creation | 0.1136ms | 1.6897μs | 591.8221 KOps/s | 568.2145 KOps/s | |
test_creation_empty | 33.9600μs | 10.3061μs | 97.0300 KOps/s | 108.2153 KOps/s | |
test_creation_nested_1 | 0.3856ms | 11.8595μs | 84.3207 KOps/s | 91.2945 KOps/s | |
test_creation_nested_2 | 37.7710μs | 14.7295μs | 67.8909 KOps/s | 72.4448 KOps/s | |
test_clone | 0.3942ms | 10.1828μs | 98.2049 KOps/s | 97.2449 KOps/s | |
test_getitem[int] | 1.9372ms | 10.4501μs | 95.6927 KOps/s | 95.1307 KOps/s | |
test_getitem[slice_int] | 0.1069ms | 20.2703μs | 49.3332 KOps/s | 48.3911 KOps/s | |
test_getitem[range] | 0.1250ms | 34.8985μs | 28.6545 KOps/s | 28.1903 KOps/s | |
test_getitem[tuple] | 0.3955ms | 17.6150μs | 56.7697 KOps/s | 55.8126 KOps/s | |
test_getitem[list] | 0.2695ms | 30.9403μs | 32.3203 KOps/s | 31.4715 KOps/s | |
test_setitem_dim[int] | 46.4110μs | 17.8498μs | 56.0229 KOps/s | 56.1249 KOps/s | |
test_setitem_dim[slice_int] | 66.2010μs | 36.9557μs | 27.0594 KOps/s | 27.0922 KOps/s | |
test_setitem_dim[range] | 90.4320μs | 49.1437μs | 20.3485 KOps/s | 19.8782 KOps/s | |
test_setitem_dim[tuple] | 51.9210μs | 30.1657μs | 33.1502 KOps/s | 32.0972 KOps/s | |
test_setitem | 0.4061ms | 15.6973μs | 63.7050 KOps/s | 66.9879 KOps/s | |
test_set | 0.1030ms | 14.9403μs | 66.9329 KOps/s | 67.6557 KOps/s | |
test_set_shared | 1.4934ms | 0.1480ms | 6.7584 KOps/s | 6.7811 KOps/s | |
test_update | 0.3553ms | 19.1578μs | 52.1981 KOps/s | 55.4493 KOps/s | |
test_update_nested | 0.4261ms | 25.2120μs | 39.6637 KOps/s | 42.7054 KOps/s | |
test_update__nested | 0.2846ms | 24.6285μs | 40.6034 KOps/s | 40.6977 KOps/s | |
test_set_nested | 0.4038ms | 16.5811μs | 60.3096 KOps/s | 62.9080 KOps/s | |
test_set_nested_new | 90.6810μs | 18.8066μs | 53.1729 KOps/s | 54.3290 KOps/s | |
test_select | 0.4347ms | 31.3178μs | 31.9307 KOps/s | 33.2710 KOps/s | |
test_select_nested | 0.1015ms | 42.2987μs | 23.6414 KOps/s | 23.0799 KOps/s | |
test_exclude_nested | 0.1382ms | 61.2087μs | 16.3375 KOps/s | 16.2737 KOps/s | |
test_empty[True] | 0.4046ms | 0.2861ms | 3.4955 KOps/s | 3.5108 KOps/s | |
test_empty[False] | 3.5010μs | 0.8248μs | 1.2125 MOps/s | 1.2235 MOps/s | |
test_to | 87.1120μs | 55.3435μs | 18.0690 KOps/s | 17.8068 KOps/s | |
test_to_nonblocking | 89.8010μs | 46.9334μs | 21.3068 KOps/s | 21.1336 KOps/s | |
test_unbind_speed | 1.7026ms | 0.2350ms | 4.2559 KOps/s | 4.2582 KOps/s | |
test_unbind_speed_stack0 | 0.3275ms | 0.2338ms | 4.2775 KOps/s | 4.2618 KOps/s | |
test_unbind_speed_stack1 | 94.3405ms | 0.6658ms | 1.5019 KOps/s | 1.4946 KOps/s | |
test_split | 95.7939ms | 1.7090ms | 585.1417 Ops/s | 637.6674 Ops/s | |
test_chunk | 1.5329ms | 1.4257ms | 701.3921 Ops/s | 581.6947 Ops/s | |
test_consolidate[False-None] | 98.2467ms | 2.9006ms | 344.7504 Ops/s | 380.7737 Ops/s | |
test_consolidate[default-None] | 1.6872ms | 1.6259ms | 615.0586 Ops/s | 612.3963 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7848ms | 1.7164ms | 582.6126 Ops/s | 597.9262 Ops/s | |
test_consolidate_njt[False-None] | 6.5359ms | 6.2948ms | 158.8612 Ops/s | 156.1006 Ops/s | |
test_to[False-False-None] | 1.8402ms | 1.7324ms | 577.2377 Ops/s | 584.5831 Ops/s | |
test_to[True-False-None] | 1.5040ms | 1.2775ms | 782.7596 Ops/s | 772.2573 Ops/s | |
test_to[within-False-None] | 4.2416ms | 4.0455ms | 247.1876 Ops/s | 250.1933 Ops/s | |
test_to[True-default-None] | 5.4517ms | 5.2025ms | 192.2161 Ops/s | 191.1417 Ops/s | |
test_to_njt[False-False-None] | 6.9315ms | 6.7374ms | 148.4251 Ops/s | 144.3035 Ops/s | |
test_to_njt[True-False-None] | 5.5771ms | 5.3004ms | 188.6657 Ops/s | 188.2343 Ops/s | |
test_to_njt[within-False-None] | 12.0260ms | 11.8829ms | 84.1545 Ops/s | 85.4667 Ops/s | |
test_creation[device0] | 0.4584ms | 77.3193μs | 12.9334 KOps/s | 12.2090 KOps/s | |
test_creation_from_tensor | 0.4459ms | 81.4529μs | 12.2770 KOps/s | 11.9703 KOps/s | |
test_add_one[memmap_tensor0] | 0.4478ms | 7.0272μs | 142.3047 KOps/s | 155.3833 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8611μs | 0.4051μs | 2.4683 MOps/s | 2.5096 MOps/s | |
test_stack[memmap_tensor0] | 28.5210μs | 4.1476μs | 241.1061 KOps/s | 240.1238 KOps/s | |
test_memmaptd_index | 1.8598ms | 0.2430ms | 4.1147 KOps/s | 4.0873 KOps/s | |
test_memmaptd_index_astensor | 0.8028ms | 0.3012ms | 3.3200 KOps/s | 3.2833 KOps/s | |
test_memmaptd_index_op | 1.0347ms | 0.5965ms | 1.6766 KOps/s | 1.7266 KOps/s | |
test_serialize_model | 0.1314s | 0.1301s | 7.6875 Ops/s | 7.6445 Ops/s | |
test_serialize_model_pickle | 1.3512s | 1.1921s | 0.8388 Ops/s | 0.8063 Ops/s | |
test_serialize_weights | 0.4189s | 0.1708s | 5.8561 Ops/s | 7.6864 Ops/s | |
test_serialize_weights_returnearly | 0.3454s | 53.2736ms | 18.7710 Ops/s | 14.5072 Ops/s | |
test_serialize_weights_pickle | 1.3842s | 1.1951s | 0.8367 Ops/s | 0.8174 Ops/s | |
test_reshape_pytree | 53.3610μs | 21.6860μs | 46.1127 KOps/s | 45.5722 KOps/s | |
test_reshape_td | 63.0910μs | 27.2149μs | 36.7446 KOps/s | 36.4216 KOps/s | |
test_view_pytree | 48.5300μs | 21.5252μs | 46.4572 KOps/s | 46.4260 KOps/s | |
test_view_td | 68.1710μs | 29.5883μs | 33.7972 KOps/s | 33.0528 KOps/s | |
test_unbind_pytree | 60.6810μs | 27.7576μs | 36.0262 KOps/s | 36.3385 KOps/s | |
test_unbind_td | 0.8356ms | 36.5301μs | 27.3747 KOps/s | 28.3102 KOps/s | |
test_split_pytree | 61.4110μs | 29.5678μs | 33.8205 KOps/s | 33.7351 KOps/s | |
test_split_td | 1.0887ms | 38.6411μs | 25.8792 KOps/s | 26.0185 KOps/s | |
test_add_pytree | 67.8810μs | 34.3747μs | 29.0912 KOps/s | 28.8700 KOps/s | |
test_add_td | 94.0520μs | 51.9943μs | 19.2329 KOps/s | 20.7396 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1695ms | 0.1182ms | 8.4595 KOps/s | 8.3170 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2185ms | 0.1303ms | 7.6735 KOps/s | 7.7171 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1588ms | 94.5884μs | 10.5721 KOps/s | 10.2935 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 2.7617ms | 0.1510ms | 6.6204 KOps/s | 6.7595 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1515ms | 22.7164μs | 44.0211 KOps/s | 47.5781 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 77.8310μs | 29.3068μs | 34.1218 KOps/s | 34.6085 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1064ms | 64.2360μs | 15.5676 KOps/s | 15.5619 KOps/s | |
test_compile_copy_nested[pytree-eager] | 89.7110μs | 48.8204μs | 20.4832 KOps/s | 20.3306 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1801ms | 0.1408ms | 7.1045 KOps/s | 7.0879 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3176ms | 0.2155ms | 4.6398 KOps/s | 4.6389 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1374ms | 97.0694μs | 10.3019 KOps/s | 10.4225 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1130ms | 53.5615μs | 18.6701 KOps/s | 18.7145 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1771ms | 0.1380ms | 7.2454 KOps/s | 7.4739 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5442ms | 0.4832ms | 2.0694 KOps/s | 2.0857 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4057ms | 0.2588ms | 3.8643 KOps/s | 3.8550 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2782ms | 0.1468ms | 6.8106 KOps/s | 7.0971 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1624ms | 66.4273μs | 15.0540 KOps/s | 15.0739 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1459ms | 0.1021ms | 9.7901 KOps/s | 10.3683 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4597ms | 0.4076ms | 2.4534 KOps/s | 2.4679 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1812ms | 0.1362ms | 7.3448 KOps/s | 7.5489 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 55.9810μs | 18.9979μs | 52.6374 KOps/s | 58.1463 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 62.8120μs | 31.3133μs | 31.9353 KOps/s | 32.7470 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1048ms | 69.3601μs | 14.4175 KOps/s | 14.3738 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1757ms | 51.5638μs | 19.3935 KOps/s | 19.5680 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.5972ms | 0.3842ms | 2.6026 KOps/s | 2.2726 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9054ms | 2.6449ms | 378.0829 Ops/s | 387.3894 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5905ms | 0.4321ms | 2.3141 KOps/s | 2.2990 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7744ms | 2.6807ms | 373.0331 Ops/s | 379.4252 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2654ms | 0.1156ms | 8.6475 KOps/s | 8.7849 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5641ms | 82.9199μs | 12.0598 KOps/s | 11.7801 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5302ms | 0.1100ms | 9.0941 KOps/s | 9.0615 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2228ms | 71.3963μs | 14.0063 KOps/s | 13.6012 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1850ms | 0.1088ms | 9.1888 KOps/s | 8.9686 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1143ms | 70.3731μs | 14.2100 KOps/s | 13.6162 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2471ms | 0.1032ms | 9.6853 KOps/s | 10.0434 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1445ms | 16.8045μs | 59.5078 KOps/s | 57.5834 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1441ms | 97.9321μs | 10.2112 KOps/s | 10.5993 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 55.5710μs | 15.7624μs | 63.4422 KOps/s | 50.9103 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1738ms | 98.0328μs | 10.2007 KOps/s | 10.5503 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1059ms | 15.3067μs | 65.3307 KOps/s | 64.4480 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1470ms | 0.1032ms | 9.6920 KOps/s | 10.0897 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6157ms | 16.6919μs | 59.9093 KOps/s | 58.8217 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1381ms | 99.4047μs | 10.0599 KOps/s | 10.5568 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 58.5610μs | 15.8189μs | 63.2154 KOps/s | 65.2958 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1501ms | 99.4297μs | 10.0574 KOps/s | 10.4797 KOps/s | |
test_compile_indexing[int-pytree-eager] | 47.0310μs | 15.6248μs | 64.0007 KOps/s | 65.3317 KOps/s | |
test_mod_add[eager] | 86.8210μs | 38.8715μs | 25.7258 KOps/s | 26.7702 KOps/s | |
test_mod_add[compile] | 0.3846ms | 79.1847μs | 12.6287 KOps/s | 12.2004 KOps/s | |
test_mod_add[compile-overhead] | 0.3252ms | 0.1669ms | 5.9913 KOps/s | 5.6891 KOps/s | |
test_mod_wrap[eager] | 0.3304ms | 0.2596ms | 3.8514 KOps/s | 3.8262 KOps/s | |
test_mod_wrap[compile] | 0.3723ms | 0.2861ms | 3.4947 KOps/s | 3.5969 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1035ms | 3.7667ms | 265.4814 Ops/s | 277.5916 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6514ms | 1.3661ms | 732.0336 Ops/s | 690.4860 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3405ms | 1.2502ms | 799.8534 Ops/s | 741.7167 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3948ms | 0.9227ms | 1.0838 KOps/s | 961.6564 Ops/s | |
test_seq_add[eager] | 0.1826ms | 0.1204ms | 8.3051 KOps/s | 8.7244 KOps/s | |
test_seq_add[compile] | 0.1366ms | 88.5925μs | 11.2876 KOps/s | 11.6546 KOps/s | |
test_seq_add[compile-overhead] | 0.2588ms | 0.1274ms | 7.8502 KOps/s | 7.8369 KOps/s | |
test_seq_wrap[eager] | 0.4979ms | 0.4168ms | 2.3991 KOps/s | 2.3756 KOps/s | |
test_seq_wrap[compile] | 0.3700ms | 0.2959ms | 3.3798 KOps/s | 3.3923 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3278ms | 0.2213ms | 4.5194 KOps/s | 4.5229 KOps/s | |
test_func_call_runtime[False-eager] | 0.8724ms | 0.7325ms | 1.3653 KOps/s | 1.3663 KOps/s | |
test_func_call_runtime[False-compile] | 0.8823ms | 0.7218ms | 1.3855 KOps/s | 1.3731 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4039ms | 0.3599ms | 2.7782 KOps/s | 2.8018 KOps/s | |
test_func_call_runtime[True-eager] | 0.9828ms | 0.8868ms | 1.1276 KOps/s | 1.1146 KOps/s | |
test_func_call_runtime[True-compile] | 0.7953ms | 0.7429ms | 1.3461 KOps/s | 1.3355 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4997ms | 0.3790ms | 2.6388 KOps/s | 2.5825 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7987ms | 0.7252ms | 1.3789 KOps/s | 1.3064 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8020ms | 0.7309ms | 1.3681 KOps/s | 1.3188 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4011ms | 0.3597ms | 2.7800 KOps/s | 2.7971 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0921ms | 0.9901ms | 1.0100 KOps/s | 995.1820 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8397ms | 0.7780ms | 1.2854 KOps/s | 1.2917 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4502ms | 0.4055ms | 2.4661 KOps/s | 2.4705 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5500ms | 2.0734ms | 482.3012 Ops/s | 478.1972 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9542ms | 0.7922ms | 1.2623 KOps/s | 1.2635 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5509ms | 0.4101ms | 2.4387 KOps/s | 2.4501 KOps/s | |
test_distributed | 3.9673ms | 0.1906ms | 5.2473 KOps/s | 8.6352 KOps/s | |
test_tdmodule | 64.9610μs | 21.0847μs | 47.4278 KOps/s | 49.8220 KOps/s | |
test_tdmodule_dispatch | 84.7820μs | 38.5522μs | 25.9389 KOps/s | 27.4361 KOps/s | |
test_tdseq | 42.9300μs | 22.2571μs | 44.9295 KOps/s | 47.4589 KOps/s | |
test_tdseq_dispatch | 60.5810μs | 40.3654μs | 24.7737 KOps/s | 25.3321 KOps/s | |
test_instantiation_functorch | 1.6313ms | 1.5479ms | 646.0338 Ops/s | 662.7182 Ops/s | |
test_exec_functorch | 0.2142ms | 0.1462ms | 6.8421 KOps/s | 7.2215 KOps/s | |
test_exec_functional_call | 0.1749ms | 0.1362ms | 7.3404 KOps/s | 7.7121 KOps/s | |
test_exec_td_decorator | 0.3879ms | 0.1856ms | 5.3892 KOps/s | 5.5858 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8297ms | 0.6825ms | 1.4653 KOps/s | 1.4657 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7917ms | 0.6807ms | 1.4691 KOps/s | 1.4651 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7033ms | 0.5890ms | 1.6978 KOps/s | 1.6935 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7309ms | 0.5907ms | 1.6928 KOps/s | 1.6930 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.4051ms | 19.1844ms | 52.1257 Ops/s | 52.2399 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.9357ms | 19.3117ms | 51.7821 Ops/s | 52.1430 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5048ms | 19.0820ms | 52.4053 Ops/s | 52.8270 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.1506ms | 19.0146ms | 52.5913 Ops/s | 52.5573 Ops/s | |
test_to_module_speed[True] | 1.0739ms | 0.9730ms | 1.0278 KOps/s | 1.0246 KOps/s | |
test_to_module_speed[False] | 1.0368ms | 0.9507ms | 1.0518 KOps/s | 1.0274 KOps/s | |
test_tc_init | 73.3410μs | 38.2171μs | 26.1663 KOps/s | 25.7245 KOps/s | |
test_tc_init_nested | 0.1590ms | 76.0705μs | 13.1457 KOps/s | 12.6595 KOps/s | |
test_tc_first_layer_tensor | 4.3771μs | 0.7279μs | 1.3738 MOps/s | 1.4245 MOps/s | |
test_tc_first_layer_nontensor | 22.9410μs | 2.3776μs | 420.5912 KOps/s | 425.1506 KOps/s | |
test_tc_second_layer_tensor | 30.7107μs | 1.4600μs | 684.9270 KOps/s | 691.8700 KOps/s | |
test_tc_second_layer_nontensor | 36.9500μs | 3.1334μs | 319.1453 KOps/s | 324.1446 KOps/s | |
test_unbind | 0.2285s | 13.3588ms | 74.8573 Ops/s | 142.1178 Ops/s | |
test_full_like | 12.4294ms | 9.4684ms | 105.6150 Ops/s | 104.7369 Ops/s | |
test_zeros_like | 9.3961ms | 7.2017ms | 138.8557 Ops/s | 228.5277 Ops/s | |
test_ones_like | 5.4071ms | 4.3718ms | 228.7400 Ops/s | 229.1588 Ops/s | |
test_clone | 7.3631ms | 6.6630ms | 150.0825 Ops/s | 101.3250 Ops/s | |
test_squeeze | 81.1810μs | 9.6498μs | 103.6287 KOps/s | 106.2483 KOps/s | |
test_unsqueeze | 0.1267ms | 71.5902μs | 13.9684 KOps/s | 13.8398 KOps/s | |
test_split | 0.2889ms | 0.1598ms | 6.2561 KOps/s | 6.3687 KOps/s | |
test_permute | 0.2519ms | 0.1785ms | 5.6016 KOps/s | 5.5137 KOps/s | |
test_stack | 53.5196ms | 52.0889ms | 19.1980 Ops/s | 17.1623 Ops/s | |
test_cat | 52.6881ms | 51.8927ms | 19.2705 Ops/s | 17.1705 Ops/s |
vmoens
added a commit
that referenced
this pull request
Dec 20, 2024
ghstack-source-id: 48311d7a98a9895b10e5552e5b4a4f13764607e0 Pull Request resolved: #1153
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Refactor
Refactoring code - not a new feature
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):
__abs__
docstrings,__neg__
,__rxor__
,__ror__
,__invert__
,__and__
,__rand__
,__radd__
,__rtruediv__
,__rmul__
,__rsub__
,__rpow__
,bitwise_and
,logical_and
#1154