-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Refactor] Fix property handling in TC #1178
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Jan 10, 2025
ghstack-source-id: ad564f84401bb619d0a6163c352d3de43ea24437 Pull Request resolved: #1178
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Jan 10, 2025
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 55.7840μs | 21.1371μs | 47.3101 KOps/s | 48.8291 KOps/s | |
test_plain_set_stack_nested | 60.0010μs | 21.1479μs | 47.2860 KOps/s | 48.7623 KOps/s | |
test_plain_set_nested_inplace | 61.5840μs | 22.8868μs | 43.6934 KOps/s | 45.2845 KOps/s | |
test_plain_set_stack_nested_inplace | 86.2300μs | 23.1803μs | 43.1401 KOps/s | 45.4033 KOps/s | |
test_items | 23.6440μs | 4.2871μs | 233.2585 KOps/s | 232.2122 KOps/s | |
test_items_nested | 0.5340ms | 0.3937ms | 2.5398 KOps/s | 2.5352 KOps/s | |
test_items_nested_locked | 0.6050ms | 0.3930ms | 2.5443 KOps/s | 2.5317 KOps/s | |
test_items_nested_leaf | 0.1748ms | 76.8172μs | 13.0179 KOps/s | 13.0947 KOps/s | |
test_items_stack_nested | 0.9122ms | 0.3987ms | 2.5083 KOps/s | 2.5255 KOps/s | |
test_items_stack_nested_leaf | 0.1499ms | 77.9788μs | 12.8240 KOps/s | 13.2199 KOps/s | |
test_items_stack_nested_locked | 0.7348ms | 0.3950ms | 2.5314 KOps/s | 2.5532 KOps/s | |
test_keys | 24.7160μs | 3.4914μs | 286.4220 KOps/s | 284.0352 KOps/s | |
test_keys_nested | 0.2472ms | 0.1623ms | 6.1620 KOps/s | 6.0388 KOps/s | |
test_keys_nested_locked | 0.8728ms | 0.1686ms | 5.9306 KOps/s | 5.8010 KOps/s | |
test_keys_nested_leaf | 0.2241ms | 0.1415ms | 7.0653 KOps/s | 6.9266 KOps/s | |
test_keys_stack_nested | 0.2576ms | 0.1620ms | 6.1713 KOps/s | 6.0245 KOps/s | |
test_keys_stack_nested_leaf | 0.2291ms | 0.1419ms | 7.0471 KOps/s | 6.9280 KOps/s | |
test_keys_stack_nested_locked | 0.2651ms | 0.1692ms | 5.9098 KOps/s | 5.7807 KOps/s | |
test_values | 12.2182μs | 1.0316μs | 969.3534 KOps/s | 968.7487 KOps/s | |
test_values_nested | 0.1141ms | 62.0899μs | 16.1057 KOps/s | 15.7492 KOps/s | |
test_values_nested_locked | 0.1142ms | 62.0401μs | 16.1186 KOps/s | 15.8751 KOps/s | |
test_values_nested_leaf | 0.1223ms | 70.5280μs | 14.1788 KOps/s | 12.7146 KOps/s | |
test_values_stack_nested | 0.1165ms | 62.7806μs | 15.9285 KOps/s | 15.8772 KOps/s | |
test_values_stack_nested_leaf | 0.1255ms | 70.9429μs | 14.0958 KOps/s | 13.6993 KOps/s | |
test_values_stack_nested_locked | 0.1100ms | 62.8878μs | 15.9013 KOps/s | 15.8677 KOps/s | |
test_membership | 18.4040μs | 0.8740μs | 1.1441 MOps/s | 1.4018 MOps/s | |
test_membership_nested | 44.1620μs | 2.9951μs | 333.8741 KOps/s | 343.1624 KOps/s | |
test_membership_nested_leaf | 42.1280μs | 3.0282μs | 330.2275 KOps/s | 335.3976 KOps/s | |
test_membership_stacked_nested | 30.2660μs | 2.9683μs | 336.8879 KOps/s | 346.6637 KOps/s | |
test_membership_stacked_nested_leaf | 24.7160μs | 3.0253μs | 330.5475 KOps/s | 324.8417 KOps/s | |
test_membership_nested_last | 0.1015ms | 4.5539μs | 219.5938 KOps/s | 231.2102 KOps/s | |
test_membership_nested_leaf_last | 0.1343ms | 4.5978μs | 217.4943 KOps/s | 232.0835 KOps/s | |
test_membership_stacked_nested_last | 42.2080μs | 4.3930μs | 227.6336 KOps/s | 232.2085 KOps/s | |
test_membership_stacked_nested_leaf_last | 34.4840μs | 4.3632μs | 229.1902 KOps/s | 231.3675 KOps/s | |
test_nested_getleaf | 39.5030μs | 10.6598μs | 93.8107 KOps/s | 95.1380 KOps/s | |
test_nested_get | 54.9020μs | 10.0339μs | 99.6623 KOps/s | 100.1976 KOps/s | |
test_stacked_getleaf | 30.4370μs | 10.5341μs | 94.9298 KOps/s | 94.5952 KOps/s | |
test_stacked_get | 33.2120μs | 10.0953μs | 99.0557 KOps/s | 100.8129 KOps/s | |
test_nested_getitemleaf | 0.1011ms | 11.2825μs | 88.6326 KOps/s | 90.4922 KOps/s | |
test_nested_getitem | 81.3710μs | 10.6936μs | 93.5140 KOps/s | 95.8747 KOps/s | |
test_stacked_getitemleaf | 37.0290μs | 11.1551μs | 89.6451 KOps/s | 91.5681 KOps/s | |
test_stacked_getitem | 41.4270μs | 10.6618μs | 93.7931 KOps/s | 94.6362 KOps/s | |
test_lock_nested | 6.8420ms | 0.4598ms | 2.1750 KOps/s | 1.7820 KOps/s | |
test_lock_stack_nested | 0.6465ms | 0.4231ms | 2.3633 KOps/s | 2.3782 KOps/s | |
test_unlock_nested | 0.8342ms | 0.3757ms | 2.6619 KOps/s | 2.6488 KOps/s | |
test_unlock_stack_nested | 0.7499ms | 0.3429ms | 2.9167 KOps/s | 2.9190 KOps/s | |
test_flatten_speed | 0.2066ms | 0.1005ms | 9.9544 KOps/s | 9.9366 KOps/s | |
test_unflatten_speed | 0.7114ms | 0.5218ms | 1.9166 KOps/s | 1.9457 KOps/s | |
test_common_ops | 1.8049ms | 0.8137ms | 1.2290 KOps/s | 1.2656 KOps/s | |
test_creation | 26.8600μs | 2.5068μs | 398.9126 KOps/s | 402.5116 KOps/s | |
test_creation_empty | 46.9880μs | 13.2077μs | 75.7134 KOps/s | 89.2207 KOps/s | |
test_creation_nested_1 | 55.1330μs | 16.3487μs | 61.1668 KOps/s | 70.5575 KOps/s | |
test_creation_nested_2 | 63.0370μs | 20.7858μs | 48.1098 KOps/s | 53.6683 KOps/s | |
test_clone | 0.1170ms | 13.2501μs | 75.4709 KOps/s | 74.7309 KOps/s | |
test_getitem[int] | 1.1895ms | 13.1341μs | 76.1379 KOps/s | 76.8875 KOps/s | |
test_getitem[slice_int] | 0.1413ms | 25.1703μs | 39.7293 KOps/s | 39.4542 KOps/s | |
test_getitem[range] | 0.1997ms | 49.0460μs | 20.3890 KOps/s | 20.0793 KOps/s | |
test_getitem[tuple] | 0.1347ms | 20.7401μs | 48.2158 KOps/s | 47.5712 KOps/s | |
test_getitem[list] | 0.3456ms | 43.9431μs | 22.7567 KOps/s | 22.1090 KOps/s | |
test_setitem_dim[int] | 51.2850μs | 25.9501μs | 38.5355 KOps/s | 38.0744 KOps/s | |
test_setitem_dim[slice_int] | 84.4570μs | 51.4600μs | 19.4326 KOps/s | 18.5864 KOps/s | |
test_setitem_dim[range] | 0.1312ms | 73.8033μs | 13.5495 KOps/s | 13.2286 KOps/s | |
test_setitem_dim[tuple] | 86.7520μs | 40.9283μs | 24.4330 KOps/s | 23.4690 KOps/s | |
test_setitem | 0.1333ms | 21.1639μs | 47.2503 KOps/s | 49.2135 KOps/s | |
test_set | 0.1333ms | 20.8757μs | 47.9025 KOps/s | 51.3996 KOps/s | |
test_set_shared | 3.6236ms | 0.1704ms | 5.8691 KOps/s | 5.8490 KOps/s | |
test_update | 0.2640ms | 24.2838μs | 41.1797 KOps/s | 44.3744 KOps/s | |
test_update_nested | 0.1915ms | 34.4934μs | 28.9911 KOps/s | 29.9096 KOps/s | |
test_update__nested | 0.8270ms | 34.0178μs | 29.3963 KOps/s | 29.4564 KOps/s | |
test_set_nested | 0.1540ms | 23.0841μs | 43.3199 KOps/s | 45.6106 KOps/s | |
test_set_nested_new | 0.1625ms | 27.4345μs | 36.4505 KOps/s | 37.6370 KOps/s | |
test_select | 0.2161ms | 43.7309μs | 22.8671 KOps/s | 23.5294 KOps/s | |
test_select_nested | 0.1228ms | 62.8386μs | 15.9138 KOps/s | 15.9912 KOps/s | |
test_exclude_nested | 0.1702ms | 81.0858μs | 12.3326 KOps/s | 12.3310 KOps/s | |
test_empty[True] | 0.6179ms | 0.4025ms | 2.4843 KOps/s | 2.4781 KOps/s | |
test_empty[False] | 9.9210μs | 1.3739μs | 727.8511 KOps/s | 692.3693 KOps/s | |
test_unbind_speed | 0.5158ms | 0.2685ms | 3.7251 KOps/s | 3.7066 KOps/s | |
test_unbind_speed_stack0 | 0.5231ms | 0.2696ms | 3.7093 KOps/s | 3.7314 KOps/s | |
test_unbind_speed_stack1 | 0.1162s | 0.8221ms | 1.2165 KOps/s | 1.3321 KOps/s | |
test_split | 2.5365ms | 1.6076ms | 622.0416 Ops/s | 560.1719 Ops/s | |
test_chunk | 0.1182s | 1.9848ms | 503.8217 Ops/s | 561.2586 Ops/s | |
test_consolidate_njt[False-None] | 11.3687ms | 8.5047ms | 117.5820 Ops/s | 121.6690 Ops/s | |
test_creation[device0] | 0.3029ms | 92.4164μs | 10.8206 KOps/s | 10.6728 KOps/s | |
test_creation_from_tensor | 4.0045ms | 97.0670μs | 10.3022 KOps/s | 10.3898 KOps/s | |
test_add_one[memmap_tensor0] | 0.2668ms | 5.0312μs | 198.7604 KOps/s | 200.9296 KOps/s | |
test_contiguous[memmap_tensor0] | 10.7300μs | 0.5084μs | 1.9668 MOps/s | 1.9072 MOps/s | |
test_stack[memmap_tensor0] | 50.5640μs | 3.4284μs | 291.6779 KOps/s | 286.1609 KOps/s | |
test_memmaptd_index | 1.0349ms | 0.2447ms | 4.0867 KOps/s | 4.2660 KOps/s | |
test_memmaptd_index_astensor | 0.6668ms | 0.3289ms | 3.0408 KOps/s | 3.1169 KOps/s | |
test_memmaptd_index_op | 1.0589ms | 0.6245ms | 1.6013 KOps/s | 1.6989 KOps/s | |
test_serialize_model | 0.1273s | 0.1196s | 8.3615 Ops/s | 8.6529 Ops/s | |
test_serialize_model_pickle | 0.5352s | 0.4091s | 2.4441 Ops/s | 2.4782 Ops/s | |
test_serialize_weights | 0.1271s | 0.1175s | 8.5106 Ops/s | 7.4557 Ops/s | |
test_serialize_weights_returnearly | 0.1616s | 0.1561s | 6.4044 Ops/s | 6.2105 Ops/s | |
test_serialize_weights_pickle | 0.5546s | 0.4289s | 2.3314 Ops/s | 2.5789 Ops/s | |
test_serialize_weights_filesystem | 0.1513s | 0.1466s | 6.8231 Ops/s | 6.9506 Ops/s | |
test_serialize_model_filesystem | 0.1550s | 0.1522s | 6.5721 Ops/s | 6.0367 Ops/s | |
test_reshape_pytree | 66.7940μs | 26.4825μs | 37.7608 KOps/s | 37.4194 KOps/s | |
test_reshape_td | 79.9490μs | 33.0669μs | 30.2417 KOps/s | 30.8028 KOps/s | |
test_view_pytree | 65.2310μs | 26.4720μs | 37.7757 KOps/s | 37.3416 KOps/s | |
test_view_td | 0.1034ms | 39.8190μs | 25.1136 KOps/s | 26.8348 KOps/s | |
test_unbind_pytree | 66.0630μs | 29.5966μs | 33.7877 KOps/s | 33.1312 KOps/s | |
test_unbind_td | 0.3137ms | 39.7948μs | 25.1289 KOps/s | 25.2762 KOps/s | |
test_split_pytree | 66.2730μs | 29.6247μs | 33.7556 KOps/s | 33.0873 KOps/s | |
test_split_td | 0.5800ms | 45.2224μs | 22.1129 KOps/s | 21.9842 KOps/s | |
test_add_pytree | 0.1086ms | 35.4491μs | 28.2095 KOps/s | 27.8984 KOps/s | |
test_add_td | 0.1376ms | 57.9392μs | 17.2595 KOps/s | 17.9575 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1584ms | 62.8514μs | 15.9105 KOps/s | 15.6504 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3928ms | 0.1705ms | 5.8654 KOps/s | 5.7759 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1251ms | 45.8061μs | 21.8311 KOps/s | 22.0087 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2219ms | 0.1177ms | 8.4946 KOps/s | 8.4763 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 71.1520μs | 27.0849μs | 36.9209 KOps/s | 38.2010 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1181ms | 58.2441μs | 17.1691 KOps/s | 17.1418 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1439ms | 77.7006μs | 12.8699 KOps/s | 13.0183 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1324ms | 67.1035μs | 14.9023 KOps/s | 15.1301 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1929ms | 0.1060ms | 9.4352 KOps/s | 9.3606 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4024ms | 0.2120ms | 4.7175 KOps/s | 4.6700 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1429ms | 46.1481μs | 21.6693 KOps/s | 21.2690 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4716ms | 66.9093μs | 14.9456 KOps/s | 14.6299 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1751ms | 0.1043ms | 9.5877 KOps/s | 9.7832 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3388ms | 0.2014ms | 4.9657 KOps/s | 4.9678 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4281ms | 0.2312ms | 4.3247 KOps/s | 4.2824 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2352ms | 0.1057ms | 9.4566 KOps/s | 9.4190 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2572ms | 63.6587μs | 15.7088 KOps/s | 16.0125 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2295ms | 48.9226μs | 20.4405 KOps/s | 21.5057 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3371ms | 0.1587ms | 6.3027 KOps/s | 6.3759 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1812ms | 0.1031ms | 9.6987 KOps/s | 9.7751 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 70.8220μs | 22.1259μs | 45.1959 KOps/s | 46.4960 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1365ms | 65.6000μs | 15.2439 KOps/s | 14.6291 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1979ms | 79.0183μs | 12.6553 KOps/s | 12.8094 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1358ms | 67.8446μs | 14.7396 KOps/s | 14.9046 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3060ms | 0.2048ms | 4.8816 KOps/s | 4.8116 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.4619ms | 1.3154ms | 760.2240 Ops/s | 765.0792 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3255ms | 0.2034ms | 4.9169 KOps/s | 4.9896 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.6498ms | 0.7862ms | 1.2719 KOps/s | 1.2905 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.6239ms | 0.4438ms | 2.2532 KOps/s | 2.2396 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.4945ms | 2.7541ms | 363.0943 Ops/s | 372.6680 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1046ms | 36.5482μs | 27.3612 KOps/s | 27.6239 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5396ms | 33.1730μs | 30.1450 KOps/s | 30.3158 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 93.2040μs | 29.5056μs | 33.8918 KOps/s | 34.2202 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1052ms | 22.9329μs | 43.6054 KOps/s | 42.5862 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 77.0330μs | 29.9021μs | 33.4425 KOps/s | 33.5314 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1245ms | 22.7395μs | 43.9764 KOps/s | 43.3300 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1079ms | 51.0421μs | 19.5917 KOps/s | 19.1593 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6726ms | 20.2053μs | 49.4920 KOps/s | 49.4389 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1002ms | 44.2626μs | 22.5924 KOps/s | 22.1605 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 52.5780μs | 18.8981μs | 52.9154 KOps/s | 53.1178 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1093ms | 45.1000μs | 22.1729 KOps/s | 21.6933 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 53.6600μs | 18.7678μs | 53.2828 KOps/s | 52.8962 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1173ms | 51.8985μs | 19.2684 KOps/s | 18.5026 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.1863ms | 20.2174μs | 49.4625 KOps/s | 49.5503 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1121ms | 45.3905μs | 22.0310 KOps/s | 21.9091 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 78.0080μs | 18.8864μs | 52.9481 KOps/s | 53.4157 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1020ms | 44.4831μs | 22.4804 KOps/s | 21.8550 KOps/s | |
test_compile_indexing[int-pytree-eager] | 57.0560μs | 19.0030μs | 52.6233 KOps/s | 53.2760 KOps/s | |
test_mod_add[eager] | 87.2030μs | 35.7222μs | 27.9938 KOps/s | 28.5550 KOps/s | |
test_mod_add[compile] | 0.2230ms | 49.6873μs | 20.1259 KOps/s | 20.3864 KOps/s | |
test_mod_add[compile-overhead] | 0.1203ms | 48.4703μs | 20.6312 KOps/s | 20.2302 KOps/s | |
test_mod_wrap[eager] | 0.3616ms | 0.2210ms | 4.5259 KOps/s | 4.4111 KOps/s | |
test_mod_wrap[compile] | 0.3130ms | 0.2063ms | 4.8483 KOps/s | 4.8380 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3193ms | 0.2040ms | 4.9027 KOps/s | 4.8167 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.3277ms | 11.1438ms | 89.7361 Ops/s | 81.3521 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.8491ms | 11.0829ms | 90.2291 Ops/s | 76.9347 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.4415ms | 11.1312ms | 89.8375 Ops/s | 75.6392 Ops/s | |
test_seq_add[eager] | 0.2285ms | 0.1168ms | 8.5589 KOps/s | 8.5417 KOps/s | |
test_seq_add[compile] | 0.1520ms | 62.5276μs | 15.9929 KOps/s | 15.6267 KOps/s | |
test_seq_add[compile-overhead] | 0.1299ms | 60.4078μs | 16.5541 KOps/s | 16.1587 KOps/s | |
test_seq_wrap[eager] | 0.7518ms | 0.4519ms | 2.2131 KOps/s | 2.1867 KOps/s | |
test_seq_wrap[compile] | 0.4180ms | 0.2288ms | 4.3709 KOps/s | 4.2090 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4133ms | 0.2283ms | 4.3808 KOps/s | 4.3297 KOps/s | |
test_func_call_runtime[False-eager] | 0.6789ms | 0.5345ms | 1.8708 KOps/s | 1.7945 KOps/s | |
test_func_call_runtime[False-compile] | 0.5661ms | 0.4255ms | 2.3504 KOps/s | 2.3312 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5619ms | 0.4277ms | 2.3379 KOps/s | 2.3131 KOps/s | |
test_func_call_runtime[True-eager] | 1.1544ms | 0.7526ms | 1.3287 KOps/s | 1.2752 KOps/s | |
test_func_call_runtime[True-compile] | 0.8984ms | 0.4744ms | 2.1078 KOps/s | 2.1469 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8314ms | 0.4718ms | 2.1195 KOps/s | 2.1437 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9291ms | 0.5407ms | 1.8493 KOps/s | 1.7970 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6082ms | 0.4267ms | 2.3438 KOps/s | 2.3558 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.8174ms | 0.4273ms | 2.3405 KOps/s | 2.3642 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1130ms | 0.8944ms | 1.1181 KOps/s | 1.0818 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6508ms | 0.4945ms | 2.0224 KOps/s | 2.0298 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6100ms | 0.4916ms | 2.0342 KOps/s | 2.0375 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6097ms | 1.9130ms | 522.7327 Ops/s | 507.9864 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.7906ms | 0.5264ms | 1.8997 KOps/s | 1.8048 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.6480ms | 0.5235ms | 1.9101 KOps/s | 1.8646 KOps/s | |
test_distributed | 0.2687ms | 0.1279ms | 7.8202 KOps/s | 7.4707 KOps/s | |
test_tdmodule | 45.0240μs | 26.6981μs | 37.4559 KOps/s | 38.2915 KOps/s | |
test_tdmodule_dispatch | 77.3640μs | 49.2327μs | 20.3117 KOps/s | 21.0302 KOps/s | |
test_tdseq | 60.2320μs | 29.8015μs | 33.5553 KOps/s | 33.2147 KOps/s | |
test_tdseq_dispatch | 96.2600μs | 55.6381μs | 17.9733 KOps/s | 18.2258 KOps/s | |
test_instantiation_functorch | 1.6815ms | 1.5330ms | 652.2995 Ops/s | 631.6163 Ops/s | |
test_exec_functorch | 0.3233ms | 0.1782ms | 5.6122 KOps/s | 5.5374 KOps/s | |
test_exec_functional_call | 0.2610ms | 0.1719ms | 5.8162 KOps/s | 5.7364 KOps/s | |
test_exec_td_decorator | 0.5197ms | 0.2324ms | 4.3021 KOps/s | 4.1947 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9171ms | 0.6568ms | 1.5226 KOps/s | 1.4583 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0423ms | 0.6558ms | 1.5249 KOps/s | 1.4950 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8212ms | 0.5289ms | 1.8907 KOps/s | 1.8518 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8321ms | 0.5302ms | 1.8862 KOps/s | 1.8676 KOps/s | |
test_to_module_speed[True] | 1.7483ms | 1.3210ms | 757.0192 Ops/s | 725.5831 Ops/s | |
test_to_module_speed[False] | 2.0885ms | 1.2993ms | 769.6181 Ops/s | 741.0235 Ops/s | |
test_tc_init | 89.8680μs | 47.0798μs | 21.2405 KOps/s | 20.4582 KOps/s | |
test_tc_init_nested | 0.1774ms | 91.5030μs | 10.9286 KOps/s | 10.0887 KOps/s | |
test_tc_first_layer_tensor | 17.5530μs | 1.5574μs | 642.0989 KOps/s | 659.0196 KOps/s | |
test_tc_first_layer_nontensor | 41.7580μs | 4.7942μs | 208.5850 KOps/s | 215.1429 KOps/s | |
test_tc_second_layer_tensor | 40.9360μs | 2.8697μs | 348.4700 KOps/s | 354.0390 KOps/s | |
test_tc_second_layer_nontensor | 33.0410μs | 6.1145μs | 163.5460 KOps/s | 167.1968 KOps/s | |
test_unbind | 0.2350s | 13.5328ms | 73.8943 Ops/s | 78.6957 Ops/s | |
test_full_like | 8.5336ms | 7.7915ms | 128.3447 Ops/s | 72.8023 Ops/s | |
test_zeros_like | 3.7197ms | 3.0018ms | 333.1373 Ops/s | 125.5598 Ops/s | |
test_ones_like | 3.9409ms | 3.5519ms | 281.5402 Ops/s | 125.3765 Ops/s | |
test_clone | 6.0259ms | 5.2053ms | 192.1122 Ops/s | 102.5179 Ops/s | |
test_squeeze | 60.2620μs | 11.8147μs | 84.6402 KOps/s | 82.8546 KOps/s | |
test_unsqueeze | 0.3032ms | 90.3668μs | 11.0660 KOps/s | 11.0386 KOps/s | |
test_split | 0.3257ms | 0.1951ms | 5.1247 KOps/s | 5.0139 KOps/s | |
test_permute | 0.2984ms | 0.2004ms | 4.9902 KOps/s | 5.0495 KOps/s | |
test_stack | 28.4580ms | 24.8341ms | 40.2673 Ops/s | 38.7454 Ops/s | |
test_cat | 27.8976ms | 24.6248ms | 40.6094 Ops/s | 38.9901 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 26.0300μs | 11.3641μs | 87.9966 KOps/s | 77.1256 KOps/s | |
test_plain_set_stack_nested | 28.2310μs | 11.5727μs | 86.4100 KOps/s | 76.1548 KOps/s | |
test_plain_set_nested_inplace | 38.2500μs | 12.4839μs | 80.1034 KOps/s | 71.2887 KOps/s | |
test_plain_set_stack_nested_inplace | 77.1420μs | 12.4413μs | 80.3773 KOps/s | 70.8596 KOps/s | |
test_items | 25.6210μs | 2.9116μs | 343.4494 KOps/s | 335.1050 KOps/s | |
test_items_nested | 0.4043ms | 0.3737ms | 2.6761 KOps/s | 2.6301 KOps/s | |
test_items_nested_locked | 0.4394ms | 0.3734ms | 2.6778 KOps/s | 2.6650 KOps/s | |
test_items_nested_leaf | 89.2420μs | 62.2365μs | 16.0677 KOps/s | 16.1339 KOps/s | |
test_items_stack_nested | 0.4035ms | 0.3770ms | 2.6527 KOps/s | 2.6131 KOps/s | |
test_items_stack_nested_leaf | 94.9620μs | 62.6572μs | 15.9598 KOps/s | 15.8717 KOps/s | |
test_items_stack_nested_locked | 0.4423ms | 0.3749ms | 2.6674 KOps/s | 2.6234 KOps/s | |
test_keys | 28.8900μs | 3.4758μs | 287.7041 KOps/s | 285.8228 KOps/s | |
test_keys_nested | 0.1264ms | 88.1189μs | 11.3483 KOps/s | 11.2285 KOps/s | |
test_keys_nested_locked | 0.7792ms | 93.8314μs | 10.6574 KOps/s | 10.5643 KOps/s | |
test_keys_nested_leaf | 0.1442ms | 78.8028μs | 12.6899 KOps/s | 12.5549 KOps/s | |
test_keys_stack_nested | 0.1211ms | 88.8791μs | 11.2512 KOps/s | 11.0161 KOps/s | |
test_keys_stack_nested_leaf | 0.1127ms | 79.3545μs | 12.6017 KOps/s | 12.2561 KOps/s | |
test_keys_stack_nested_locked | 0.1423ms | 94.4548μs | 10.5871 KOps/s | 10.3834 KOps/s | |
test_values | 6.2485μs | 0.8617μs | 1.1604 MOps/s | 1.1520 MOps/s | |
test_values_nested | 97.9320μs | 38.2996μs | 26.1100 KOps/s | 26.3218 KOps/s | |
test_values_nested_locked | 68.9210μs | 39.5914μs | 25.2580 KOps/s | 25.4562 KOps/s | |
test_values_nested_leaf | 73.4410μs | 42.4872μs | 23.5365 KOps/s | 23.6332 KOps/s | |
test_values_stack_nested | 70.2310μs | 38.3404μs | 26.0822 KOps/s | 25.6010 KOps/s | |
test_values_stack_nested_leaf | 70.2810μs | 43.2597μs | 23.1162 KOps/s | 23.0461 KOps/s | |
test_values_stack_nested_locked | 0.1125ms | 39.6488μs | 25.2215 KOps/s | 24.7858 KOps/s | |
test_membership | 1.6581μs | 0.5581μs | 1.7916 MOps/s | 1.8141 MOps/s | |
test_membership_nested | 15.7900μs | 2.0522μs | 487.2793 KOps/s | 468.2853 KOps/s | |
test_membership_nested_leaf | 14.6805μs | 2.0602μs | 485.3793 KOps/s | 488.9194 KOps/s | |
test_membership_stacked_nested | 40.8510μs | 2.1262μs | 470.3157 KOps/s | 468.8351 KOps/s | |
test_membership_stacked_nested_leaf | 26.4600μs | 2.1258μs | 470.4154 KOps/s | 475.5069 KOps/s | |
test_membership_nested_last | 38.2810μs | 3.1552μs | 316.9342 KOps/s | 316.5543 KOps/s | |
test_membership_nested_leaf_last | 27.6000μs | 3.1435μs | 318.1214 KOps/s | 316.0421 KOps/s | |
test_membership_stacked_nested_last | 28.0610μs | 3.6420μs | 274.5769 KOps/s | 314.2446 KOps/s | |
test_membership_stacked_nested_leaf_last | 31.7000μs | 3.6705μs | 272.4393 KOps/s | 315.1670 KOps/s | |
test_nested_getleaf | 35.2310μs | 6.2066μs | 161.1176 KOps/s | 161.6268 KOps/s | |
test_nested_get | 30.1010μs | 5.8986μs | 169.5329 KOps/s | 169.6707 KOps/s | |
test_stacked_getleaf | 30.3100μs | 6.2076μs | 161.0928 KOps/s | 161.1698 KOps/s | |
test_stacked_get | 28.7710μs | 5.8836μs | 169.9653 KOps/s | 171.1834 KOps/s | |
test_nested_getitemleaf | 30.6310μs | 6.5194μs | 153.3882 KOps/s | 153.3021 KOps/s | |
test_nested_getitem | 26.8710μs | 6.1313μs | 163.0987 KOps/s | 160.7462 KOps/s | |
test_stacked_getitemleaf | 33.0710μs | 6.5152μs | 153.4872 KOps/s | 154.5264 KOps/s | |
test_stacked_getitem | 38.8610μs | 6.1262μs | 163.2328 KOps/s | 161.9371 KOps/s | |
test_lock_nested | 9.1671ms | 0.3833ms | 2.6090 KOps/s | 2.5970 KOps/s | |
test_lock_stack_nested | 0.3907ms | 0.3465ms | 2.8857 KOps/s | 2.8665 KOps/s | |
test_unlock_nested | 0.7718ms | 0.3168ms | 3.1567 KOps/s | 3.1624 KOps/s | |
test_unlock_stack_nested | 0.3235ms | 0.2863ms | 3.4923 KOps/s | 3.4895 KOps/s | |
test_flatten_speed | 0.1200ms | 78.6205μs | 12.7193 KOps/s | 12.7368 KOps/s | |
test_unflatten_speed | 0.4696ms | 0.3195ms | 3.1302 KOps/s | 3.0743 KOps/s | |
test_common_ops | 1.6760ms | 0.5728ms | 1.7459 KOps/s | 1.5619 KOps/s | |
test_creation | 0.1006ms | 1.7708μs | 564.7243 KOps/s | 558.3026 KOps/s | |
test_creation_empty | 44.4510μs | 6.3891μs | 156.5162 KOps/s | 105.9219 KOps/s | |
test_creation_nested_1 | 37.3510μs | 8.1700μs | 122.3991 KOps/s | 89.4513 KOps/s | |
test_creation_nested_2 | 33.1800μs | 10.8349μs | 92.2941 KOps/s | 71.8689 KOps/s | |
test_clone | 81.6820μs | 10.4893μs | 95.3348 KOps/s | 91.9503 KOps/s | |
test_getitem[int] | 1.8158ms | 10.9730μs | 91.1331 KOps/s | 93.1328 KOps/s | |
test_getitem[slice_int] | 0.1241ms | 20.8775μs | 47.8985 KOps/s | 47.7077 KOps/s | |
test_getitem[range] | 0.2158ms | 37.8165μs | 26.4435 KOps/s | 26.0945 KOps/s | |
test_getitem[tuple] | 0.1067ms | 18.1901μs | 54.9750 KOps/s | 55.7531 KOps/s | |
test_getitem[list] | 0.3195ms | 33.4193μs | 29.9228 KOps/s | 29.9571 KOps/s | |
test_setitem_dim[int] | 39.7410μs | 20.0059μs | 49.9854 KOps/s | 50.1016 KOps/s | |
test_setitem_dim[slice_int] | 74.8920μs | 40.8287μs | 24.4926 KOps/s | 25.3427 KOps/s | |
test_setitem_dim[range] | 94.1620μs | 58.1118μs | 17.2082 KOps/s | 18.1563 KOps/s | |
test_setitem_dim[tuple] | 69.5310μs | 34.1936μs | 29.2452 KOps/s | 30.6284 KOps/s | |
test_setitem | 97.6730μs | 14.1572μs | 70.6353 KOps/s | 63.2857 KOps/s | |
test_set | 84.9220μs | 13.6069μs | 73.4920 KOps/s | 64.8651 KOps/s | |
test_set_shared | 1.4044ms | 0.1534ms | 6.5198 KOps/s | 6.4908 KOps/s | |
test_update | 1.0085ms | 15.5192μs | 64.4365 KOps/s | 52.9927 KOps/s | |
test_update_nested | 88.1920μs | 21.2258μs | 47.1124 KOps/s | 40.2530 KOps/s | |
test_update__nested | 0.1399ms | 25.8606μs | 38.6689 KOps/s | 38.9582 KOps/s | |
test_set_nested | 93.6130μs | 14.9750μs | 66.7781 KOps/s | 59.6250 KOps/s | |
test_set_nested_new | 92.1020μs | 16.8587μs | 59.3167 KOps/s | 52.5721 KOps/s | |
test_select | 92.4020μs | 28.4939μs | 35.0952 KOps/s | 31.8688 KOps/s | |
test_select_nested | 70.9420μs | 44.1469μs | 22.6516 KOps/s | 22.6664 KOps/s | |
test_exclude_nested | 0.1206ms | 61.8636μs | 16.1646 KOps/s | 15.5851 KOps/s | |
test_empty[True] | 0.3668ms | 0.2952ms | 3.3873 KOps/s | 3.3680 KOps/s | |
test_empty[False] | 3.7701μs | 0.8465μs | 1.1813 MOps/s | 1.1608 MOps/s | |
test_to | 0.1007ms | 59.2911μs | 16.8659 KOps/s | 17.0748 KOps/s | |
test_to_nonblocking | 89.4120μs | 48.2730μs | 20.7155 KOps/s | 19.5697 KOps/s | |
test_unbind_speed | 0.2764ms | 0.2353ms | 4.2505 KOps/s | 4.2369 KOps/s | |
test_unbind_speed_stack0 | 0.2921ms | 0.2355ms | 4.2466 KOps/s | 4.2381 KOps/s | |
test_unbind_speed_stack1 | 0.6749ms | 0.6210ms | 1.6103 KOps/s | 1.4906 KOps/s | |
test_split | 94.9583ms | 1.5781ms | 633.6613 Ops/s | 578.2829 Ops/s | |
test_chunk | 96.3327ms | 1.5814ms | 632.3420 Ops/s | 687.0053 Ops/s | |
test_consolidate[False-None] | 96.4715ms | 2.9193ms | 342.5523 Ops/s | 329.0016 Ops/s | |
test_consolidate[default-None] | 1.8395ms | 1.7219ms | 580.7424 Ops/s | 583.2083 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8269ms | 1.7434ms | 573.5915 Ops/s | 571.9658 Ops/s | |
test_consolidate_njt[False-None] | 6.9130ms | 6.5508ms | 152.6539 Ops/s | 149.9359 Ops/s | |
test_to[False-False-None] | 1.7979ms | 1.7237ms | 580.1554 Ops/s | 570.3021 Ops/s | |
test_to[True-False-None] | 1.5628ms | 1.3351ms | 748.9909 Ops/s | 751.7799 Ops/s | |
test_to[within-False-None] | 4.2256ms | 4.1079ms | 243.4351 Ops/s | 243.3795 Ops/s | |
test_to[True-default-None] | 5.5925ms | 5.3773ms | 185.9657 Ops/s | 190.5674 Ops/s | |
test_to_njt[False-False-None] | 7.1325ms | 6.9774ms | 143.3194 Ops/s | 143.9295 Ops/s | |
test_to_njt[True-False-None] | 5.6810ms | 5.4662ms | 182.9417 Ops/s | 181.7383 Ops/s | |
test_to_njt[within-False-None] | 12.4365ms | 12.3020ms | 81.2877 Ops/s | 81.6354 Ops/s | |
test_creation[device0] | 0.4585ms | 81.0971μs | 12.3309 KOps/s | 12.3085 KOps/s | |
test_creation_from_tensor | 0.5061ms | 84.5742μs | 11.8239 KOps/s | 11.5942 KOps/s | |
test_add_one[memmap_tensor0] | 0.4124ms | 6.8996μs | 144.9367 KOps/s | 144.4252 KOps/s | |
test_contiguous[memmap_tensor0] | 1.9001μs | 0.4424μs | 2.2606 MOps/s | 2.3038 MOps/s | |
test_stack[memmap_tensor0] | 39.1900μs | 4.4224μs | 226.1218 KOps/s | 232.9507 KOps/s | |
test_memmaptd_index | 1.6586ms | 0.2499ms | 4.0018 KOps/s | 3.9507 KOps/s | |
test_memmaptd_index_astensor | 0.5784ms | 0.3140ms | 3.1851 KOps/s | 3.1765 KOps/s | |
test_memmaptd_index_op | 0.9974ms | 0.5608ms | 1.7830 KOps/s | 1.6278 KOps/s | |
test_serialize_model | 0.1317s | 0.1309s | 7.6421 Ops/s | 7.6770 Ops/s | |
test_serialize_model_pickle | 1.3499s | 1.1868s | 0.8426 Ops/s | 0.8251 Ops/s | |
test_serialize_weights | 0.1309s | 0.1303s | 7.6758 Ops/s | 7.6935 Ops/s | |
test_serialize_weights_returnearly | 0.3342s | 53.5536ms | 18.6729 Ops/s | 14.0273 Ops/s | |
test_serialize_weights_pickle | 1.3823s | 1.2229s | 0.8177 Ops/s | 0.8257 Ops/s | |
test_reshape_pytree | 58.4810μs | 22.3891μs | 44.6646 KOps/s | 44.4645 KOps/s | |
test_reshape_td | 47.6410μs | 26.5591μs | 37.6519 KOps/s | 35.5816 KOps/s | |
test_view_pytree | 80.9320μs | 22.3097μs | 44.8235 KOps/s | 45.2810 KOps/s | |
test_view_td | 71.4510μs | 30.4880μs | 32.7998 KOps/s | 31.7535 KOps/s | |
test_unbind_pytree | 66.6420μs | 28.1505μs | 35.5233 KOps/s | 35.7436 KOps/s | |
test_unbind_td | 0.7617ms | 36.1326μs | 27.6758 KOps/s | 27.5141 KOps/s | |
test_split_pytree | 68.3710μs | 29.2609μs | 34.1752 KOps/s | 33.2899 KOps/s | |
test_split_td | 0.9281ms | 37.4469μs | 26.7044 KOps/s | 25.7265 KOps/s | |
test_add_pytree | 72.9820μs | 34.4318μs | 29.0429 KOps/s | 28.9657 KOps/s | |
test_add_td | 0.4467ms | 44.3079μs | 22.5694 KOps/s | 20.1804 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2314ms | 0.1222ms | 8.1849 KOps/s | 7.8258 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2381ms | 0.1320ms | 7.5759 KOps/s | 7.4102 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.4968ms | 95.9723μs | 10.4197 KOps/s | 10.2983 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.5898ms | 0.1522ms | 6.5710 KOps/s | 6.5802 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.4161ms | 21.9925μs | 45.4700 KOps/s | 45.4362 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.4358ms | 29.6271μs | 33.7529 KOps/s | 33.3782 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4695ms | 65.3212μs | 15.3090 KOps/s | 14.8338 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.4549ms | 50.3204μs | 19.8727 KOps/s | 19.3531 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1878ms | 0.1431ms | 6.9899 KOps/s | 6.9680 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.6249ms | 0.2204ms | 4.5378 KOps/s | 4.5711 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1696ms | 98.4834μs | 10.1540 KOps/s | 10.1226 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1147ms | 56.6085μs | 17.6652 KOps/s | 17.3973 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2721ms | 0.1364ms | 7.3299 KOps/s | 7.2328 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5906ms | 0.4786ms | 2.0895 KOps/s | 2.0629 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4047ms | 0.2620ms | 3.8162 KOps/s | 3.8143 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1830ms | 0.1437ms | 6.9578 KOps/s | 6.7877 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1555ms | 68.3829μs | 14.6235 KOps/s | 14.6266 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1552ms | 0.1041ms | 9.6060 KOps/s | 9.6783 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4447ms | 0.4004ms | 2.4973 KOps/s | 2.4336 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1880ms | 0.1427ms | 7.0067 KOps/s | 7.4415 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 55.7510μs | 19.8763μs | 50.3112 KOps/s | 55.8602 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 66.9510μs | 31.2038μs | 32.0474 KOps/s | 31.4124 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1256ms | 69.6438μs | 14.3588 KOps/s | 14.1180 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1805ms | 51.3463μs | 19.4756 KOps/s | 19.0294 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6377ms | 0.3927ms | 2.5463 KOps/s | 2.1990 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.1067ms | 2.7243ms | 367.0651 Ops/s | 373.2142 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6068ms | 0.4340ms | 2.3040 KOps/s | 2.2508 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.0955ms | 2.6923ms | 371.4352 Ops/s | 374.9626 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.6159ms | 0.1216ms | 8.2203 KOps/s | 8.5609 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5988ms | 84.1106μs | 11.8891 KOps/s | 12.4891 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.6540ms | 0.1095ms | 9.1346 KOps/s | 9.2453 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1183ms | 72.6196μs | 13.7704 KOps/s | 14.3578 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1618ms | 0.1124ms | 8.8956 KOps/s | 8.7190 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1352ms | 71.6051μs | 13.9655 KOps/s | 13.7117 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2025ms | 0.1027ms | 9.7374 KOps/s | 9.8407 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1699ms | 17.0187μs | 58.7590 KOps/s | 56.6883 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1402ms | 0.1000ms | 9.9982 KOps/s | 10.2561 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 59.3620μs | 15.7547μs | 63.4732 KOps/s | 62.6360 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1425ms | 98.2368μs | 10.1795 KOps/s | 10.1320 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 67.9220μs | 16.0005μs | 62.4979 KOps/s | 63.2279 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1447ms | 0.1017ms | 9.8349 KOps/s | 9.8613 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5798ms | 16.9805μs | 58.8912 KOps/s | 57.5203 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1512ms | 97.7632μs | 10.2288 KOps/s | 10.1682 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 47.5010μs | 15.9737μs | 62.6027 KOps/s | 62.2293 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1475ms | 99.8387μs | 10.0162 KOps/s | 9.8261 KOps/s | |
test_compile_indexing[int-pytree-eager] | 49.4510μs | 15.9860μs | 62.5547 KOps/s | 63.1826 KOps/s | |
test_mod_add[eager] | 0.1075ms | 36.6067μs | 27.3174 KOps/s | 25.7108 KOps/s | |
test_mod_add[compile] | 0.3828ms | 81.5653μs | 12.2601 KOps/s | 12.2654 KOps/s | |
test_mod_add[compile-overhead] | 0.3283ms | 0.1671ms | 5.9832 KOps/s | 5.7135 KOps/s | |
test_mod_wrap[eager] | 0.3332ms | 0.2498ms | 4.0026 KOps/s | 3.8970 KOps/s | |
test_mod_wrap[compile] | 0.3967ms | 0.2847ms | 3.5129 KOps/s | 3.3958 KOps/s | |
test_mod_wrap[compile-overhead] | 7.0233ms | 3.7179ms | 268.9676 Ops/s | 282.9737 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5906ms | 1.4516ms | 688.8792 Ops/s | 682.7755 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4906ms | 1.3765ms | 726.4935 Ops/s | 722.7884 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.5124ms | 1.0434ms | 958.3984 Ops/s | 950.3611 Ops/s | |
test_seq_add[eager] | 0.1609ms | 0.1140ms | 8.7709 KOps/s | 8.3536 KOps/s | |
test_seq_add[compile] | 0.1573ms | 87.9066μs | 11.3757 KOps/s | 11.3467 KOps/s | |
test_seq_add[compile-overhead] | 0.1695ms | 0.1292ms | 7.7408 KOps/s | 7.7014 KOps/s | |
test_seq_wrap[eager] | 0.4754ms | 0.4180ms | 2.3923 KOps/s | 2.3136 KOps/s | |
test_seq_wrap[compile] | 0.3491ms | 0.3010ms | 3.3223 KOps/s | 3.2634 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2856ms | 0.2254ms | 4.4361 KOps/s | 4.3601 KOps/s | |
test_func_call_runtime[False-eager] | 0.8045ms | 0.7456ms | 1.3411 KOps/s | 1.3212 KOps/s | |
test_func_call_runtime[False-compile] | 0.7979ms | 0.7458ms | 1.3408 KOps/s | 1.3200 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4152ms | 0.3668ms | 2.7262 KOps/s | 2.7182 KOps/s | |
test_func_call_runtime[True-eager] | 0.9835ms | 0.9131ms | 1.0952 KOps/s | 1.0778 KOps/s | |
test_func_call_runtime[True-compile] | 0.8251ms | 0.7667ms | 1.3042 KOps/s | 1.3015 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4317ms | 0.3872ms | 2.5827 KOps/s | 2.5803 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8826ms | 0.7726ms | 1.2943 KOps/s | 1.3336 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8302ms | 0.7524ms | 1.3291 KOps/s | 1.3296 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4347ms | 0.3687ms | 2.7125 KOps/s | 2.6943 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1779ms | 1.0235ms | 977.0415 Ops/s | 973.3739 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8742ms | 0.7950ms | 1.2578 KOps/s | 1.2435 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4587ms | 0.4120ms | 2.4269 KOps/s | 2.4133 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6047ms | 2.1164ms | 472.5114 Ops/s | 469.0104 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8880ms | 0.8099ms | 1.2348 KOps/s | 1.2351 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4719ms | 0.4161ms | 2.4031 KOps/s | 2.3887 KOps/s | |
test_distributed | 11.4659ms | 0.2140ms | 4.6738 KOps/s | 8.4312 KOps/s | |
test_tdmodule | 52.7210μs | 19.6687μs | 50.8421 KOps/s | 46.9097 KOps/s | |
test_tdmodule_dispatch | 55.0110μs | 34.4916μs | 28.9926 KOps/s | 26.3862 KOps/s | |
test_tdseq | 39.9710μs | 19.9304μs | 50.1746 KOps/s | 46.3891 KOps/s | |
test_tdseq_dispatch | 57.0710μs | 36.7432μs | 27.2159 KOps/s | 24.8817 KOps/s | |
test_instantiation_functorch | 1.6538ms | 1.5887ms | 629.4291 Ops/s | 632.9240 Ops/s | |
test_exec_functorch | 0.1909ms | 0.1473ms | 6.7873 KOps/s | 6.8588 KOps/s | |
test_exec_functional_call | 0.1692ms | 0.1386ms | 7.2128 KOps/s | 7.2631 KOps/s | |
test_exec_td_decorator | 0.3941ms | 0.1880ms | 5.3189 KOps/s | 5.2810 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8019ms | 0.6931ms | 1.4428 KOps/s | 1.4224 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8192ms | 0.6953ms | 1.4382 KOps/s | 1.4353 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7491ms | 0.6171ms | 1.6205 KOps/s | 1.6329 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7310ms | 0.6065ms | 1.6487 KOps/s | 1.6462 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.0681ms | 19.5369ms | 51.1853 Ops/s | 51.2133 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.6527ms | 19.5592ms | 51.1269 Ops/s | 51.5123 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5581ms | 19.4260ms | 51.4773 Ops/s | 51.7797 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.5728ms | 19.4054ms | 51.5319 Ops/s | 51.8730 Ops/s | |
test_to_module_speed[True] | 1.0863ms | 1.0027ms | 997.2638 Ops/s | 1.0212 KOps/s | |
test_to_module_speed[False] | 1.5572ms | 0.9816ms | 1.0188 KOps/s | 1.0364 KOps/s | |
test_tc_init | 66.0920μs | 34.2689μs | 29.1810 KOps/s | 26.3929 KOps/s | |
test_tc_init_nested | 0.1028ms | 69.0956μs | 14.4727 KOps/s | 13.1032 KOps/s | |
test_tc_first_layer_tensor | 29.8100μs | 0.8656μs | 1.1552 MOps/s | 1.1829 MOps/s | |
test_tc_first_layer_nontensor | 82.6820μs | 2.3287μs | 429.4226 KOps/s | 440.8304 KOps/s | |
test_tc_second_layer_tensor | 19.5380μs | 1.4784μs | 676.3931 KOps/s | 684.1954 KOps/s | |
test_tc_second_layer_nontensor | 30.7210μs | 3.0907μs | 323.5519 KOps/s | 327.6794 KOps/s | |
test_unbind | 0.2258s | 10.0812ms | 99.1944 Ops/s | 140.8735 Ops/s | |
test_full_like | 10.4500ms | 9.2584ms | 108.0104 Ops/s | 106.7937 Ops/s | |
test_zeros_like | 11.5894ms | 8.7757ms | 113.9510 Ops/s | 233.5947 Ops/s | |
test_ones_like | 9.1336ms | 7.1164ms | 140.5196 Ops/s | 236.4205 Ops/s | |
test_clone | 7.1458ms | 6.5454ms | 152.7788 Ops/s | 107.9270 Ops/s | |
test_squeeze | 59.3120μs | 9.7554μs | 102.5078 KOps/s | 102.0103 KOps/s | |
test_unsqueeze | 0.1225ms | 71.4169μs | 14.0023 KOps/s | 13.8372 KOps/s | |
test_split | 0.4119ms | 0.1593ms | 6.2763 KOps/s | 6.2334 KOps/s | |
test_permute | 0.2332ms | 0.1777ms | 5.6263 KOps/s | 5.7244 KOps/s | |
test_stack | 51.4888ms | 51.0898ms | 19.5734 Ops/s | 19.7533 Ops/s | |
test_cat | 51.6159ms | 51.0431ms | 19.5913 Ops/s | 19.7154 Ops/s |
vmoens
added a commit
that referenced
this pull request
Jan 10, 2025
ghstack-source-id: 8be779a7a85fdf45000181a9ea0f830822f19e37 Pull Request resolved: #1178
vmoens
added a commit
that referenced
this pull request
Jan 10, 2025
ghstack-source-id: 8be779a7a85fdf45000181a9ea0f830822f19e37 Pull Request resolved: #1178
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Refactor
Refactoring code - not a new feature
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):