-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] TensorClass shadow attributes #1159
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Jan 7, 2025
ghstack-source-id: 35c639c51e54d32f2f3b46ff945fd206c8bae90f Pull Request resolved: #1159
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Jan 7, 2025
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 43.9720μs | 21.9299μs | 45.5999 KOps/s | 49.3690 KOps/s | |
test_plain_set_stack_nested | 58.0480μs | 22.1378μs | 45.1717 KOps/s | 48.2464 KOps/s | |
test_plain_set_nested_inplace | 57.9880μs | 23.8550μs | 41.9199 KOps/s | 44.9955 KOps/s | |
test_plain_set_stack_nested_inplace | 67.3450μs | 23.7042μs | 42.1866 KOps/s | 44.0407 KOps/s | |
test_items | 43.3100μs | 4.1114μs | 243.2260 KOps/s | 240.0697 KOps/s | |
test_items_nested | 0.6120ms | 0.3993ms | 2.5046 KOps/s | 2.4799 KOps/s | |
test_items_nested_locked | 0.5153ms | 0.4036ms | 2.4777 KOps/s | 2.4645 KOps/s | |
test_items_nested_leaf | 0.1345ms | 76.5578μs | 13.0620 KOps/s | 12.8151 KOps/s | |
test_items_stack_nested | 0.5042ms | 0.4052ms | 2.4678 KOps/s | 2.4486 KOps/s | |
test_items_stack_nested_leaf | 0.1315ms | 78.3791μs | 12.7585 KOps/s | 12.7534 KOps/s | |
test_items_stack_nested_locked | 0.5464ms | 0.4073ms | 2.4552 KOps/s | 2.4564 KOps/s | |
test_keys | 40.3950μs | 3.5239μs | 283.7725 KOps/s | 240.2102 KOps/s | |
test_keys_nested | 0.2473ms | 0.1668ms | 5.9964 KOps/s | 6.0014 KOps/s | |
test_keys_nested_locked | 0.7880ms | 0.1752ms | 5.7089 KOps/s | 5.7909 KOps/s | |
test_keys_nested_leaf | 0.2290ms | 0.1458ms | 6.8572 KOps/s | 6.8952 KOps/s | |
test_keys_stack_nested | 0.3114ms | 0.1682ms | 5.9438 KOps/s | 6.0247 KOps/s | |
test_keys_stack_nested_leaf | 0.3827ms | 0.1460ms | 6.8501 KOps/s | 6.8751 KOps/s | |
test_keys_stack_nested_locked | 0.2706ms | 0.1724ms | 5.8007 KOps/s | 5.8626 KOps/s | |
test_values | 6.9230μs | 1.0553μs | 947.5992 KOps/s | 912.2377 KOps/s | |
test_values_nested | 0.1130ms | 63.1759μs | 15.8288 KOps/s | 15.9002 KOps/s | |
test_values_nested_locked | 0.1090ms | 63.1148μs | 15.8441 KOps/s | 15.8255 KOps/s | |
test_values_nested_leaf | 0.3279ms | 72.2142μs | 13.8477 KOps/s | 13.7716 KOps/s | |
test_values_stack_nested | 0.1207ms | 63.4064μs | 15.7713 KOps/s | 15.7883 KOps/s | |
test_values_stack_nested_leaf | 0.1391ms | 71.5287μs | 13.9804 KOps/s | 13.8353 KOps/s | |
test_values_stack_nested_locked | 0.1204ms | 63.7043μs | 15.6975 KOps/s | 15.8658 KOps/s | |
test_membership | 4.1227μs | 0.7078μs | 1.4129 MOps/s | 1.1263 MOps/s | |
test_membership_nested | 24.0550μs | 2.9739μs | 336.2592 KOps/s | 337.7055 KOps/s | |
test_membership_nested_leaf | 34.7250μs | 3.0203μs | 331.0932 KOps/s | 338.5850 KOps/s | |
test_membership_stacked_nested | 27.3700μs | 3.0161μs | 331.5503 KOps/s | 340.9667 KOps/s | |
test_membership_stacked_nested_leaf | 31.2880μs | 3.0404μs | 328.9054 KOps/s | 343.8431 KOps/s | |
test_membership_nested_last | 22.0700μs | 4.4408μs | 225.1830 KOps/s | 229.7333 KOps/s | |
test_membership_nested_leaf_last | 38.1410μs | 4.4823μs | 223.0982 KOps/s | 226.3552 KOps/s | |
test_membership_stacked_nested_last | 27.6420μs | 4.4159μs | 226.4524 KOps/s | 228.4539 KOps/s | |
test_membership_stacked_nested_leaf_last | 35.2160μs | 4.5246μs | 221.0117 KOps/s | 232.5016 KOps/s | |
test_nested_getleaf | 94.0040μs | 11.7781μs | 84.9034 KOps/s | 89.9518 KOps/s | |
test_nested_get | 51.2160μs | 10.7066μs | 93.4006 KOps/s | 90.3789 KOps/s | |
test_stacked_getleaf | 36.5580μs | 11.3259μs | 88.2934 KOps/s | 87.8320 KOps/s | |
test_stacked_get | 32.5510μs | 10.6371μs | 94.0107 KOps/s | 92.5831 KOps/s | |
test_nested_getitemleaf | 49.9750μs | 11.6498μs | 85.8386 KOps/s | 84.7693 KOps/s | |
test_nested_getitem | 39.1620μs | 10.8789μs | 91.9213 KOps/s | 89.2878 KOps/s | |
test_stacked_getitemleaf | 51.8170μs | 11.4473μs | 87.3567 KOps/s | 83.4812 KOps/s | |
test_stacked_getitem | 68.1470μs | 10.7382μs | 93.1255 KOps/s | 87.6479 KOps/s | |
test_lock_nested | 6.6502ms | 0.4665ms | 2.1437 KOps/s | 2.1399 KOps/s | |
test_lock_stack_nested | 0.6004ms | 0.4306ms | 2.3222 KOps/s | 2.2816 KOps/s | |
test_unlock_nested | 0.8123ms | 0.3813ms | 2.6227 KOps/s | 2.5975 KOps/s | |
test_unlock_stack_nested | 0.4375ms | 0.3474ms | 2.8786 KOps/s | 2.8090 KOps/s | |
test_flatten_speed | 0.1964ms | 99.3767μs | 10.0627 KOps/s | 9.8590 KOps/s | |
test_unflatten_speed | 0.6483ms | 0.5396ms | 1.8532 KOps/s | 1.8377 KOps/s | |
test_common_ops | 4.0722ms | 0.8492ms | 1.1776 KOps/s | 1.2874 KOps/s | |
test_creation | 40.2060μs | 2.6301μs | 380.2112 KOps/s | 407.8265 KOps/s | |
test_creation_empty | 42.3290μs | 13.5534μs | 73.7824 KOps/s | 98.4425 KOps/s | |
test_creation_nested_1 | 85.0670μs | 16.5307μs | 60.4934 KOps/s | 76.8549 KOps/s | |
test_creation_nested_2 | 54.5710μs | 21.4832μs | 46.5480 KOps/s | 53.6592 KOps/s | |
test_clone | 0.1891ms | 13.8265μs | 72.3247 KOps/s | 71.7401 KOps/s | |
test_getitem[int] | 1.1871ms | 12.9060μs | 77.4830 KOps/s | 75.6432 KOps/s | |
test_getitem[slice_int] | 0.1426ms | 25.9748μs | 38.4989 KOps/s | 40.0300 KOps/s | |
test_getitem[range] | 0.2335ms | 50.4841μs | 19.8082 KOps/s | 20.3836 KOps/s | |
test_getitem[tuple] | 0.1425ms | 20.9004μs | 47.8461 KOps/s | 48.0733 KOps/s | |
test_getitem[list] | 0.3067ms | 45.7883μs | 21.8397 KOps/s | 22.1724 KOps/s | |
test_setitem_dim[int] | 58.6690μs | 26.3811μs | 37.9059 KOps/s | 37.7025 KOps/s | |
test_setitem_dim[slice_int] | 95.5480μs | 55.3376μs | 18.0709 KOps/s | 18.6833 KOps/s | |
test_setitem_dim[range] | 0.1241ms | 76.3209μs | 13.1026 KOps/s | 13.4831 KOps/s | |
test_setitem_dim[tuple] | 85.2690μs | 42.8344μs | 23.3457 KOps/s | 23.5635 KOps/s | |
test_setitem | 0.1633ms | 22.3387μs | 44.7655 KOps/s | 49.4336 KOps/s | |
test_set | 0.2205ms | 21.7115μs | 46.0585 KOps/s | 50.8096 KOps/s | |
test_set_shared | 7.6501ms | 0.1744ms | 5.7327 KOps/s | 5.5061 KOps/s | |
test_update | 0.2270ms | 25.2811μs | 39.5553 KOps/s | 46.0173 KOps/s | |
test_update_nested | 0.2412ms | 36.1135μs | 27.6905 KOps/s | 31.0031 KOps/s | |
test_update__nested | 0.4864ms | 34.7464μs | 28.7799 KOps/s | 28.6032 KOps/s | |
test_set_nested | 0.1514ms | 23.4187μs | 42.7008 KOps/s | 44.6418 KOps/s | |
test_set_nested_new | 0.1907ms | 28.2909μs | 35.3471 KOps/s | 36.8721 KOps/s | |
test_select | 0.2310ms | 44.8696μs | 22.2868 KOps/s | 22.8394 KOps/s | |
test_select_nested | 0.1236ms | 63.8244μs | 15.6680 KOps/s | 15.7344 KOps/s | |
test_exclude_nested | 0.1735ms | 83.3171μs | 12.0023 KOps/s | 11.9720 KOps/s | |
test_empty[True] | 0.6228ms | 0.4141ms | 2.4147 KOps/s | 2.3805 KOps/s | |
test_empty[False] | 12.5112μs | 1.3861μs | 721.4646 KOps/s | 715.8035 KOps/s | |
test_unbind_speed | 0.3848ms | 0.2707ms | 3.6939 KOps/s | 3.5663 KOps/s | |
test_unbind_speed_stack0 | 0.5266ms | 0.2702ms | 3.7006 KOps/s | 3.6045 KOps/s | |
test_unbind_speed_stack1 | 0.1071s | 0.8090ms | 1.2361 KOps/s | 1.2898 KOps/s | |
test_split | 0.1011s | 1.7658ms | 566.3063 Ops/s | 550.7131 Ops/s | |
test_chunk | 2.0270ms | 1.6080ms | 621.8901 Ops/s | 549.4227 Ops/s | |
test_consolidate_njt[False-None] | 8.7008ms | 8.3793ms | 119.3418 Ops/s | 111.7626 Ops/s | |
test_creation[device0] | 0.2552ms | 91.7216μs | 10.9026 KOps/s | 10.2050 KOps/s | |
test_creation_from_tensor | 3.6100ms | 94.4417μs | 10.5885 KOps/s | 10.1254 KOps/s | |
test_add_one[memmap_tensor0] | 0.1215ms | 4.9112μs | 203.6152 KOps/s | 192.3018 KOps/s | |
test_contiguous[memmap_tensor0] | 35.4260μs | 0.5190μs | 1.9268 MOps/s | 1.9338 MOps/s | |
test_stack[memmap_tensor0] | 38.5720μs | 3.3319μs | 300.1294 KOps/s | 276.9458 KOps/s | |
test_memmaptd_index | 1.0956ms | 0.2377ms | 4.2074 KOps/s | 3.9433 KOps/s | |
test_memmaptd_index_astensor | 0.5761ms | 0.3245ms | 3.0814 KOps/s | 2.9365 KOps/s | |
test_memmaptd_index_op | 0.9878ms | 0.6159ms | 1.6238 KOps/s | 1.6662 KOps/s | |
test_serialize_model | 0.1235s | 0.1168s | 8.5590 Ops/s | 8.1577 Ops/s | |
test_serialize_model_pickle | 0.4314s | 0.3859s | 2.5911 Ops/s | 2.5518 Ops/s | |
test_serialize_weights | 0.1209s | 0.1146s | 8.7274 Ops/s | 8.2490 Ops/s | |
test_serialize_weights_returnearly | 0.2293s | 0.1606s | 6.2254 Ops/s | 6.3764 Ops/s | |
test_serialize_weights_pickle | 0.4449s | 0.3917s | 2.5532 Ops/s | 1.1145 Ops/s | |
test_serialize_weights_filesystem | 0.1468s | 0.1405s | 7.1182 Ops/s | 7.0031 Ops/s | |
test_serialize_model_filesystem | 0.1640s | 0.1533s | 6.5217 Ops/s | 6.7519 Ops/s | |
test_reshape_pytree | 58.1790μs | 26.6803μs | 37.4809 KOps/s | 36.7756 KOps/s | |
test_reshape_td | 71.3630μs | 33.5278μs | 29.8260 KOps/s | 28.8227 KOps/s | |
test_view_pytree | 66.9650μs | 26.5796μs | 37.6228 KOps/s | 36.4151 KOps/s | |
test_view_td | 0.1030ms | 39.3898μs | 25.3873 KOps/s | 25.1612 KOps/s | |
test_unbind_pytree | 91.2980μs | 29.8780μs | 33.4694 KOps/s | 33.3299 KOps/s | |
test_unbind_td | 0.3799ms | 40.4768μs | 24.7055 KOps/s | 24.3947 KOps/s | |
test_split_pytree | 69.0490μs | 29.6701μs | 33.7040 KOps/s | 33.4575 KOps/s | |
test_split_td | 0.1019s | 56.1230μs | 17.8180 KOps/s | 21.9604 KOps/s | |
test_add_pytree | 92.9530μs | 34.8590μs | 28.6870 KOps/s | 27.0813 KOps/s | |
test_add_td | 0.1375ms | 60.6700μs | 16.4826 KOps/s | 16.8142 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1698ms | 62.3376μs | 16.0417 KOps/s | 16.1622 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3715ms | 0.1691ms | 5.9133 KOps/s | 5.7120 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1827ms | 46.6799μs | 21.4225 KOps/s | 21.6537 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2780ms | 0.1180ms | 8.4728 KOps/s | 8.0541 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 82.3440μs | 26.8408μs | 37.2567 KOps/s | 38.2357 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1324ms | 59.8763μs | 16.7011 KOps/s | 16.8068 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2077ms | 76.1988μs | 13.1236 KOps/s | 12.6294 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1191ms | 66.0287μs | 15.1449 KOps/s | 14.6363 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1871ms | 0.1057ms | 9.4590 KOps/s | 9.1924 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3896ms | 0.2162ms | 4.6261 KOps/s | 4.5150 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 97.2710μs | 46.4210μs | 21.5420 KOps/s | 22.1544 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5149ms | 66.0571μs | 15.1384 KOps/s | 14.8742 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2218ms | 0.1053ms | 9.5007 KOps/s | 9.6054 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4118ms | 0.1992ms | 5.0207 KOps/s | 4.8559 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4060ms | 0.2312ms | 4.3256 KOps/s | 4.2544 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1914ms | 0.1063ms | 9.4056 KOps/s | 9.4905 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1121ms | 59.3149μs | 16.8592 KOps/s | 16.8625 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 1.0540ms | 50.9606μs | 19.6230 KOps/s | 20.6191 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 1.4488ms | 0.1596ms | 6.2654 KOps/s | 6.1495 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1833ms | 0.1068ms | 9.3621 KOps/s | 9.7447 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 69.6090μs | 22.3839μs | 44.6749 KOps/s | 48.3124 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1358ms | 66.7821μs | 14.9741 KOps/s | 15.0574 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1948ms | 82.0527μs | 12.1873 KOps/s | 12.1207 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1562ms | 67.7315μs | 14.7642 KOps/s | 14.3415 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4480ms | 0.2101ms | 4.7599 KOps/s | 4.8757 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.5609ms | 1.3223ms | 756.2832 Ops/s | 729.4857 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3239ms | 0.2054ms | 4.8686 KOps/s | 4.9199 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3404ms | 0.7739ms | 1.2921 KOps/s | 1.2654 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.7593ms | 0.4614ms | 2.1672 KOps/s | 2.1899 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.7185ms | 2.7952ms | 357.7555 Ops/s | 370.1376 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 83.5960μs | 36.1179μs | 27.6871 KOps/s | 26.6355 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5236ms | 34.4011μs | 29.0689 KOps/s | 28.2914 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 72.2850μs | 30.4669μs | 32.8225 KOps/s | 32.7469 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 63.8990μs | 22.6399μs | 44.1698 KOps/s | 41.9382 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 88.0540μs | 31.0127μs | 32.2448 KOps/s | 32.5060 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 60.9030μs | 22.6771μs | 44.0973 KOps/s | 42.2727 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1272ms | 53.2183μs | 18.7905 KOps/s | 19.1053 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5645ms | 20.4898μs | 48.8047 KOps/s | 48.3233 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 94.2460μs | 45.9016μs | 21.7858 KOps/s | 21.9174 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1276ms | 19.1653μs | 52.1776 KOps/s | 52.6599 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1343ms | 46.1484μs | 21.6692 KOps/s | 21.3444 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 53.4690μs | 18.7600μs | 53.3048 KOps/s | 52.3044 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1195ms | 53.5973μs | 18.6577 KOps/s | 18.7785 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0120ms | 20.8335μs | 47.9996 KOps/s | 49.7298 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2141ms | 46.6166μs | 21.4516 KOps/s | 21.6425 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 88.0260μs | 18.5927μs | 53.7845 KOps/s | 52.7883 KOps/s | |
test_compile_indexing[int-pytree-compile] | 91.1400μs | 46.3582μs | 21.5711 KOps/s | 21.3856 KOps/s | |
test_compile_indexing[int-pytree-eager] | 60.7930μs | 18.3923μs | 54.3707 KOps/s | 52.4576 KOps/s | |
test_mod_add[eager] | 77.6040μs | 35.6828μs | 28.0247 KOps/s | 28.6516 KOps/s | |
test_mod_add[compile] | 0.1034ms | 48.6722μs | 20.5456 KOps/s | 20.6248 KOps/s | |
test_mod_add[compile-overhead] | 0.1339ms | 49.2532μs | 20.3033 KOps/s | 20.8520 KOps/s | |
test_mod_wrap[eager] | 0.4957ms | 0.2357ms | 4.2434 KOps/s | 4.3174 KOps/s | |
test_mod_wrap[compile] | 0.2936ms | 0.2095ms | 4.7732 KOps/s | 4.7578 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3454ms | 0.2091ms | 4.7834 KOps/s | 4.8137 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.6358ms | 11.8548ms | 84.3540 Ops/s | 81.4667 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.2655ms | 12.2480ms | 81.6457 Ops/s | 74.4031 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 19.0671ms | 13.1710ms | 75.9242 Ops/s | 73.9699 Ops/s | |
test_seq_add[eager] | 0.2028ms | 0.1206ms | 8.2929 KOps/s | 8.5734 KOps/s | |
test_seq_add[compile] | 0.1193ms | 63.4485μs | 15.7608 KOps/s | 16.0333 KOps/s | |
test_seq_add[compile-overhead] | 0.1126ms | 60.9203μs | 16.4149 KOps/s | 16.0621 KOps/s | |
test_seq_wrap[eager] | 1.0709ms | 0.4630ms | 2.1599 KOps/s | 2.1749 KOps/s | |
test_seq_wrap[compile] | 0.4414ms | 0.2346ms | 4.2623 KOps/s | 4.2778 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4402ms | 0.2320ms | 4.3108 KOps/s | 4.2507 KOps/s | |
test_func_call_runtime[False-eager] | 0.8517ms | 0.5596ms | 1.7870 KOps/s | 1.7974 KOps/s | |
test_func_call_runtime[False-compile] | 0.5716ms | 0.4313ms | 2.3185 KOps/s | 2.2846 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5655ms | 0.4299ms | 2.3261 KOps/s | 2.2801 KOps/s | |
test_func_call_runtime[True-eager] | 1.2811ms | 0.7681ms | 1.3020 KOps/s | 1.2603 KOps/s | |
test_func_call_runtime[True-compile] | 0.7996ms | 0.4688ms | 2.1333 KOps/s | 2.1088 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8713ms | 0.4732ms | 2.1131 KOps/s | 2.1095 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9603ms | 0.5550ms | 1.8018 KOps/s | 1.7993 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6376ms | 0.4324ms | 2.3128 KOps/s | 2.3202 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5241ms | 0.4301ms | 2.3249 KOps/s | 2.3055 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.2373ms | 0.9149ms | 1.0930 KOps/s | 1.0664 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6053ms | 0.4940ms | 2.0244 KOps/s | 2.0038 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5901ms | 0.4932ms | 2.0275 KOps/s | 1.9888 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6593ms | 1.9500ms | 512.8264 Ops/s | 497.2357 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9490ms | 0.5215ms | 1.9175 KOps/s | 1.9251 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8227ms | 0.5235ms | 1.9103 KOps/s | 1.8855 KOps/s | |
test_distributed | 0.2784ms | 0.1267ms | 7.8900 KOps/s | 7.6709 KOps/s | |
test_tdmodule | 51.8370μs | 28.2164μs | 35.4404 KOps/s | 38.6273 KOps/s | |
test_tdmodule_dispatch | 79.0070μs | 52.2052μs | 19.1552 KOps/s | 20.9480 KOps/s | |
test_tdseq | 53.0790μs | 31.7706μs | 31.4757 KOps/s | 34.2315 KOps/s | |
test_tdseq_dispatch | 89.1860μs | 58.7498μs | 17.0213 KOps/s | 18.3228 KOps/s | |
test_instantiation_functorch | 1.7237ms | 1.5463ms | 646.6941 Ops/s | 636.3368 Ops/s | |
test_exec_functorch | 0.3324ms | 0.1837ms | 5.4443 KOps/s | 5.3507 KOps/s | |
test_exec_functional_call | 0.4529ms | 0.1791ms | 5.5843 KOps/s | 5.5686 KOps/s | |
test_exec_td_decorator | 0.4733ms | 0.2388ms | 4.1873 KOps/s | 4.1415 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1725ms | 0.6818ms | 1.4667 KOps/s | 1.4167 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0122ms | 0.6794ms | 1.4718 KOps/s | 1.4630 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8638ms | 0.5560ms | 1.7984 KOps/s | 1.8321 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9280ms | 0.5424ms | 1.8437 KOps/s | 1.8274 KOps/s | |
test_to_module_speed[True] | 2.0458ms | 1.3541ms | 738.5250 Ops/s | 735.9055 Ops/s | |
test_to_module_speed[False] | 1.4390ms | 1.3294ms | 752.2328 Ops/s | 742.0862 Ops/s | |
test_tc_init | 91.4710μs | 51.1410μs | 19.5538 KOps/s | 22.0972 KOps/s | |
test_tc_init_nested | 0.1711ms | 0.1034ms | 9.6709 KOps/s | 10.9201 KOps/s | |
test_tc_first_layer_tensor | 16.9920μs | 1.5591μs | 641.4066 KOps/s | 649.4535 KOps/s | |
test_tc_first_layer_nontensor | 26.6900μs | 4.7088μs | 212.3686 KOps/s | 212.0873 KOps/s | |
test_tc_second_layer_tensor | 48.0400μs | 2.8077μs | 356.1585 KOps/s | 360.6884 KOps/s | |
test_tc_second_layer_nontensor | 52.2200μs | 5.9582μs | 167.8350 KOps/s | 165.8096 KOps/s | |
test_unbind | 0.2276s | 15.5323ms | 64.3820 Ops/s | 75.0398 Ops/s | |
test_full_like | 8.8898ms | 7.8011ms | 128.1868 Ops/s | 117.1174 Ops/s | |
test_zeros_like | 3.6547ms | 3.1308ms | 319.4087 Ops/s | 327.5350 Ops/s | |
test_ones_like | 3.8900ms | 3.4247ms | 291.9962 Ops/s | 298.0450 Ops/s | |
test_clone | 6.4289ms | 5.6522ms | 176.9231 Ops/s | 175.3990 Ops/s | |
test_squeeze | 62.9080μs | 12.8086μs | 78.0726 KOps/s | 81.3826 KOps/s | |
test_unsqueeze | 0.3212ms | 95.7965μs | 10.4388 KOps/s | 10.4541 KOps/s | |
test_split | 0.3726ms | 0.2049ms | 4.8793 KOps/s | 4.9086 KOps/s | |
test_permute | 0.4318ms | 0.2184ms | 4.5790 KOps/s | 4.8013 KOps/s | |
test_stack | 34.4658ms | 27.4914ms | 36.3750 Ops/s | 38.6498 Ops/s | |
test_cat | 35.5788ms | 28.0480ms | 35.6532 Ops/s | 37.2192 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 40.5120μs | 11.2027μs | 89.2641 KOps/s | 77.7410 KOps/s | |
test_plain_set_stack_nested | 39.4320μs | 11.2597μs | 88.8125 KOps/s | 75.9059 KOps/s | |
test_plain_set_nested_inplace | 47.6330μs | 12.3599μs | 80.9065 KOps/s | 70.1271 KOps/s | |
test_plain_set_stack_nested_inplace | 1.0239ms | 12.2560μs | 81.5926 KOps/s | 70.9381 KOps/s | |
test_items | 33.7910μs | 2.8819μs | 346.9972 KOps/s | 341.3361 KOps/s | |
test_items_nested | 0.4068ms | 0.3542ms | 2.8236 KOps/s | 2.8065 KOps/s | |
test_items_nested_locked | 0.4364ms | 0.3547ms | 2.8190 KOps/s | 2.7991 KOps/s | |
test_items_nested_leaf | 0.1058ms | 58.0938μs | 17.2135 KOps/s | 17.0314 KOps/s | |
test_items_stack_nested | 0.3992ms | 0.3587ms | 2.7882 KOps/s | 2.7981 KOps/s | |
test_items_stack_nested_leaf | 85.1140μs | 58.2314μs | 17.1729 KOps/s | 16.6323 KOps/s | |
test_items_stack_nested_locked | 0.4123ms | 0.3586ms | 2.7887 KOps/s | 2.7845 KOps/s | |
test_keys | 32.2310μs | 3.4789μs | 287.4461 KOps/s | 284.2802 KOps/s | |
test_keys_nested | 0.1307ms | 80.7455μs | 12.3846 KOps/s | 12.2332 KOps/s | |
test_keys_nested_locked | 0.8279ms | 87.0605μs | 11.4863 KOps/s | 11.4548 KOps/s | |
test_keys_nested_leaf | 0.1082ms | 71.7156μs | 13.9440 KOps/s | 13.8131 KOps/s | |
test_keys_stack_nested | 0.1445ms | 80.8378μs | 12.3705 KOps/s | 12.0156 KOps/s | |
test_keys_stack_nested_leaf | 0.1040ms | 71.7904μs | 13.9294 KOps/s | 13.4244 KOps/s | |
test_keys_stack_nested_locked | 0.1445ms | 87.4984μs | 11.4288 KOps/s | 11.2162 KOps/s | |
test_values | 6.6887μs | 0.8504μs | 1.1759 MOps/s | 1.1598 MOps/s | |
test_values_nested | 92.6050μs | 33.8097μs | 29.5773 KOps/s | 29.1250 KOps/s | |
test_values_nested_locked | 70.3240μs | 35.7437μs | 27.9769 KOps/s | 27.4592 KOps/s | |
test_values_nested_leaf | 65.4230μs | 38.4026μs | 26.0399 KOps/s | 25.6016 KOps/s | |
test_values_stack_nested | 70.6340μs | 33.9603μs | 29.4461 KOps/s | 28.7431 KOps/s | |
test_values_stack_nested_leaf | 71.2240μs | 38.9403μs | 25.6803 KOps/s | 25.3495 KOps/s | |
test_values_stack_nested_locked | 0.1013ms | 36.0294μs | 27.7551 KOps/s | 27.1054 KOps/s | |
test_membership | 1.9351μs | 0.5317μs | 1.8808 MOps/s | 1.8708 MOps/s | |
test_membership_nested | 16.8055μs | 2.0366μs | 491.0152 KOps/s | 490.4527 KOps/s | |
test_membership_nested_leaf | 21.4410μs | 2.0497μs | 487.8792 KOps/s | 481.4325 KOps/s | |
test_membership_stacked_nested | 39.3220μs | 2.1404μs | 467.2060 KOps/s | 477.3158 KOps/s | |
test_membership_stacked_nested_leaf | 30.0920μs | 2.0983μs | 476.5761 KOps/s | 473.8606 KOps/s | |
test_membership_nested_last | 24.1420μs | 3.1566μs | 316.7921 KOps/s | 312.6731 KOps/s | |
test_membership_nested_leaf_last | 54.2330μs | 3.1402μs | 318.4462 KOps/s | 312.8868 KOps/s | |
test_membership_stacked_nested_last | 27.0010μs | 3.1526μs | 317.1950 KOps/s | 251.9898 KOps/s | |
test_membership_stacked_nested_leaf_last | 62.5430μs | 3.1274μs | 319.7568 KOps/s | 254.6853 KOps/s | |
test_nested_getleaf | 30.7120μs | 6.0928μs | 164.1289 KOps/s | 161.4943 KOps/s | |
test_nested_get | 50.5330μs | 5.7967μs | 172.5118 KOps/s | 169.3085 KOps/s | |
test_stacked_getleaf | 31.4110μs | 6.1114μs | 163.6276 KOps/s | 163.1401 KOps/s | |
test_stacked_get | 46.0830μs | 5.8126μs | 172.0387 KOps/s | 171.3833 KOps/s | |
test_nested_getitemleaf | 70.9940μs | 6.2647μs | 159.6254 KOps/s | 157.8944 KOps/s | |
test_nested_getitem | 35.8520μs | 6.0073μs | 166.4654 KOps/s | 165.9028 KOps/s | |
test_stacked_getitemleaf | 32.9310μs | 6.3299μs | 157.9809 KOps/s | 159.9671 KOps/s | |
test_stacked_getitem | 42.5320μs | 5.9018μs | 169.4398 KOps/s | 168.2967 KOps/s | |
test_lock_nested | 0.7537ms | 0.3722ms | 2.6865 KOps/s | 2.6599 KOps/s | |
test_lock_stack_nested | 0.4139ms | 0.3450ms | 2.8981 KOps/s | 2.8685 KOps/s | |
test_unlock_nested | 0.6110ms | 0.3110ms | 3.2153 KOps/s | 3.1862 KOps/s | |
test_unlock_stack_nested | 0.3432ms | 0.2810ms | 3.5592 KOps/s | 3.5387 KOps/s | |
test_flatten_speed | 0.1537ms | 73.8275μs | 13.5451 KOps/s | 13.4422 KOps/s | |
test_unflatten_speed | 0.3847ms | 0.3225ms | 3.1005 KOps/s | 3.1179 KOps/s | |
test_common_ops | 1.5257ms | 0.5796ms | 1.7253 KOps/s | 1.5781 KOps/s | |
test_creation | 0.1079ms | 1.7456μs | 572.8654 KOps/s | 561.1653 KOps/s | |
test_creation_empty | 34.7220μs | 6.5725μs | 152.1495 KOps/s | 102.9990 KOps/s | |
test_creation_nested_1 | 41.3820μs | 8.1914μs | 122.0789 KOps/s | 88.6700 KOps/s | |
test_creation_nested_2 | 60.7430μs | 11.0720μs | 90.3180 KOps/s | 71.3953 KOps/s | |
test_clone | 93.5850μs | 10.5048μs | 95.1945 KOps/s | 93.7954 KOps/s | |
test_getitem[int] | 1.7992ms | 10.5183μs | 95.0727 KOps/s | 92.6676 KOps/s | |
test_getitem[slice_int] | 0.1074ms | 20.5758μs | 48.6009 KOps/s | 47.7402 KOps/s | |
test_getitem[range] | 0.1260ms | 37.4303μs | 26.7163 KOps/s | 26.9541 KOps/s | |
test_getitem[tuple] | 0.1053ms | 18.1989μs | 54.9485 KOps/s | 54.0758 KOps/s | |
test_getitem[list] | 0.2498ms | 33.1360μs | 30.1787 KOps/s | 30.1104 KOps/s | |
test_setitem_dim[int] | 28.3020μs | 17.7766μs | 56.2536 KOps/s | 53.6008 KOps/s | |
test_setitem_dim[slice_int] | 66.0340μs | 37.5323μs | 26.6437 KOps/s | 26.1791 KOps/s | |
test_setitem_dim[range] | 84.1140μs | 52.0460μs | 19.2138 KOps/s | 19.3618 KOps/s | |
test_setitem_dim[tuple] | 52.5930μs | 32.6042μs | 30.6709 KOps/s | 31.5994 KOps/s | |
test_setitem | 87.8340μs | 14.0790μs | 71.0277 KOps/s | 62.8219 KOps/s | |
test_set | 90.9140μs | 13.5216μs | 73.9556 KOps/s | 64.6680 KOps/s | |
test_set_shared | 1.7270ms | 0.1539ms | 6.4962 KOps/s | 6.5426 KOps/s | |
test_update | 0.4705ms | 15.3644μs | 65.0854 KOps/s | 51.8033 KOps/s | |
test_update_nested | 90.1440μs | 21.0751μs | 47.4494 KOps/s | 40.0545 KOps/s | |
test_update__nested | 0.5097ms | 25.7751μs | 38.7971 KOps/s | 38.9322 KOps/s | |
test_set_nested | 80.3540μs | 14.9527μs | 66.8776 KOps/s | 59.5495 KOps/s | |
test_set_nested_new | 78.7840μs | 17.0494μs | 58.6532 KOps/s | 52.4034 KOps/s | |
test_select | 0.2184ms | 31.2005μs | 32.0508 KOps/s | 31.9337 KOps/s | |
test_select_nested | 72.5240μs | 43.9737μs | 22.7408 KOps/s | 22.4113 KOps/s | |
test_exclude_nested | 0.1368ms | 63.4949μs | 15.7493 KOps/s | 15.6555 KOps/s | |
test_empty[True] | 0.3490ms | 0.2870ms | 3.4838 KOps/s | 3.4810 KOps/s | |
test_empty[False] | 3.4422μs | 0.8657μs | 1.1552 MOps/s | 1.1508 MOps/s | |
test_to | 86.7040μs | 58.4959μs | 17.0952 KOps/s | 18.0333 KOps/s | |
test_to_nonblocking | 92.2750μs | 48.0903μs | 20.7942 KOps/s | 21.3273 KOps/s | |
test_unbind_speed | 0.2900ms | 0.2363ms | 4.2322 KOps/s | 4.2943 KOps/s | |
test_unbind_speed_stack0 | 0.3740ms | 0.2384ms | 4.1941 KOps/s | 4.2604 KOps/s | |
test_unbind_speed_stack1 | 97.3006ms | 0.6654ms | 1.5028 KOps/s | 1.4799 KOps/s | |
test_split | 99.4393ms | 1.7082ms | 585.4281 Ops/s | 632.2556 Ops/s | |
test_chunk | 0.1016s | 1.5857ms | 630.6188 Ops/s | 629.8363 Ops/s | |
test_consolidate[False-None] | 3.3594ms | 2.6531ms | 376.9106 Ops/s | 326.3413 Ops/s | |
test_consolidate[default-None] | 1.7418ms | 1.6677ms | 599.6221 Ops/s | 591.5615 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.9261ms | 1.7068ms | 585.9010 Ops/s | 576.5160 Ops/s | |
test_consolidate_njt[False-None] | 6.7163ms | 6.6214ms | 151.0264 Ops/s | 150.7099 Ops/s | |
test_to[False-False-None] | 1.9383ms | 1.7455ms | 572.8864 Ops/s | 596.4426 Ops/s | |
test_to[True-False-None] | 1.5379ms | 1.2856ms | 777.8640 Ops/s | 763.1708 Ops/s | |
test_to[within-False-None] | 4.1523ms | 4.0473ms | 247.0753 Ops/s | 240.3654 Ops/s | |
test_to[True-default-None] | 5.6143ms | 5.3529ms | 186.8144 Ops/s | 187.8142 Ops/s | |
test_to_njt[False-False-None] | 7.0653ms | 6.9627ms | 143.6229 Ops/s | 146.0411 Ops/s | |
test_to_njt[True-False-None] | 5.6432ms | 5.4610ms | 183.1170 Ops/s | 182.2364 Ops/s | |
test_to_njt[within-False-None] | 12.3281ms | 12.2355ms | 81.7291 Ops/s | 81.4788 Ops/s | |
test_creation[device0] | 0.4646ms | 80.3939μs | 12.4388 KOps/s | 12.0030 KOps/s | |
test_creation_from_tensor | 0.4359ms | 86.4740μs | 11.5642 KOps/s | 11.9430 KOps/s | |
test_add_one[memmap_tensor0] | 0.4080ms | 6.7084μs | 149.0673 KOps/s | 150.3664 KOps/s | |
test_contiguous[memmap_tensor0] | 2.7641μs | 0.4030μs | 2.4813 MOps/s | 2.4385 MOps/s | |
test_stack[memmap_tensor0] | 22.4010μs | 4.3260μs | 231.1585 KOps/s | 227.7971 KOps/s | |
test_memmaptd_index | 1.6204ms | 0.2482ms | 4.0291 KOps/s | 4.0545 KOps/s | |
test_memmaptd_index_astensor | 0.6040ms | 0.3072ms | 3.2549 KOps/s | 3.2279 KOps/s | |
test_memmaptd_index_op | 0.9927ms | 0.5572ms | 1.7946 KOps/s | 1.6585 KOps/s | |
test_serialize_model | 0.1315s | 0.1309s | 7.6398 Ops/s | 7.6404 Ops/s | |
test_serialize_model_pickle | 1.3553s | 1.2121s | 0.8250 Ops/s | 0.8229 Ops/s | |
test_serialize_weights | 0.1330s | 0.1310s | 7.6351 Ops/s | 7.6524 Ops/s | |
test_serialize_weights_returnearly | 0.3602s | 56.9364ms | 17.5635 Ops/s | 14.3210 Ops/s | |
test_serialize_weights_pickle | 1.3741s | 1.2175s | 0.8213 Ops/s | 0.8181 Ops/s | |
test_reshape_pytree | 62.1030μs | 22.0061μs | 45.4420 KOps/s | 44.8524 KOps/s | |
test_reshape_td | 66.1930μs | 27.1647μs | 36.8124 KOps/s | 35.6717 KOps/s | |
test_view_pytree | 56.1830μs | 21.8501μs | 45.7664 KOps/s | 44.8764 KOps/s | |
test_view_td | 68.0640μs | 31.9942μs | 31.2556 KOps/s | 30.2404 KOps/s | |
test_unbind_pytree | 70.9340μs | 29.5490μs | 33.8421 KOps/s | 35.1554 KOps/s | |
test_unbind_td | 0.8484ms | 36.6218μs | 27.3061 KOps/s | 27.2071 KOps/s | |
test_split_pytree | 0.1624ms | 29.7804μs | 33.5792 KOps/s | 33.3868 KOps/s | |
test_split_td | 0.9526ms | 38.1188μs | 26.2338 KOps/s | 25.4769 KOps/s | |
test_add_pytree | 79.2340μs | 34.6713μs | 28.8423 KOps/s | 29.0864 KOps/s | |
test_add_td | 0.1823ms | 44.2475μs | 22.6002 KOps/s | 19.7076 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1728ms | 0.1203ms | 8.3147 KOps/s | 8.1088 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2772ms | 0.1297ms | 7.7085 KOps/s | 7.6189 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1593ms | 96.1975μs | 10.3953 KOps/s | 10.3279 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.5720ms | 0.1500ms | 6.6657 KOps/s | 6.8582 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 67.6940μs | 22.3865μs | 44.6699 KOps/s | 42.5093 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 67.1130μs | 29.4361μs | 33.9719 KOps/s | 33.0760 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4306ms | 64.3356μs | 15.5435 KOps/s | 15.4294 KOps/s | |
test_compile_copy_nested[pytree-eager] | 81.9750μs | 49.0162μs | 20.4014 KOps/s | 20.4043 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1811ms | 0.1417ms | 7.0554 KOps/s | 6.9327 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3460ms | 0.2164ms | 4.6220 KOps/s | 4.6727 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1603ms | 98.4688μs | 10.1555 KOps/s | 10.2212 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1212ms | 53.5894μs | 18.6604 KOps/s | 18.6792 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2462ms | 0.1367ms | 7.3171 KOps/s | 7.3232 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6364ms | 0.4798ms | 2.0843 KOps/s | 2.1420 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4343ms | 0.2589ms | 3.8626 KOps/s | 3.8786 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2058ms | 0.1465ms | 6.8258 KOps/s | 6.9782 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1818ms | 65.0052μs | 15.3834 KOps/s | 15.6223 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1490ms | 0.1022ms | 9.7892 KOps/s | 10.1305 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5000ms | 0.4036ms | 2.4777 KOps/s | 2.5264 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1958ms | 0.1366ms | 7.3196 KOps/s | 7.3129 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1599ms | 19.2093μs | 52.0581 KOps/s | 51.9408 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 99.9150μs | 31.9382μs | 31.3105 KOps/s | 30.3396 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1826ms | 70.5547μs | 14.1734 KOps/s | 14.0728 KOps/s | |
test_compile_copy_flat[pytree-eager] | 85.0140μs | 51.8236μs | 19.2962 KOps/s | 19.4576 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6047ms | 0.3874ms | 2.5814 KOps/s | 2.2174 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8615ms | 2.6321ms | 379.9259 Ops/s | 368.4427 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5763ms | 0.4282ms | 2.3351 KOps/s | 2.2831 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8505ms | 2.6377ms | 379.1251 Ops/s | 392.1930 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5397ms | 0.1172ms | 8.5332 KOps/s | 8.8490 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5607ms | 82.6012μs | 12.1064 KOps/s | 12.1601 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5544ms | 0.1119ms | 8.9350 KOps/s | 9.1469 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.4929ms | 72.4508μs | 13.8025 KOps/s | 14.3422 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.5364ms | 0.1140ms | 8.7732 KOps/s | 9.1628 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.5107ms | 72.3754μs | 13.8169 KOps/s | 14.1814 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1633ms | 0.1034ms | 9.6676 KOps/s | 9.8970 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4248ms | 17.5223μs | 57.0701 KOps/s | 53.6544 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1593ms | 0.1017ms | 9.8291 KOps/s | 10.3545 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.4192ms | 16.0163μs | 62.4364 KOps/s | 64.4039 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.5467ms | 0.1043ms | 9.5860 KOps/s | 10.2499 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.4222ms | 15.6323μs | 63.9699 KOps/s | 63.3193 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2435ms | 0.1057ms | 9.4586 KOps/s | 9.8299 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6227ms | 16.8256μs | 59.4332 KOps/s | 57.8398 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2463ms | 99.1660μs | 10.0841 KOps/s | 10.2638 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 75.2940μs | 16.1687μs | 61.8480 KOps/s | 64.1620 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1646ms | 99.2280μs | 10.0778 KOps/s | 10.2771 KOps/s | |
test_compile_indexing[int-pytree-eager] | 51.0030μs | 16.0583μs | 62.2732 KOps/s | 63.7848 KOps/s | |
test_mod_add[eager] | 0.1304ms | 35.8296μs | 27.9099 KOps/s | 25.8785 KOps/s | |
test_mod_add[compile] | 0.1498ms | 80.0693μs | 12.4892 KOps/s | 12.4516 KOps/s | |
test_mod_add[compile-overhead] | 0.3352ms | 0.1715ms | 5.8310 KOps/s | 5.7200 KOps/s | |
test_mod_wrap[eager] | 0.3895ms | 0.2483ms | 4.0276 KOps/s | 4.0252 KOps/s | |
test_mod_wrap[compile] | 0.3586ms | 0.2782ms | 3.5952 KOps/s | 3.3080 KOps/s | |
test_mod_wrap[compile-overhead] | 6.5653ms | 3.6232ms | 276.0017 Ops/s | 269.3096 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4795ms | 1.3582ms | 736.2617 Ops/s | 743.6184 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3539ms | 1.2526ms | 798.3610 Ops/s | 789.3003 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3777ms | 0.9281ms | 1.0774 KOps/s | 1.0621 KOps/s | |
test_seq_add[eager] | 0.1697ms | 0.1131ms | 8.8453 KOps/s | 8.3533 KOps/s | |
test_seq_add[compile] | 0.2816ms | 88.0325μs | 11.3594 KOps/s | 11.4922 KOps/s | |
test_seq_add[compile-overhead] | 0.1960ms | 0.1288ms | 7.7657 KOps/s | 7.7986 KOps/s | |
test_seq_wrap[eager] | 0.4740ms | 0.4091ms | 2.4443 KOps/s | 2.3442 KOps/s | |
test_seq_wrap[compile] | 0.4391ms | 0.2971ms | 3.3662 KOps/s | 3.3172 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2784ms | 0.2244ms | 4.4560 KOps/s | 4.4130 KOps/s | |
test_func_call_runtime[False-eager] | 0.8273ms | 0.7302ms | 1.3694 KOps/s | 1.3849 KOps/s | |
test_func_call_runtime[False-compile] | 0.7893ms | 0.7271ms | 1.3753 KOps/s | 1.3615 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4192ms | 0.3633ms | 2.7522 KOps/s | 2.7373 KOps/s | |
test_func_call_runtime[True-eager] | 1.0375ms | 0.8967ms | 1.1152 KOps/s | 1.1243 KOps/s | |
test_func_call_runtime[True-compile] | 0.8921ms | 0.7564ms | 1.3220 KOps/s | 1.3279 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4336ms | 0.3833ms | 2.6092 KOps/s | 2.6074 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8601ms | 0.7257ms | 1.3781 KOps/s | 1.3907 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8203ms | 0.7327ms | 1.3648 KOps/s | 1.3512 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4217ms | 0.3651ms | 2.7389 KOps/s | 2.7178 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1038ms | 0.9937ms | 1.0063 KOps/s | 997.9730 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8690ms | 0.7821ms | 1.2785 KOps/s | 1.2775 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5523ms | 0.4088ms | 2.4462 KOps/s | 2.4137 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5381ms | 2.0864ms | 479.2830 Ops/s | 477.0301 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9733ms | 0.8020ms | 1.2469 KOps/s | 1.2479 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4696ms | 0.4125ms | 2.4240 KOps/s | 2.4046 KOps/s | |
test_distributed | 6.6873ms | 0.1903ms | 5.2559 KOps/s | 8.5075 KOps/s | |
test_tdmodule | 39.0520μs | 18.8636μs | 53.0121 KOps/s | 48.2937 KOps/s | |
test_tdmodule_dispatch | 62.5730μs | 33.4414μs | 29.9031 KOps/s | 26.8759 KOps/s | |
test_tdseq | 55.6530μs | 19.7098μs | 50.7362 KOps/s | 45.4292 KOps/s | |
test_tdseq_dispatch | 67.3930μs | 36.3421μs | 27.5163 KOps/s | 24.4830 KOps/s | |
test_instantiation_functorch | 1.6567ms | 1.5557ms | 642.8008 Ops/s | 635.0326 Ops/s | |
test_exec_functorch | 0.1888ms | 0.1452ms | 6.8850 KOps/s | 7.0304 KOps/s | |
test_exec_functional_call | 0.1953ms | 0.1364ms | 7.3324 KOps/s | 7.3691 KOps/s | |
test_exec_td_decorator | 0.3950ms | 0.1854ms | 5.3941 KOps/s | 5.3977 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8147ms | 0.6892ms | 1.4510 KOps/s | 1.4570 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8743ms | 0.6858ms | 1.4581 KOps/s | 1.4545 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7423ms | 0.5997ms | 1.6676 KOps/s | 1.6862 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7153ms | 0.6007ms | 1.6649 KOps/s | 1.6874 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 21.1457ms | 19.3539ms | 51.6693 Ops/s | 52.3269 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.0667ms | 19.3391ms | 51.7087 Ops/s | 52.2234 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.3003ms | 19.1952ms | 52.0964 Ops/s | 52.7962 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.5114ms | 19.2012ms | 52.0801 Ops/s | 52.5698 Ops/s | |
test_to_module_speed[True] | 1.0608ms | 0.9679ms | 1.0332 KOps/s | 1.0202 KOps/s | |
test_to_module_speed[False] | 1.0519ms | 0.9515ms | 1.0509 KOps/s | 1.0352 KOps/s | |
test_tc_init | 60.7430μs | 34.8861μs | 28.6647 KOps/s | 26.3805 KOps/s | |
test_tc_init_nested | 0.1057ms | 68.1989μs | 14.6630 KOps/s | 13.1850 KOps/s | |
test_tc_first_layer_tensor | 4.4274μs | 0.7149μs | 1.3987 MOps/s | 1.3987 MOps/s | |
test_tc_first_layer_nontensor | 0.1277ms | 2.2762μs | 439.3290 KOps/s | 421.2691 KOps/s | |
test_tc_second_layer_tensor | 10.2973μs | 1.4544μs | 687.5895 KOps/s | 695.8408 KOps/s | |
test_tc_second_layer_nontensor | 70.6040μs | 3.0207μs | 331.0546 KOps/s | 321.9338 KOps/s | |
test_unbind | 0.2281s | 11.1606ms | 89.6008 Ops/s | 141.8442 Ops/s | |
test_full_like | 12.3229ms | 10.3087ms | 97.0055 Ops/s | 96.3653 Ops/s | |
test_zeros_like | 5.5395ms | 4.5406ms | 220.2376 Ops/s | 218.3931 Ops/s | |
test_ones_like | 5.2828ms | 4.4813ms | 223.1508 Ops/s | 214.1781 Ops/s | |
test_clone | 9.0446ms | 7.1127ms | 140.5931 Ops/s | 136.3155 Ops/s | |
test_squeeze | 60.8840μs | 9.8226μs | 101.8061 KOps/s | 101.6964 KOps/s | |
test_unsqueeze | 0.1241ms | 71.7465μs | 13.9380 KOps/s | 13.6436 KOps/s | |
test_split | 0.3932ms | 0.1580ms | 6.3288 KOps/s | 6.2148 KOps/s | |
test_permute | 0.2372ms | 0.1798ms | 5.5627 KOps/s | 5.2960 KOps/s | |
test_stack | 54.1820ms | 52.2205ms | 19.1496 Ops/s | 19.0241 Ops/s | |
test_cat | 53.4870ms | 52.1520ms | 19.1747 Ops/s | 19.1332 Ops/s |
vmoens
added a commit
that referenced
this pull request
Jan 7, 2025
ghstack-source-id: b5cc7c7fea2d48394e63d289ee2d6f215c2333bc Pull Request resolved: #1159
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):