[compile time] AOT Autograd is taking long time in tracing #125977
Comments
This one high prio?
Making it high priority, mostly to get an owner to take a look and check for low-hanging fruit, if any.
I ran the repro locally, switching out the backend for I found:
(1) the total time was 390s. 132 of that was coming from
(2) I noticed that @aorenste recently had a PR to speed up this function due to inefficient lookups of graph inputs (PR), that hadn't made it into fbcode yet. When I patch in that PR, the total drops to 251s. Still not great, but a >50% speedup. I now see:
Cc @Chillee partitioner
Trying to think more about where the low-hanging fruit in
(1) 6s: I tried making
(2) 21s: removing custom size dispatch on
(3) ~4s. When I print out any other metadata calls that are getting plumbed through FunctionalTensorMode here, I see many calls to
I'm not sure if (3) is worth the work immediately, but (2) definitely seems worth attempting to fix (PR incoming).
How do I look at the SVG for this?
Curious: is there anything specific about SVG that makes it easier to navigate than strobelight? also:
🐛 Describe the bug
profile - https://fburl.com/scuba/pyperf_experimental/on_demand/ghc8lpjr
xref - https://fb.workplace.com/groups/1075192433118967/permalink/1425136311457909/
Repro - D57090987
AOT Autograd is taking a large amount of time. This is the rough breakdown:
create_aot_dispatcher_function is taking 900 seconds, while compile_fx_inner is taking 375 seconds. So, around 500 seconds are spent in AOT Autograd.
To generate a strobelight profile, use
TORCH_COMPILE_STROBELIGHT=TRUE buck2 run ...
For more info on how to navigate the profile, see
https://fb.workplace.com/groups/257735836456307/posts/669969978566222
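For reference, the "around 500 seconds" figure can be reproduced with a small sketch. It assumes (as the breakdown above implies, but the profile itself is not shown here) that the compile_fx_inner time is nested inside the create_aot_dispatcher_function time, so the difference is the time attributable to AOT Autograd itself. The helper name exclusive_time is hypothetical and only for illustration.

```python
# Timings quoted in the issue description, in seconds.
CREATE_AOT_DISPATCHER_FUNCTION_S = 900
COMPILE_FX_INNER_S = 375


def exclusive_time(outer_s, inner_s):
    """Time spent in the outer frame but not in the nested inner frame.

    Assumption: the inner call runs entirely inside the outer call.
    """
    if inner_s > outer_s:
        raise ValueError("inner time cannot exceed outer time")
    return outer_s - inner_s


aot_autograd_s = exclusive_time(CREATE_AOT_DISPATCHER_FUNCTION_S, COMPILE_FX_INNER_S)
aot_share = aot_autograd_s / CREATE_AOT_DISPATCHER_FUNCTION_S

print(f"AOT Autograd: {aot_autograd_s}s "
      f"({aot_share:.0%} of create_aot_dispatcher_function)")
```

Under that assumption the exact difference is 525 seconds, which is a bit over half of the create_aot_dispatcher_function total.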
Error logs
No response
Minified repro
No response
Versions
N/A
cc @ezyang @gchanan @zou3519 @kadeng @msaroufim @bdhirsh @chauhang