-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CPU Spikes when upgrading to 2.4.10 from 2.4.0 #12535
Comments
@pryorda thanks for raising this! The baseline for CPU usage seems to be almost the same. It's true the peaks you're seeing in usage are higher on 2.14.10, but they don't seem really exaggerated to me. It would be helpful for us to figure out whether you're experiencing any other symptoms here, or whether you're experiencing a concrete problem as a result of this spike. From a high-level perspective, a 3rd of a CPU in usage doesn't seem to indicate any pathological behaviour. If we can narrow what the scope of the issue is (whether it's more about optimising the code or fixing what might be a regression in our codebase) we can carve out a more detailed plan on how to investigate this or assist you with your investigation. On that note, I think more data would also be helpful here. There have been a lot of changes to the code base between 2.14.0 and 2.14.10. We've been addressing some reports of staleness in discovery data and as a result introduced more work in the destination container. If you want to try and root cause this, here are my suggestions:
Hope this all makes sense! |
Thank you for responding. I will see what information I can gather to help with diagnosing the issue. I don't know how to fully debug the linkerd side of things so any advice you have is a good start. As far as bisecting the code to see differences, that's a bit out of my realm and might be a steep ask. I've looked at the change log and nothing stood out to me. Here is what it looks like on our prod cluster since we upgraded. |
@pryorda Did you get a chance to look any further here? |
What is the issue?
We observed CPU spikes in linkerd/controller after upgrading to linkerd 2.4.10 from 2.4.0 using the linkerd helm charts.
How can it be reproduced?
Not fully known.
Logs, error output, etc
Not sure which logs would be beneficial. Please tell me the logs you would like to have and I will obtain them.
output of
linkerd check -o short
Environment
Possible solution
Downgrade
Additional context
Purple is after downgrade to 2.4.0
Would you like to work on fixing this bug?
yes
The text was updated successfully, but these errors were encountered: