[EKS][GPU Failure Handling]: Nvidia EC2 Instance Failure Handling in Managed Node Group #2324
Labels
EKS Managed Nodes
EKS Managed Nodes
EKS
Amazon Elastic Kubernetes Service
Proposed
Community submitted issue
Community Note
Tell us about your request
What do you want us to build?
(Out-of-Box) NativeAWS Experience for Nvidia EC2 Instance Failure Handling on EKS include:
Which service(s) is this request for?
This could be Fargate, ECS, EKS, ECR
EKS
Tell us about the problem you're trying to solve. What are you trying to do, and why is it hard?
What outcome are you trying to achieve, ultimately, and why is it hard/impossible to do right now? What is the impact of not having this problem solved? The more details you can provide, the better we'll be able to understand and solve the problem.
Are you currently working around this issue?
How are you currently solving this problem?
Additional context
Anything else we should know?
Attachments
If you think you might have additional information that you'd like to include via an attachment, please do - we'll take a look. (Remember to remove any personally-identifiable information.)
The text was updated successfully, but these errors were encountered: