Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] We need to keep the state of the jobs on the cluster consistent with the state of the jobs on the platform #3663

Open
2 of 3 tasks
Mrart opened this issue Apr 12, 2024 · 1 comment

Comments

@Mrart
Copy link

Mrart commented Apr 12, 2024

Search before asking

  • I had searched in the feature and found no similar feature requirement.

Description

  1. Synchronize the job status when the k8s/yarn cluster is abnormal
  2. If the operation is abnormal, the status can be synchronized to the platform
  3. After the platform is restarted, the job status can be synchronized

Usage Scenario

No response

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@caicancai
Copy link
Member

@Mrart I am happy to participate in this part of the discussion. Do you have a corresponding solution? I also think this function is very important. I have also been trying to sort it out recently.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants