[Feature]: Deleting steps up to a given point #7357
Labels
a:app
Area: Frontend/Backend
a:cli
Area: Client
c:stitch
ty:feature_request
type of the issue is a feature request
Description
In HPC environments we often have job allocations (e.g. 10 days). It is possible that a save is triggered on 9 days and 20 hours (e.g. at step 100). After saving, logging just continues until the 10 days are over (e.g. until step 105). Then, we continue with a new job from that latest saved checkpoint at step 100. But that leads to a discrepancy: wandb logging is at step 105 already, but we restart from step 100 - so the graph will be messed up a bit. It would therefore be incredibly useful if some tooling exists to remove data points in a given range of steps.
Suggested Solution
A CLI interface that, given a project name and run as well as a steps-range, allows us to remove all data points within that range. From the outside looking in, this seems straightforward but I'm sure there are technical reasons why this is more difficult to do.
The text was updated successfully, but these errors were encountered: