
[Bug]: Missing metrics when logging hyperparameters on tensorboard #1298

Open
rogierz opened this issue Jan 26, 2023 · 4 comments · May be fixed by #1299
Labels
bug Something isn't working

Comments

@rogierz

rogierz commented Jan 26, 2023

🐛 Bug

When I try to log metrics tied to some hyperparameters to TensorBoard, the metric values are not stored.

To Reproduce

from stable_baselines3.common.logger import configure, HParam

tmp_path = "log_bug"
# set up logger
new_logger = configure(tmp_path, ["tensorboard"])
hp = HParam({"hparam": 1.0}, {"missing_metric": 2.0})

new_logger.record("hparams", hp)
new_logger.dump()

Relevant log output / Error message

No response

System Info

OS: Linux-5.15.79.1-microsoft-standard-WSL2-x86_64-with-glibc2.29 #1 SMP Wed Nov 23 01:01:46 UTC 2022
Python: 3.8.10
Stable-Baselines3: 1.6.2
PyTorch: 1.13.1+cu117
GPU Enabled: False
Numpy: 1.24.1
Gym: 0.21.0

Checklist

  • I have checked that there is no similar issue in the repo
  • I have read the documentation
  • I have provided a minimal working example to reproduce the bug
  • I've used the markdown code blocks for both code and stack traces.
@rogierz rogierz added the bug Something isn't working label Jan 26, 2023
@riccardosepe

I have the same issue when trying to log hparams metrics

rogierz added a commit to rogierz/stable-baselines3 that referenced this issue Jan 26, 2023
rogierz added a commit to rogierz/stable-baselines3 that referenced this issue Jan 26, 2023
rogierz added a commit to rogierz/stable-baselines3 that referenced this issue Jan 26, 2023
@araffin
Member

araffin commented Jan 26, 2023

@timothe-chaumont as you did the implementation in #984, could you have a look?

new_logger.dump()

I would expect dump(num_timesteps) there

@rogierz rogierz linked a pull request Jan 26, 2023 that will close this issue
rogierz added a commit to rogierz/stable-baselines3 that referenced this issue Feb 17, 2023
@timothe-chaumont
Contributor

You are right @rogierz, metric values that are passed to HParam through the metric_dict won't be saved. They are supposed to reference metrics that have been logged separately (otherwise they won't be displayed in HPARAMS).

In the documentation, the example mentions:

# define the metrics that will appear in the `HPARAMS` Tensorboard tab by referencing their tag
# TensorBoard will find & display metrics from the `SCALARS` tab
metric_dict = {
    "rollout/ep_len_mean": 0,
    "train/value_loss": 0.0,
}


So in your example you would need to log your custom metric with

new_logger.record("missing_metric", 2.0)

so that, when referenced in HParam, TensorBoard will find it and add it to the HPARAMS tab:


[Screenshot, 2023-02-20: the custom metric displayed in TensorBoard's HPARAMS tab]

@kingjin94

kingjin94 commented Jan 15, 2024

IMHO I might have an idea where this bug comes from: e.g. in HumanOutputFormat, logger.dump iterates over all keys to output, but the iteration zips the key/value pairs together with key_excluded. Note that nothing ensures that a logger's name_to_value and name_to_exclude have the same length (they are public, and one might want to use them directly for more specific logging than is possible with just log_value/mean).

I would suggest the following patch at the line

for (key, value), (_, excluded) in zip(sorted(key_values.items()), sorted(key_excluded.items())):

- for (key, value), (_, excluded) in zip(sorted(key_values.items()), sorted(key_excluded.items())):
+ for (key, value) in sorted(key_values.items()):
+     excluded = key_excluded.get(key, ('',))

A similar change is needed, e.g., in TensorboardOutputFormat, at the line

for (key, value), (_, excluded) in zip(sorted(key_values.items()), sorted(key_excluded.items())):