You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Env.render() returns the observation of each agent in the environment. This observation is used to choose the best policy in q-learing. However the env.render() returns an array of arrays containing 2-dim arrays with values I do not understand. Is there anyone who can clarify this to me? How could these observations be used to conduct q-learning?
The text was updated successfully, but these errors were encountered:
Env.render() returns the observation of each agent in the environment. This observation is used to choose the best policy in q-learing. However the env.render() returns an array of arrays containing 2-dim arrays with values I do not understand. Is there anyone who can clarify this to me? How could these observations be used to conduct q-learning?
The text was updated successfully, but these errors were encountered: