Skip to content

Implementation of the Asynchronous Advantage Actor Critic with Communication in TensorFlow 2

Notifications You must be signed in to change notification settings

Ralami1859/A3C2-in-TensorFlow-2

Repository files navigation

Asynchronous Advantage Actor-Critic with Communication in TensorFlow 2

The source-code used on the paper Multi-Agent Reinforcement Deep Learning with Emergent Communication, published on IJCNN'19. The paper describes the A3C2 algorithm, for multi-agent learning, with communication.

The implementation is done using Tensorflow2.

Contains 4 environments (Hidden Reward, Navigation, Pursuit, Traffic Intersection), and scripts to launch A3C2 and learn policies. Use the requirements.txt to install your dependencies and run the scripts.

4

Each agent is defined by 3 networks.

1

The algorithm is distributed, and multiple workers update the networks.

2

Gradients are pushed across multiple time-steps to optimize the communication network and enforce communication.

3

About

Implementation of the Asynchronous Advantage Actor Critic with Communication in TensorFlow 2

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages