Multi-Armed Bandit Test Bed

This code reproduces the 10-armed testbed from Section 2.3 of "Reinforcement Learning: An Introduction" (Second Edition) by Sutton & Barto. I am grateful to Richard Sutton for sharing his knowledge of reinforcement learning through such a wonderful textbook.
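For readers without the book at hand, here is a minimal sketch of the kind of experiment the testbed describes: an epsilon-greedy agent with sample-average value estimates, averaged over many randomly generated bandit problems. It is written in Python with only the standard library and is not the Rust implementation in this repository. The function names (`run_bandit`, `average_runs`) and the default parameters are assumptions chosen for illustration. The book averages over 2000 problems of 1000 steps each, while the defaults here are smaller so the sketch runs quickly.

```python
import random


def run_bandit(n_arms=10, n_steps=1000, epsilon=0.1, rng=None):
    """Run one epsilon-greedy agent on a freshly drawn bandit problem.

    Returns (rewards, optimal_flags), each with one entry per step.
    Hypothetical helper for illustration; not taken from the repository.
    """
    if rng is None:
        rng = random.Random()

    # True action values q*(a) are drawn from a unit normal distribution.
    q_true = [rng.gauss(0.0, 1.0) for _ in range(n_arms)]
    best_arm = max(range(n_arms), key=lambda a: q_true[a])

    q_est = [0.0] * n_arms  # sample-average estimates Q(a)
    counts = [0] * n_arms   # number of pulls N(a)
    rewards, optimal = [], []

    for _ in range(n_steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: uniform random arm
        else:
            # Exploit: pick a greedy arm, breaking ties at random.
            top = max(q_est)
            arm = rng.choice([a for a in range(n_arms) if q_est[a] == top])

        # Reward is the true value plus unit-variance Gaussian noise.
        reward = rng.gauss(q_true[arm], 1.0)

        # Incremental sample-average update: Q <- Q + (R - Q) / N.
        counts[arm] += 1
        q_est[arm] += (reward - q_est[arm]) / counts[arm]

        rewards.append(reward)
        optimal.append(arm == best_arm)

    return rewards, optimal


def average_runs(n_runs=200, n_steps=1000, epsilon=0.1, seed=0):
    """Average reward and optimal-action fraction per step over n_runs problems."""
    rng = random.Random(seed)
    avg_reward = [0.0] * n_steps
    frac_optimal = [0.0] * n_steps
    for _ in range(n_runs):
        rewards, optimal = run_bandit(n_steps=n_steps, epsilon=epsilon, rng=rng)
        for t in range(n_steps):
            avg_reward[t] += rewards[t] / n_runs
            frac_optimal[t] += optimal[t] / n_runs
    return avg_reward, frac_optimal
```

Calling `average_runs(epsilon=0.0)` and `average_runs(epsilon=0.1)` and plotting the two returned curves against the step index gives a rough, small-sample analogue of the greedy versus epsilon-greedy comparison.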

Components

This code has two main parts:

Experiment-running and data generation: written in Rust
Data visualization: written in Python

Running

Use the three scripts in 'bin' to generate three figures:

Figure 2.2: bin/fig_2_2
Figure 2.4: bin/fig_2_4
Figure 2.5: bin/fig_2_5

Each script will output files to the 'results' directory.

Setup

For the Python visualization step, I recommend creating a Conda environment named 'plot' with the required dependencies:

conda create -n plot python=3
conda activate plot
conda install matplotlib numpy pandas
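After activating the environment, a quick check like the following can confirm that the three packages are importable before running the figure scripts. This is a small sketch, not part of the repository. The helper name `missing_packages` is an assumption, and it only tests that each package can be located, not that a particular version is installed.

```python
import importlib.util

# Packages installed by the conda command above.
REQUIRED = ("matplotlib", "numpy", "pandas")


def missing_packages(names=REQUIRED):
    """Return the names that cannot be located for import (hypothetical helper)."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All plotting dependencies are importable.")
```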

