The Sparsely Gated Mixture of Experts Layer for PyTorch

This repository contains the PyTorch re-implementation of the MoE layer described in the paper Outrageously Large Neural Networks for PyTorch.

Requirements

This example was tested using torch v1.0.0 and Python v3.6.1 on CPU.

To install the requirements run:

pip install -r requirements.txt

Example

The file test.py contains an example illustrating how to train and evaluate the MoE layer with dummy inputs and targets. To run the example:

python test.py

Acknowledgements

The code is based on the TensorFlow implementation that can be found here.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
mlp.py		mlp.py
moe.py		moe.py
requirements.txt		requirements.txt
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

mlp.py

mlp.py

moe.py

moe.py

requirements.txt

requirements.txt

test.py

test.py

Repository files navigation

The Sparsely Gated Mixture of Experts Layer for PyTorch

Requirements

Example

Acknowledgements

About

Releases

Packages

Languages

YeonwooSung/Pytorch_mixture-of-experts

Folders and files

Latest commit

History

Repository files navigation

The Sparsely Gated Mixture of Experts Layer for PyTorch

Requirements

Example

Acknowledgements

About

Topics

Resources

Stars

Watchers

Forks

Languages