Skip to content

Releases: lucidrains/st-moe-pytorch

0.1.7

29 Feb 15:31
Compare
Choose a tag to compare

Full Changelog: 0.1.6...0.1.7

0.1.6

24 Jan 14:05
Compare
Choose a tag to compare

Full Changelog: 0.1.5...0.1.6

0.1.5

14 Dec 20:14
Compare
Choose a tag to compare
make sure contiguous

0.1.4

21 Sep 15:14
Compare
Choose a tag to compare
router z loss should be calculated on the unnoised gating logits

0.1.2

21 Sep 04:12
Compare
Choose a tag to compare
allow for noising of gates

0.1.1

11 Sep 21:43
Compare
Choose a tag to compare
researcher will want to log the unweighted auxiliary losses

0.1.0

11 Sep 21:42
Compare
Choose a tag to compare
rename loss_coef to balance_loss_coef, sum the balance and router z-l…

…oss and return the total auxiliary loss and add some comments in readme on what to do with it

0.0.30

11 Sep 15:52
Compare
Choose a tag to compare
handle variable sequence lengths if `allow_var_seq_len = True` on `Ex…

…perts`

0.0.29

10 Sep 20:09
Compare
Choose a tag to compare
any combinatino of number of experts and world size should not break

0.0.28

10 Sep 16:59
Compare
Choose a tag to compare
oops