Skip to content

Releases: lucidrains/PaLM-rlhf-pytorch

0.2.1

05 Apr 14:28
Compare
Choose a tag to compare
fix a bug with the final norm in palm, thanks to @conceptofmind and @…

…dmahan93

0.2.0

23 Mar 14:04
Compare
Choose a tag to compare
address https://github.com/lucidrains/PaLM-rlhf-pytorch/issues/41 , b…

…e faithful to the paper

0.1.4

22 Mar 13:51
Compare
Choose a tag to compare
old action log probs should be the true distribution in the kl div lo…

…ss, addressing https://github.com/lucidrains/PaLM-rlhf-pytorch/issues/43

0.1.2

17 Mar 20:55
Compare
Choose a tag to compare
flash attention sdp context config only needs to be done once

0.1.1

17 Mar 03:53
Compare
Choose a tag to compare
fix assert

0.1.0

17 Mar 03:36
Compare
Choose a tag to compare
add ability to use flash attention if using pytorch 2.0, thanks to @c…

…onceptofmind for the initial PR!

0.0.68

13 Mar 04:56
Compare
Choose a tag to compare
0.0.68

0.0.67

12 Mar 22:25
Compare
Choose a tag to compare
fix silly error in masked kl div loss, thanks to @taynoel84

0.0.66

12 Mar 00:38
Compare
Choose a tag to compare
allow for setting critic palm from rlhftrainer

0.0.65

22 Feb 21:29
Compare
Choose a tag to compare
fix an error with the way action log prob is collected during the epi…

…sode rollouts, addressing https://github.com/lucidrains/PaLM-rlhf-pytorch/issues/31 and thanks to @kisseternity