- Shanghai
- www.vectortheta.com
Block or Report
Block or report michaelnny
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
deep_rl_zoo
deep_rl_zoo PublicA collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
-
alpha_zero
alpha_zero PublicA PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
-
InstructLLaMA
InstructLLaMA PublicImplements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.