Michael Hu michaelnny

20 followers · 5 following

Shanghai
www.vectortheta.com

Pinned

deep_rl_zoo (Public)

A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

Python · 93 stars · 7 forks

alpha_zero (Public)

A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

Python · 39 stars · 8 forks

muzero (Public)

A PyTorch implementation of DeepMind's MuZero agent

Python · 23 stars · 3 forks

InstructLLaMA (Public)

Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…

Jupyter Notebook · 29 stars · 8 forks

RAG-LLaMA (Public)

A clean and simple implementation of Retrieval Augmented Generation (RAG) that enhances the LLaMA chat model to answer questions from a private knowledge base. We use Tesla user manuals to build the know…

Jupyter Notebook 1

MM-LLaMA (Public)

Bring multimodality to the LLaMA model by leveraging ImageBind as the modal encoder. This project supports vision input (both images and short videos) to the LLaMA model, with text output generated…

Python · 3 stars · 1 fork