reinforcement-learning-from-human-feedback

Here are 11 public repositories matching this topic...

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Updated Apr 20, 2024
Python

OpenLLMAI / OpenRLHF

An easy-to-use, scalable, and high-performance RLHF framework (supports 70B+ full tuning, LoRA, Mixtral, and KTO)

reinforcement-learning raylib transformers deepspeed large-language-models reinforcement-learning-from-human-feedback vllm

Updated May 8, 2024
Python

tatsu-lab / alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

natural-language-processing deep-learning instruction-following large-language-models reinforcement-learning-from-human-feedback

Updated Feb 24, 2024
Python

nlp-uoregon / Okapi

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

multilingual nlp bloom natural-language-processing reinforcement-learning chatbot dataset question-answering llama language-model large-language-models rlhf instruction-tuning reinforcement-learning-from-human-feedback

Updated Aug 18, 2023
Python

tlc4418 / llm_optimization

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.

deep-learning ensembles best-of-n large-language-models reinforcement-learning-from-human-feedback reward-models

Updated Mar 9, 2024
Python

clam004 / minichatgpt

An annotated tutorial of the Hugging Face TRL repo for reinforcement learning from human feedback, connecting the equations from PPO and GAE to the corresponding lines of code in the PyTorch implementation.

nlp reinforcement-learning deep-learning transformers deep-reinforcement-learning pytorch language-model fine-tuning large-language-models reinforcement-learning-from-human-feedback

Updated Feb 28, 2023
Jupyter Notebook

XplainMind / LLMindCraft

Shaping Language Models with Cognitive Insights

docker transformers pretraining deepspeed large-language-models reinforcement-learning-from-human-feedback instruct-tuning

Updated Feb 29, 2024
Python

liushunyu / Ask-AC

[TSMC] Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework

reinforcement-learning reinforcement-learning-from-human-feedback action-advising

Updated Mar 17, 2024
Python

ymetz / rlhfblender

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

react python reinforcement-learning experimentation human-ai-interaction reinforcement-learning-from-human-feedback

Updated Mar 15, 2024
Python

Almost-Intelligence / LMRax

LMRax is a framework built on JAX for training transformer language models with reinforcement learning, including training of the reward model.

reinforcement-learning transformer language-model jax reinforcement-learning-from-human-feedback

Updated Mar 3, 2023
Python

ymnseol / weekly-paper-reading-group

Summaries of papers related to the alignment problem in NLP

nlp natural-language-processing rlhf instruction-tuning reinforcement-learning-from-human-feedback

Updated May 29, 2023
