rlhf
Here are 109 public repositories matching this topic...
Code for my thesis titled "Eliciting latent knowledge from language reward models" for the MPhil in Machine Learning and Machine Intelligence at the University of Cambridge
Updated Oct 5, 2023 - Jupyter Notebook
This project fine-tunes an LLM (FLAN-T5) for text summarisation using a PEFT approach. LoRA is used for fine-tuning, and all evaluation is based on ROUGE scores.
Updated Aug 8, 2023 - Jupyter Notebook
An alternative RLHF reward model formulation from a social choice perspective
Updated Apr 7, 2024 - Python
Open efforts to implement ChatGPT-like models and beyond.
Updated May 10, 2023
SPIN, a new technique introduced in 2024 after RLHF and SFT showed promising results
Updated Jan 17, 2024
RAG system for law, based on Google Search and Gemini Pro
Updated Mar 14, 2024 - Python
This repository implements important tasks on which modern Generative AI concepts are built. In particular, we focussed on three coding tasks for Large Language Models. Further details are given in the README.md file.
Updated Mar 28, 2024 - Jupyter Notebook
This repository is dedicated to small projects and some theoretical material that I used to get into NLP and LLMs in a practical and efficient way.
Updated May 6, 2024 - Jupyter Notebook
Projects and models built in Python with PyTorch, implementing reinforcement learning algorithms for reward-based tasks.
Updated May 7, 2024 - Jupyter Notebook
Library built on TextRL for easy training and use of fine-tuned models with RLHF, a reward model, and PPO
Updated Feb 28, 2024 - Python
A program that enhances and customizes ChatGPT's underlying pre-trained LLM with a transformer architecture. Based on OpenAI's beta InstructGPT fine-tuned model.
Updated Jul 30, 2023
Applying quantum computing principles to large language models for more reliable, interpretable, and steerable systems.
Updated Jan 5, 2024 - Python
Survey of preference alignment algorithms
Updated Feb 25, 2024
Intelligent AI chatbot that can learn from the user
Updated Mar 22, 2024 - Python
Some experiments with activation steering in LLMs
Updated Jan 21, 2024 - Python
Researching the reinforcement learning algorithm of ChatGPT
Updated Apr 7, 2023 - Jupyter Notebook
Reinforcement Learning Tutorial (强化学习教程)
Updated Sep 10, 2023
Large Language Model for Competitive Programming
Updated Apr 28, 2023 - Python