rlhf

Here are 109 public repositories matching this topic...

AugustasMacijauskas / mlmi-thesis

Code for my thesis titled "Eliciting latent knowledge from language reward models" for the MPhil in Machine Learning and Machine Intelligence at the University of Cambridge

alignment interpretability rlhf

Updated Oct 5, 2023
Jupyter Notebook

navneet1083 / textsum-tune

Star

This project is based on fine-tuning LLM models (FLAN-T5) for text summarisation task using PEFT approach. All evaluation metrics being computed on ROUGE scoring and LoRA optimisation techniques being used for fine-tuning.

lora ppo peft ppo-agent huggingface-transformers rlhf flan-t5 llm-training

Updated Aug 8, 2023
Jupyter Notebook

colehaus / social-choice-rlhf

Star

An alternative RLHF reward model formulation from a social choice perspective

rlhf

Updated Apr 7, 2024
Python

vualidon / rewrite_retrieve_read_law

Star

RAG Law systems base on google search and Gemini Pro

law rag google-search-api llm rlhf gemini-pro

Updated Mar 14, 2024
Python

AMfeta99 / NLP_LLM

Star

This repository is dedicated to small projects and some theoretical material that I used to get into NLP and LLM in a practical and efficient way.

Updated May 6, 2024
Jupyter Notebook

akain0 / Reinforcement-Learning-

Star

Projects and Models built in Python leveraging PyTorch, implementing Reinforcement Learning algorithms for reward-based tasks.

reinforcement-learning reinforcement-learning-algorithms a3c lstm-neural-networks bellman-equation rlhf

Updated May 7, 2024
Jupyter Notebook

jddunn / rlhf

Star

Library built on TextRL for easy training and usage of fine-tuned models using RLHF, a rewards model, and PPO

ppo rlhf reward-model textrl

Updated Feb 28, 2024
Python

ChukwumaChukwuma / enyimba2_ai

Star

Applying quantum computing principles to large language models for more reliable, interpretable, and steerable systems.

machine-learning natural-language-processing reinforcement-learning ai artificial-intelligence quantum-computing llms generative-ai rlhf llama2

Updated Jan 5, 2024
Python

BARUDA-AI / Awesome-Preference-Optimization

Star

Survey of preference alignment algorithms

alignment direct preference-learning rlhf preference-alignment

Updated Feb 25, 2024

10mudassir007 / AI-CHATBOT

Star

Intelligent AI Chatbot that has the capability to learn from the user

python nlp ai learning-python chatbot nltk nlp-machine-learning nltk-python rlhf

Updated Mar 22, 2024
Python

ChukwumaChukwuma / enyimba_ai

Star

Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction

machine-learning natural-language-processing reinforcement-learning ai chatbot artificial-intelligence strategy policy-evaluation alphazero muzero prompt-engineering llms generative-ai rlhf llama2

Updated Jan 5, 2024
Python

SharathHebbar / dpo_chatgpt2

Star

Direct Preference Optimization of ChatGPT2 using TRL Library

decoder transformers text-generation dpo gpt2 trl llm rlhf chatgpt2

Updated Jan 24, 2024
Jupyter Notebook

congchan / llm

Star

Codebase and experiments of LLM(Large Language Modeling)

large-language-models rlhf

Updated May 5, 2024
Python

thisisHJLee / RLHF

Star

nlp reinforcement-learning language-model ppo rlhf supervised-finetuning reward-model

Updated Jul 20, 2023

ymnseol / weekly-paper-reading-group

Star

Summaries of papers related to the alignment problem in NLP

nlp natural-language-processing rlhf instruction-tuning reinforcement-learning-from-human-feedback

Updated May 29, 2023

MOONLAPSED / cognOS

Star

Python package for cognosis kb, syntax, and markup language. Under-construction.

agent rlhf local-llm llama2

Updated Apr 8, 2024
Python

himanshuvnm / Foundation-Model-Large-Language-Model-FM-LLM

Star

This repository was commited under the action of executing important tasks on which modern Generative AI concepts are laid on. In particular, we focussed on three coding actions of Large Language Models. Extra and necessary details are given in the README.md file.