
miniChatbot: A Humble Exploration of the Intricate Theoretical Foundations of Chatbots

Welcome to miniChatbot! This humble repository is a labor of love aimed at providing an accessible journey through the intricate theoretical foundations of chatbots and their code implementation. Our goal is simple: to share knowledge and learn together, one step at a time.

Our Humble Objective

miniChatbot strives to be a gentle guide for those curious about chatbots, whether you're just starting your journey or have been exploring for a while. We aim to break down complex concepts into bite-sized pieces, emphasizing understanding over complexity. Our focus is on:

Taking small steps, one iteration at a time
Exploring the math and theory behind chatbots with humility and curiosity
Keeping our code concise and approachable, like a friendly conversation

How to Run The Demo

Currently the code contains a simple re-implementation of LLaMA2 that emphasizes readability over maintainability and robustness. The code therefore differs from the official LLaMA2 implementation, but it remains compatible with the official checkpoints; please refer to the official repo to obtain the model checkpoint.
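
Because the re-implementation stays checkpoint-compatible, the official weights can be loaded directly. Below is a minimal sketch of what that might look like, assuming the official 7B checkpoint layout (a params.json file plus a consolidated.00.pth shard); Llama is a hypothetical stand-in for this repo's re-implemented model class:

import json
from pathlib import Path

import torch

# Directory produced by the official LLaMA2 download (7B variant).
ckpt_dir = Path("~/LLaMa2/7B").expanduser()

# Hyperparameters shipped alongside the official checkpoint.
params = json.loads((ckpt_dir / "params.json").read_text())

# The 7B checkpoint ships as a single consolidated shard.
state_dict = torch.load(ckpt_dir / "consolidated.00.pth", map_location="cpu")

# Llama is a hypothetical name for the re-implemented model class:
# model = Llama(**params)
# model.load_state_dict(state_dict)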

To run the chatbot demo, use the following command:

python3 -m demo [model name (mistral/llama)] [path to checkpoint directory] [path to tokenizer weights] [optional GPU flag; setting it to 0 forces CPU]

Examples:

python3 -m demo llama ~/LLaMa2/7B/ ~/LLaMa2/tokenizer.model
python3 -m demo mistral ~/mistral-7B-v0.1/ ~/mistral-7B-v0.1/tokenizer.model
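
The optional fourth argument controls GPU use; as described above, setting it to 0 should force CPU inference:

python3 -m demo llama ~/LLaMa2/7B/ ~/LLaMa2/tokenizer.model 0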

To get the Mistral model weights, run the following commands:

wget https://models.mistralcdn.com/mistral-7b-v0-1/mistral-7B-v0.1.tar
tar -xf mistral-7B-v0.1.tar
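
To verify the download, check that the archive's md5sum matches the published checksum (37dab53973db2d56b2da0a033a15307f):

md5sum mistral-7B-v0.1.tar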

To download the LLaMA2 weights, head to the official LLaMA2 repo at https://github.com/facebookresearch/llama#download, click "Request a new download link", and wait for Meta to accept your request.

The Philosophy

In the spirit of humility, miniChatbot is built upon these simple philosophical pillars:

  • Iteration: Follow along as we build our chatbot step by step, learning and growing with each iteration.

  • Mathematics: Delve into the mathematical underpinnings of our chatbot with humility and awe.

  • Papers: Explore research papers and academic articles with us, marveling at the wisdom of those who came before.

  • Code: Dive into our humble codebase, where simplicity and clarity reign supreme.

Table of Contents

  • Introduction to Large Language Models
    • Tokenization

      • BPE
    • Transformer neural network architecture

      • scaled dot product attention
      • positional embedding
    • LLM inference strategy

      • greedy
      • random sampling
        • top-k
        • top-p
  • LLaMA2
  • Mistral
    • Sliding Window Attention
    • Sparse Mixture of Experts
    • KV-Cache with rolling buffer
    • inference strategy with rolling buffer KV-cache
  • Finetuning
    • LoRA: Low-Rank Adaptation of Large Language Models
    • qLoRA

Todo list

  • re-implement llama2 inference with simplicity and ease of understanding in mind
  • write simplest working code of chatbot demo
  • re-implement mistral inference with simplicity and ease of understanding in mind
  • explain the idea behind sliding window attention
  • write the math derivation of rotary embedding and its code implementation
  • explain the idea behind the RMS normalization paper
  • explain the idea behind SwiGLU activation function
  • write a brief explanation about transformer architecture
  • explain the probability theory behind the scaled dot product denominator
  • implement RAG
  • explain RAG and its theoretical foundations
  • write a gentle introduction to prompt engineering
  • implement LoRA
  • explain LoRA paper

We Humbly Welcome You to Collaborate

miniChatbot is a humble endeavor, and we welcome collaboration from all who approach with humility and respect. Whether you're a seasoned expert or a humble novice, your contributions are valued and appreciated. For guidance on how to humbly contribute, please consult our CONTRIBUTING.md file.

Humbly Seeking Feedback

Your feedback is essential in our quest for continuous improvement. If you have suggestions, gentle critiques, or words of encouragement, please share them with us by opening an issue or reaching out directly.

License

miniChatbot is licensed under the MIT License, a humble gesture allowing for the free exchange of knowledge and ideas.

References:

  1. This code is inspired by Andrej Karpathy's great lecture: https://www.youtube.com/watch?v=PaCmpygFfXo&t=1274s
  2. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention Is All You Need, 2017.
  3. Hugo Touvron et al. Llama 2: Open Foundation and Fine-Tuned Chat Models, 2023.
  4. LLaMA2 official code: https://github.com/facebookresearch/llama
  5. Albert Q. Jiang et al. Mistral 7B, arXiv pre-print, 2023.
