GitHub - rahulunair/sql_llm: Finetune an LLM to generate SQL from text on Intel GPUs (XPUs) using QLoRA

Text-to-SQL Generation Using Fine-tuned LLMs on Intel GPUs(XPUs) and QLoRA.

This repository includes code for fine-tuning a Language Model for text-to-SQL tasks and for generating SQL queries with the fine-tuned model. Both the fine-tuning and generation processes leverage QLoRA, a Quantized Low-Rank Parameter Efficient finetuning method, enabled by Intel's BigDL library on Intel GPUs.

Prerequisites

Python 3.x
PyTorch
Transformers library
Datasets library
Intel Extension for PyTorch (IPEX)
Intel BigDL-LLM[XPU]

Installation

Clone this repo.

git clone https://github.com/your_username/your_repository.git

Install required python packages

pip install -r requirements

Install Intel BigDL llm package

pip install --pre --upgrade bigdl-llm[xpu] -f https://developer.intel.com/ipex-whl-stable-xpu

File Descriptions

finetune.py : Contains code for fine-tuning a pre-trained Language Model on text-to-SQL tasks.
generate.py : Contains code for generating SQL queries using a fine-tuned model.

Fine-Tuning a Model (finetune.py)

To finetune a model, run the finetune.py script

python finetune.py

============================================================
Training Parameters:
Foundation model:         NousResearch/CodeLlama-7b-hf
Model save path:          ./final_model
Device used:              xpu
Intel GPU:                Intel(R) Data Center GPU Max 1100
Batch size per device:    32
Gradient accum. steps:    4
Warmup steps:             100
Save steps:               20
Evaluation steps:         20
Max steps:                300
Learning rate:            0.0003
Max gradient norm:        0.3
Save total limit:         3
Logging steps:            20
============================================================

Here is how the loss chart looks at the end of 300 steps of finetuning:

As you can see the loss has a big drop in the intial steps and training loss gradually tapers to around 0.6:

Key Features:

Downloads a pre-trained model based on the given base model ID.
Tokenizes the input questions, context, and answers.
Fine-tunes the model using the tokenized data and qLoRA.
Saves the fine-tuned model.

Configuration:

BASE_MODEL: The pre-trained model to use for fine-tuning.
MODEL_PATH: Path to save the fine-tuned model.
DEVICE: Device to run the model on.

SQL Query Generation (generate.py)

To generate SQL queries using the fine-tuned model, run the generate.py script.

Key Features:

Uses either the base model or a fine-tuned model for SQL query generation.
Loads sample data and generates SQL queries for each sample.

Configuration:

BASE_MODEL: The base model to use for inference.
MODEL_PATH: Path to the fine-tuned model.
LORA_CHECKPOINT: Latest checkpoint for the fine-tuned model.
TEST_DATA: Path to the test data file.

Following a 15-minute training session, the finetuned model demonstrates enhanced proficiency in generating SQL queries that more accurately reflect the given questions, compared to the base model. With additional training steps, we can anticipate further improvements in the model's response accuracy:

Finetuned model generation:

Base model generation:

Default Configurations

Model

Default base model for fine-tuning: openlm-research/open_llama_3b
Model path for saving the fine-tuned LoRA adaptor (incase of interruptions): ./saved_model
Path for saving task based (here it is text to sql) LoRA adaptors: ./lora_models

Dataset

Default dataset for fine-tuning: b-mc2/sql-create-context

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
test_data		test_data
LLM_finetuning.ipynb		LLM_finetuning.ipynb
LLM_finetuning_detailed.ipynb		LLM_finetuning_detailed.ipynb
Readme.md		Readme.md
finetune.py		finetune.py
generate.py		generate.py
lora_adapters(1).png		lora_adapters(1).png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test_data

test_data

LLM_finetuning.ipynb

LLM_finetuning.ipynb

LLM_finetuning_detailed.ipynb

LLM_finetuning_detailed.ipynb

Readme.md

Readme.md

finetune.py

finetune.py

generate.py

generate.py

lora_adapters(1).png

lora_adapters(1).png

requirements.txt

requirements.txt

Repository files navigation

Text-to-SQL Generation Using Fine-tuned LLMs on Intel GPUs(XPUs) and QLoRA.

Prerequisites

Installation

File Descriptions

Fine-Tuning a Model (finetune.py)

Key Features:

Configuration:

SQL Query Generation (generate.py)

Key Features:

Configuration:

Default Configurations

Model

Dataset

Contributing

About

Releases

Packages

Languages

rahulunair/sql_llm

Folders and files

Latest commit

History

Repository files navigation

Text-to-SQL Generation Using Fine-tuned LLMs on Intel GPUs(XPUs) and QLoRA.

Prerequisites

Installation

File Descriptions

Fine-Tuning a Model (finetune.py)

Key Features:

Configuration:

SQL Query Generation (generate.py)

Key Features:

Configuration:

Default Configurations

Model

Dataset

Contributing

About

Topics

Resources

Stars

Watchers

Forks

Languages