
Analysis Of The Context Size Impact In Deep Learning Conversational Systems


This repository holds implementations of the deep learning Dual Encoder LSTM Model and the Vector Space Model, both used to evaluate and analyze the impact of context size on the quality of responses.

Overview

Some of the code used here was produced in this hands-on, which implements the Dual Encoder LSTM Model from this paper. It also implements the Vector Space Model, which was used as a baseline in this research.
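
For orientation, the core idea of the Dual Encoder can be sketched in a few lines. The sketch below is a minimal illustration in modern Keras, not the TensorFlow 0.x code in this repository; the vocabulary size, dimensions, and layer names are assumptions:

```python
import tensorflow as tf

# Shared LSTM encoder for context and response (Lowe et al. style dual
# encoder). Vocabulary size, dimensions, and names are illustrative.
VOCAB_SIZE, EMBED_DIM, RNN_DIM, MAX_LEN = 50000, 100, 256, 160

context_in = tf.keras.Input(shape=(MAX_LEN,), dtype="int32")
response_in = tf.keras.Input(shape=(MAX_LEN,), dtype="int32")

embed = tf.keras.layers.Embedding(VOCAB_SIZE, EMBED_DIM, mask_zero=True)
encoder = tf.keras.layers.LSTM(RNN_DIM)  # same weights encode both inputs

c = encoder(embed(context_in))   # context vector, shape (batch, RNN_DIM)
r = encoder(embed(response_in))  # response vector, shape (batch, RNN_DIM)

# Bilinear score sigmoid(c^T M r): predict a response vector from the
# context with a learned matrix M, then take the dot product with r.
predicted_r = tf.keras.layers.Dense(RNN_DIM, use_bias=False)(c)
logit = tf.keras.layers.Dot(axes=1)([predicted_r, r])
prob = tf.keras.layers.Activation("sigmoid")(logit)

model = tf.keras.Model([context_in, response_in], prob)
model.compile(optimizer="adam", loss="binary_crossentropy")
```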

Configuration

The code uses Python 3. Clone the repository and install all necessary packages:

1. Install TensorFlow (version 0.11 and above works correctly; version 0.10 has not been tested).
2. (Optional) Install CUDA + cuDNN (recommended for GPU support).
3. pip install -U pip
4. pip install -r requirements.txt

Dialogue dataset

Experiments can be performed using the Ubuntu Dialogue Corpus version 2.0, featured in this paper, whose generation script is available in this repository. However, since the goal of the research was to understand the impact of context size on predicting the next utterance, it was necessary to modify the generation script to produce training sets with the number of turns specified by an argument. The modified script can be found in the scripts folder.

Training set generation with the modified script

To generate the training sets, follow the steps described in this repository; the only difference is the addition of a subparser option to the train parser that sets the desired number of turns (see the sketch after the example command below).

Subparser:

train: training set generator

  • -t: desired number of turns

Example for generating a set consisting of contexts with 2 turns:

python create_ubuntu_dataset_modificado.py --data_root ./dados -o 'train.csv' -t -s -l train -t 2
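
If you prefer to modify the script yourself, the sketch below shows one way such a subparser option can be wired with argparse. It is not the repository's exact code: the top-level flags simply mirror the example command above, and the destination names, help texts, and defaults are assumptions:

```python
import argparse

# The original create_ubuntu_dataset.py already uses subparsers for the
# dataset splits, so the turns option attaches to the `train` subcommand.
parser = argparse.ArgumentParser(description="Ubuntu Dialogue Corpus generator")
parser.add_argument("--data_root", default=".", help="corpus root directory")
parser.add_argument("-o", "--output", default="train.csv", help="output CSV")
parser.add_argument("-t", dest="tokenize", action="store_true")
parser.add_argument("-s", dest="stem", action="store_true")
parser.add_argument("-l", dest="lemmatize", action="store_true")

subparsers = parser.add_subparsers(dest="command")
train_parser = subparsers.add_parser("train", help="training set generator")
train_parser.add_argument("-t", "--turns", type=int, default=None,
                          help="desired number of turns in each context")

args = parser.parse_args()
print(args.command, getattr(args, "turns", None))
```

Note that argparse keeps the two `-t` flags separate: the one before the `train` subcommand belongs to the main parser, while the one after it belongs to the subparser.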

Generate the training sets with the modified script; for the validation and test sets, use the original script, or download all required sets here. Finally, move all files to the ./Data folder.

Preprocessing

Before training the deep learning model, the datasets need to be converted from CSV to the TFRecord format.

cd scripts
python prepare-data.py
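
Conceptually, this step reads each (context, utterance, label) row from the CSV, maps the text to integer ids, and serializes the result as a tf.train.Example. The sketch below illustrates that conversion under assumptions of my own (tokenizer, vocabulary, and file names); it is not the repository's prepare-data.py:

```python
import csv
import tensorflow as tf

def to_ids(text, vocab):
    # Whitespace tokenization with 0 as the unknown-word id (placeholder).
    return [vocab.get(tok, 0) for tok in text.split()]

def int64_list(values):
    return tf.train.Feature(int64_list=tf.train.Int64List(value=values))

vocab = {"hello": 1, "world": 2}  # placeholder vocabulary

with tf.io.TFRecordWriter("train.tfrecords") as writer, \
        open("train.csv", newline="") as f:
    reader = csv.reader(f)
    next(reader)  # skip the CSV header row
    for context, utterance, label in reader:
        example = tf.train.Example(features=tf.train.Features(feature={
            "context": int64_list(to_ids(context, vocab)),
            "utterance": int64_list(to_ids(utterance, vocab)),
            "label": int64_list([int(float(label))]),
        }))
        writer.write(example.SerializeToString())
```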

Dual Encoder LSTM Model

Training

python udc_train.py

Evaluation

python udc_test.py --model_dir=...

Example:

python udc_test.py --model_dir=./runs/1481183770/

Prediction

python udc_predict.py --model_dir=...

Example:

python udc_predict.py --model_dir=./runs/1481183770/

Vector Space Model

As a baseline, we used the Vector Space Model, available in the notebooks folder.
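
A common formulation of this baseline represents the context and each candidate response as TF-IDF vectors and ranks candidates by cosine similarity. The sketch below is an assumption about the notebook's approach, using scikit-learn; the function and the toy data are illustrative:

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer

def rank_candidates(context, candidates):
    # Fit TF-IDF on the context plus candidates; rows are L2-normalized
    # by default, so a dot product equals cosine similarity.
    vectors = TfidfVectorizer().fit_transform([context] + candidates)
    sims = (vectors[0] @ vectors[1:].T).toarray().ravel()
    return np.argsort(-sims)  # candidate indices, best match first

order = rank_candidates(
    "how do i install cuda on ubuntu",
    ["try installing the nvidia cuda toolkit", "i like turtles"],
)
print(order)  # [0 1]
```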