Open-domain Chatbot Augmented with Commonsense Knowledge

Master's thesis work published at the conference "Lithuanian MSc Research in Informatics and ICT".

Abstract

Building an open-domain dialogue system is a challenging research problem. To maintain a conversation with a human, a dialogue system must combine many qualities: being engaging and empathetic, showing a unique personality, and having general knowledge about the world. Prior research has shown that it is possible to build a chatbot that combines these features; this work explores the problem further. Most state-of-the-art dialogue systems are guided by unstructured knowledge such as Wikipedia articles, but there is little research on how structured knowledge bases can be used for open-domain dialogue generation. This work proposes using the structured knowledge base ConceptNet for knowledge-grounded dialogue generation. A novel knowledge extraction algorithm is developed and then used to incorporate knowledge into existing dialogue datasets. The state-of-the-art model BlenderBot is fine-tuned on the newly created datasets, and it is shown that knowledge augmentation of the datasets improves BlenderBot on various automated metrics and according to human evaluation.

Small technical description

The baseline model, BlenderBot 1, was fine-tuned on knowledge-augmented datasets. Each original dataset (BST, ConvAI2, WoW, ED) was preprocessed with the knowledge extraction algorithm. The algorithm extracts knowledge triples (assertions) from ConceptNet and adds the most relevant ones to the input utterance. Relevance is measured by the cosine similarity between the sentence embedding of the utterance and the embedding of the knowledge triple (treated as a short sentence). The extracted knowledge was appended to the dataset messages, with each ConceptNet relation treated as a special token. The latest version of the algorithm also extracts knowledge from the whole dialogue context, not only the last utterance.
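The ranking and appending steps can be illustrated with a minimal sketch. It is not the code from `extraction.py`: the bag-of-words `embed` function is a toy stand-in for a real sentence encoder, the `TRIPLES` list is hypothetical sample data rather than a ConceptNet query result, and the `<Relation>` special-token format is an assumption made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical ConceptNet-style assertions: (head, relation, tail).
TRIPLES = [
    ("dog", "IsA", "pet"),
    ("dog", "CapableOf", "bark"),
    ("guitar", "UsedFor", "playing music"),
    ("coffee", "HasProperty", "hot"),
]


def embed(text):
    """Toy stand-in for a sentence encoder: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def triple_as_sentence(triple):
    """Render a triple as a short sentence, e.g. 'dog capable of bark'."""
    head, relation, tail = triple
    # Split the CamelCase relation name into lowercase words.
    words = re.sub(r"(?<!^)(?=[A-Z])", " ", relation).lower()
    return f"{head} {words} {tail}"


def top_k_triples(utterance, triples, k=2):
    """Return up to k triples most similar to the utterance (score > 0)."""
    query = embed(utterance)
    scored = [(cosine(query, embed(triple_as_sentence(t))), t) for t in triples]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [t for score, t in scored[:k] if score > 0]


def augment(utterance, triples, k=2):
    """Append the selected triples, with the relation as a special token."""
    selected = top_k_triples(utterance, triples, k)
    knowledge = " ".join(f"{h} <{r}> {t}" for h, r, t in selected)
    return f"{utterance} {knowledge}".strip()


print(augment("My dog likes to bark at the mailman", TRIPLES))
```

In this sketch, extracting knowledge from the whole dialogue context would amount to passing the concatenated context to `top_k_triples` instead of only the last utterance; how the actual algorithm handles context may differ.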

Automated metrics

Human evaluation

An attempt was made to evaluate the developed model in a fashion similar to ACUTE-EVAL. Although there were not enough resources for a full-scale crowdsourced survey, a small group (~30) of friends and relatives took the survey. The survey is still open to anyone interested.

