Intents Classification for Neural Text Generation

The project consists of building an intent classifier whose purpose is to predict the sequence of labels in a dialogue.


Abstract

The hype around OpenAI's ChatGPT has more than ever sparked interest in AI-based bots, for which labeling and classification of utterances are a centerpiece of improving user experience. Broadly, Dialogue Act (DA) and Emotion/Sentiment (E/S) tasks are addressed with sequence labeling systems trained in a supervised manner. In this work, we propose four encoder-decoder models that learn generic representations adapted to spoken dialogue, which we evaluate on six datasets of different sizes from the Sequence labellIng evaLuatIon benChmark fOr spoken laNguagE (SILICONE) benchmark. The models use either a hierarchical or a non-hierarchical encoder, both based on pre-trained transformers (BERT/XLNet). We observe that the models fail to learn some datasets because of their inherent properties, but overall the BERT + GRU architecture achieves the best accuracy.

Getting Started

  1. Clone the repository:
     git clone https://github.com/konkinit/intent_classification.git
  2. Upgrade pip and install the dependencies:
     python -m pip install --upgrade pip
     pip install -r requirements.txt
  3. Run the script ./src/utils/get_datasets.py until all the SILICONE experiment datasets are downloaded (see the sketch below).
  4. Run the notebook ./notebooks/experimental_results.ipynb.
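Under the hood, the download step presumably pulls the SILICONE subsets from the HuggingFace Hub, where the benchmark is published under the silicone identifier. A minimal sketch, assuming the HuggingFace datasets library (the loop and printout are illustrative, not the actual logic of get_datasets.py):

from datasets import load_dataset

# The six SILICONE configurations used in the experiments below
SUBSETS = ["swda", "dyda_da", "mrda", "dyda_e", "meld_e", "meld_s"]

for subset in SUBSETS:
    ds = load_dataset("silicone", subset)  # downloads and caches the subset
    print(subset, {split: len(ds[split]) for split in ds})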

Architecture of the models

We design four models based on the encoder-decoder architecture below, where $\mathcal{T}$ is an encoder and $\mathcal{D}$ a decoder. In our case, the encoder is a pre-trained transformer (BERT or XLNet) and the decoder is a neural network, either a plain MLP or a GRU.

[Figure: encoder-decoder architecture (archi_plot)]
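To make the architecture concrete, here is a minimal sketch of the BERT + GRU variant in PyTorch with HuggingFace transformers; the class name, hidden size, and batching scheme are illustrative assumptions, not the repository's actual code:

import torch.nn as nn
from transformers import BertModel


class IntentClassifier(nn.Module):
    def __init__(self, n_labels: int, hidden_size: int = 256):
        super().__init__()
        # Encoder T: a pre-trained transformer (XLNetModel would work too)
        self.encoder = BertModel.from_pretrained("bert-base-uncased")
        # Decoder D: a GRU over the sequence of utterance representations,
        # followed by a linear projection onto the label set
        self.decoder = nn.GRU(
            self.encoder.config.hidden_size, hidden_size, batch_first=True
        )
        self.classifier = nn.Linear(hidden_size, n_labels)

    def forward(self, input_ids, attention_mask):
        # input_ids: (batch, n_utterances, seq_len) -- one dialogue per row
        b, n, s = input_ids.shape
        flat_ids = input_ids.view(b * n, s)
        flat_mask = attention_mask.view(b * n, s)
        # Use the pooled [CLS] output as each utterance's representation
        pooled = self.encoder(flat_ids, attention_mask=flat_mask).pooler_output
        utterances = pooled.view(b, n, -1)  # (batch, n_utterances, dim)
        hidden, _ = self.decoder(utterances)
        return self.classifier(hidden)  # (batch, n_utterances, n_labels)

Swapping the GRU for a plain MLP applied to each utterance representation independently yields the MLP variants; the GRU's advantage is that it conditions each utterance's label on the preceding dialogue context.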

Experimental results

The models we designed have been applied to several SILICONE datasets, yielding the following accuracies (%):

| Architecture | $\mathtt{SWdA}$ | $\mathtt{DyDA_a}$ | $\mathtt{MRDA}$ | $\mathtt{DyDA_e}$ | $\mathtt{MELD_e}$ | $\mathtt{MELD_s}$ |
|--------------|-----------------|-------------------|-----------------|-------------------|-------------------|-------------------|
| BERT + MLP   | 37.4            | 63.5              | 69.1            | 86.1              | 52.0              | 57.8              |
| BERT + GRU   | 44.0            | 81.9              | 69.3            | 86.7              | 60.5              | 70.3              |
| XLNet + MLP  | 39.1            | 61.7              | 69.3            | 85.7              | 52.3              | 53.7              |
| XLNet + GRU  | 58.7            | 78.3              | 69.3            | 85.3              | 51.2              | 63.9              |
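The reported figures are plain utterance-level accuracies. As a reminder of the metric, a minimal sketch with scikit-learn (the label arrays are made-up placeholders, not the notebook's actual variables):

from sklearn.metrics import accuracy_score

y_true = [0, 2, 1, 1, 3]   # gold dialogue-act labels
y_pred = [0, 2, 1, 0, 3]   # model predictions
print(f"accuracy: {100 * accuracy_score(y_true, y_pred):.1f}%")  # 80.0%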

About

An ENSAE Machine Learning for Natural Language Processing project that leverages transformers to build a dialogue intent classifier.
