Azza - Grounded Q/A Conversational Agent 🤖

title

sdk

app_file

models

pinned

Azza

gradio

app.py

ml6team/keyphrase-extraction-kbir-inspec

false

Azza - Grounded Q/A Conversational Agent 🤖

We applied BERT to the Stanford Question Answering Dataset (SQuAD) to achieve natural and grounded human-level performance on question answering.

Link to demo: https://huggingface.co/spaces/jingxiangmo/Azza

Pipeline

Key-phrase extraction from the user's question [1].
Retrieval of the relevant Wikipedia article based on the extracted key-phrase (reference text).
Tokenization of the question and context.
Evaluation of the model and extraction of answer tokens using BERT [2].
Reconstruction of the answer from the tokens.
Wrap the answer with a generative language model for a better response.

Methodology

BERT (Bidirectional Encoder Representation from Transformers) is a pre-trained language model designed to improve nautral-language understanding in various tasks. BERT can find the answer given a question and reference by learning contextual relationships between words in a a text using bidirectional transformers.

We fine-tuned BERT for question answering using the Stanford Question Answering Dataset (SQuAD 2.0) [3], which is a reading comprehension dataset for training and evaluation. BERT learns to identify the most relevant information within the reference text to answer a given question.

Tokenization: The question and reference text from wikipedia are tokenized, which means they are converted into a format that BERT can understand (a sequence of tokens).
Input preparation: The tokenized question and reference are combined into a single input sequence, with special tokens added to indicate the beginning and end of the question and reference segments.
Contextual understanding: BERT processes the input sequence using its bidirectional transformer architecture, encoding contextual information about each token (word) in the sequence.
Answer prediction: BERT generates a probability distribution over the tokens in the reference text for both the start and end positions of the answer. The model identifies the tokens with the highest probabilities for the start and end positions as the answer span.
Answer extraction: The answer is extracted by combining the tokens from the identified start to end positions.

References

[1] https://huggingface.co/ml6team/keyphrase-extraction-kbir-inspec
[2] https://arxiv.org/pdf/1810.04805v2.pdf
[3] https://rajpurkar.github.io/SQuAD-explorer/

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github/workflows		.github/workflows
deliverables		deliverables
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

deliverables

deliverables

.gitignore

.gitignore

README.md

README.md

app.py

app.py

requirements.txt

requirements.txt

Repository files navigation

Azza - Grounded Q/A Conversational Agent 🤖

Pipeline

Methodology

References

About

Contributors 3

Languages

jingxiangmo/Azza

Folders and files

Latest commit

History

Repository files navigation

Azza - Grounded Q/A Conversational Agent 🤖

Pipeline

Methodology

References

About

Topics

Resources

Stars

Watchers

Forks

Languages