DialogStudio/open-domain-dialogues at main · salesforce/DialogStudio

History

Name		Name	Last commit message	Last commit date
parent directory ..
AntiScam		AntiScam
ConvAI2		ConvAI2
Empathetic		Empathetic
HH-RLHF		HH-RLHF
PLACES3.5		PLACES3.5
Prosocial		Prosocial
SODA		SODA
ShareGPT		ShareGPT
chitchat-dataset		chitchat-dataset
README.md		README.md

README.md

Open-Domain Dialogues

Below is a general format for open domain dialogues:

{
    "dataset_name--train/val/test--dialog_id": {
        "original dialog id": str,
        "dialog index": int,
        "original dialog info": dict,
        "log": [
            {
                "turn id": int,
                "user utterance": str,
                "system response": str,
                "dialog history": str,
                "original user side information": dict,
                "original system side information": dict,
            },
         	...
        ],
        "prompt": [
            "This is a conversation between two speakers talking about history. Given the dialog context, please generate a relevant response.",
            ...
        ]
    },
    ...
}

Chitchat dialogues generally do not involve extra annotations. Therefore the "original user1/2 side information" are usually left blank. Unlike task-oriented dialogues, chitchat does not necessarily end on the user2 side (system side in task-oriented dialogues). So, there are some dialogues contain only user1 utterance in the last turn.

For SODA, we design 6 prompts for each dialog and below shows the template:

{
    "Imagine you are {speaker_system} and you are talking to {speaker_user}. Generate a coherent and appropriate response.",
    "In the role of {speaker_system}, engage with {speaker_user}. Formulate a response that is both consistent with the conversation and suitable to the context.",
    "As {speaker_system}, you are in a dialogue with {speaker_user}. Create a coherent and relevant reply that fits the ongoing discussion.",
    "Assuming the persona of {speaker_system}, you're conversing with {speaker_user}. Generate a logical and suitable response that aligns with the conversation.",
    "Imagine yourself as {speaker_system} engaging with {speaker_user}. Your task is to produce a coherent and fitting response to continue the conversation.",
    "Pretend to be {speaker_system} in a conversation with {speaker_user}. Construct a response that maintains the coherence of the dialogue and is appropriate for the context."
}

Where 'speaker_user' and 'speaker_system' represent the 'PersonX' name and the 'PersonY' name, respectively.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

open-domain-dialogues

open-domain-dialogues

AntiScam

AntiScam

ConvAI2

ConvAI2

Empathetic

Empathetic

HH-RLHF

HH-RLHF

PLACES3.5

PLACES3.5

Prosocial

Prosocial

SODA

SODA

ShareGPT

ShareGPT

chitchat-dataset

chitchat-dataset

README.md

README.md

README.md

Open-Domain Dialogues

Files

open-domain-dialogues

Directory actions

More options

Directory actions

More options

Latest commit

History

open-domain-dialogues

Folders and files

parent directory

Open-Domain Dialogues