
Unleashing the Power of Cognitive Dynamics on Large Language Models


English | 中文



Code and data for the paper "CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models".

CogBench

CogBench is a bilingual benchmark specifically designed to evaluate the cognitive dynamics of Large Language Models (LLMs) in both Chinese and English. CogBench is divided into two parts based on the type of information flow: CogBench_a for articles and CogBench_v for short videos.

In this benchmark, both an LLM and a human are assigned the same initial profile and receive identical information flows over 10 iterations. After each iteration, they complete the same cognitive questionnaire, which uses a five-point Likert scale to let participants express their attitudes toward the current questions.
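For illustration, a single iteration's questionnaire response might be recorded as follows; the field names here are hypothetical, not necessarily the actual CogBench schema:

# Hypothetical questionnaire record (illustrative; not the actual CogBench schema)
response = {
    "iteration": 3,  # 1-10, one questionnaire per iteration
    "question": "I enjoy articles about urban gardening.",
    "rating": 4,     # five-point Likert scale: 1 = strongly disagree ... 5 = strongly agree
    "reason": "Recent items in the information flow matched the profile's interests.",
}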

CogBench aims to assess the cognitive alignment between the LLM and the human. The evaluation metrics include:

  1. Authenticity: Measures the consistency of ratings between the LLM and the human.
  2. Rationality: Assesses the quality of the reasoning the LLM provides for its ratings.
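As a rough sketch, authenticity can be pictured as a correlation between the two rating sequences; this is an assumption for illustration, and evaluation.py may use a different formula. Rationality, by contrast, is a judged score of the LLM's reasoning rather than a closed-form statistic.

# Illustrative only: authenticity as Pearson correlation between the LLM's and
# the human's five-point Likert ratings (the metric in evaluation.py may differ).
from statistics import correlation  # Python 3.10+

def authenticity(llm_ratings: list[int], human_ratings: list[int]) -> float:
    """Consistency of ratings between the LLM and the human."""
    return correlation(llm_ratings, human_ratings)

print(authenticity([4, 2, 5, 1, 3], [5, 2, 4, 1, 3]))  # 0.9 -> strong agreement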

CogGPT

CogGPT is an LLM-driven agent designed to showcase the cognitive dynamics of LLMs. Confronted with ever-changing information flows, CogGPT regularly updates its profile and methodically stores preferred knowledge in its long-term memory. This capability enables CogGPT to sustain role-specific cognitive dynamics, facilitating lifelong learning.
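A minimal, runnable sketch of this cycle is shown below; the function and variable names are illustrative stand-ins, not the actual coggpt/agent.py API:

# Toy sketch of CogGPT's iterative cycle (hypothetical names, not the real agent API)
def update_profile(profile: str, info: str) -> str:
    # Placeholder: the real agent would prompt an LLM to revise the profile.
    return f"{profile}; influenced by '{info}'"

profile = "a curious reader interested in gardening"
memory: list[str] = []  # long-term memory of preferred knowledge

information_flow = ["article on composting", "short video on bonsai pruning"]
for iteration, info in enumerate(information_flow, start=1):
    profile = update_profile(profile, info)  # regular profile update
    memory.append(info)                      # methodically store preferred knowledge
    print(f"iteration {iteration}: {profile} | memory: {memory}")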



News

  • 2024.01.17 - Paper released.
  • 2024.01.12 - CogBench released.
  • 2024.01.05 - Project initially released.

User Guide

Setup

Follow these steps to build CogBench:

  1. Clone the Repository: Clone this repository to your local environment (see the example commands below).
  2. Switch Directory: Use the cd command to enter the repository directory.
  3. Download Data: Download the CogBench dataset and save it in the dataset directory.
  4. Run Experiments: Implement your method on cogbench_a.json and cogbench_v.json for CogBench_a and CogBench_v, respectively, and record your experimental results.
  5. Evaluate Results: Fill in the eval_cogbench_a.json and eval_cogbench_v.json files with your experimental results for evaluation.
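For steps 1 and 2, for example:

git clone https://github.com/KwaiKEG/CogGPT.git
cd CogGPT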

Using CogGPT

  1. Declare the environment variable used to access the GPT-4 API:
export OPENAI_API_KEY=sk-xxxxx
  2. Run CogGPT with the default settings:
python coggpt/agent.py
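Before launching the agent, you can verify that the key is visible to the Python process (an illustrative check, not part of the repository):

import os

# Fail fast if the GPT-4 API key has not been exported in this shell.
assert "OPENAI_API_KEY" in os.environ, "export OPENAI_API_KEY before running coggpt/agent.py"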

Evaluation

To evaluate your method on the authenticity and rationality metrics, run the following command:

python evaluation.py --file_path <YOUR_FILE_PATH> --method <YOUR_METHOD_NAME> --authenticity --rationality

For example, to evaluate the CoT method on CogBench_v, run:

python evaluation.py --file_path dataset/english/eval_cogbench_v.json --method CoT --authenticity --rationality

The evaluation scores will be displayed as follows:

======= CoT Authenticity =======
Average authenticity: 0.15277666156947955
5th iteration authenticity: 0.3023255813953488
10th iteration authenticity: 0.13135593220338992
======= CoT Rationality =======
Average rationality: 3.058333333333333
5th iteration rationality: 3.7666666666666666
10th iteration rationality: 3.0833333333333335

Please refer to CogBench for more details.

Citation

@misc{lv2024coggpt,
      title={CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models}, 
      author={Yaojia Lv and Haojie Pan and Ruiji Fu and Ming Liu and Zhongyuan Wang and Bing Qin},
      year={2024},
      eprint={2401.08438},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
