Popular repositories

  1. Cherry_LLM Public

    [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

    Python 196 13

  2. HallusionBench Public

    [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    Python 177 2

  3. Reflection_Tuning Public

    [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

    Python 57 3

  4. Superfiltering Public

    [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

    Python 40 5

  5. DEBATunE Public

    [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements

    Python 8

Repositories

Showing 5 of 5 repositories
  • Cherry_LLM Public

    [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

    Python 196 13 0 0 Updated Mar 18, 2024
  • Superfiltering Public

    [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

    Python 40 5 0 0 Updated Mar 18, 2024
  • Reflection_Tuning Public

    [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

    Python 57 3 0 0 Updated Mar 18, 2024
  • HallusionBench Public

    [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    Python 177 BSD-3-Clause 2 0 0 Updated Mar 17, 2024
  • DEBATunE Public

    [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements

    Python 8 0 0 0 Updated Feb 19, 2024
