Benchmark existing techniques using evaluation harness #7625

Open
1 of 3 tasks
Tracked by #7728
mrm1001 opened this issue May 2, 2024 · 0 comments
Assignees
Labels
P1 High priority, add to the next sprint topic:benchmark

Comments

@mrm1001
Member

mrm1001 commented May 2, 2024

Context on benchmark work

  • goal number 1 is to give users practical guidance on which techniques to try out on their dataset/use case

  • goal number 2 is to show that there is no “silver bullet” solution: the best technique depends on the dataset and use case, and Haystack can support them all

  • goal number 3 is to showcase the advanced evaluation/experimentation API (most advanced compared to competitors)

  • it’s not a research paper, so it should not be too “academic” (i.e. not too restricted in terms of which metrics or datasets to use, and not meant to be peer-reviewed or submitted to an academic conference)

  • Datasets

Tasks

  1. P1
    davidsbatista
  2. P1
    davidsbatista
2 participants