Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 621 Bytes

evaluation.md

File metadata and controls

12 lines (8 loc) · 621 Bytes

Evaluation

Evaluating an AgentKit app can be done on multiple levels:

  • Routing layer: Evaluate the meta agent's accuracy of choosing the right action plan based on the user query
  • Tool layer: Evaluate individual tools
  • Output layer: Evaluate the final output quality

AgentKit evaluation layers

AgentKit natively integrates with LangSmith, which is a useful tool for tracing and tracking the performance of your app. https://docs.smith.langchain.com/

See Optional Features for instructions to set up LangSmith.