Evaluation

Measuring performance and accuracy of LLM applications.
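
As a rough illustration of what measuring accuracy and performance can look like, here is a minimal sketch in plain Python. It is not taken from any of the tools listed on this page. The names `evaluate` and `toy_app`, the exact-match scoring rule, and the two-item dataset are all assumptions made for the example; real applications usually need softer metrics than exact match.

```python
import time
from typing import Callable


def evaluate(app: Callable[[str], str], dataset: list[tuple[str, str]]) -> dict[str, float]:
    """Return exact-match accuracy and mean latency (in seconds) of `app` on `dataset`."""
    if not dataset:
        raise ValueError("dataset must contain at least one (question, expected) pair")

    correct = 0
    total_latency = 0.0
    for question, expected in dataset:
        # Time only the call to the application under test.
        start = time.perf_counter()
        answer = app(question)
        total_latency += time.perf_counter() - start

        # Assumed scoring rule: case-insensitive exact match after trimming whitespace.
        if answer.strip().lower() == expected.strip().lower():
            correct += 1

    n = len(dataset)
    return {"accuracy": correct / n, "mean_latency_s": total_latency / n}


def toy_app(question: str) -> str:
    """Hypothetical stand-in for a real LLM call; it returns canned answers."""
    canned = {
        "What is the capital of France?": "Paris",
        "What is 2 + 2?": "5",  # deliberately wrong, so accuracy is below 1.0
    }
    return canned.get(question, "")


dataset = [
    ("What is the capital of France?", "paris"),
    ("What is 2 + 2?", "4"),
]
result = evaluate(toy_app, dataset)
print(result["accuracy"])  # 0.5: one of the two answers matches
```

Replacing `toy_app` with a function that calls an actual model leaves the measurement loop unchanged. Evaluation frameworks typically add richer metrics and reporting on top of this basic pattern.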

Tools

  1. Galileo: Platform for LLM evaluation and observability.
  2. Ragas: Framework for evaluating Retrieval-Augmented Generation (RAG) pipelines.