Evaluation¶ Measuring performance and accuracy of LLM applications. Tools¶ Galileo: Platform for LLM evaluation and observability. Ragas: Framework for evaluating Retrieval Augmented Generation (RAG) pipelines.