continuous-eval
continuous-eval is an open-source tool providing a data-driven evaluation for LLM applications. It uses a modular approach to assess each pipeline segment with specific metrics, supporting RAG, code generation, and classification through diverse metric types. Leverage its ability to use feedback and synthetic datasets for thorough testing. Explore custom metrics for comprehensive evaluations.