continuous-eval

Innovative Evaluation Metrics for LLM Applications

Product Description

continuous-eval is an open-source tool for data-driven evaluation of LLM applications. It takes a modular approach, assessing each segment of a pipeline with metrics specific to that segment. Its metric types cover RAG, code generation, and classification. It can draw on feedback and synthetic datasets for more thorough testing, and it supports custom metrics where the built-in ones are not enough.
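The modular idea described above can be illustrated with a minimal sketch in plain Python. It does not use the continuous-eval API: the names `retrieval_precision_recall`, `exact_match`, `MODULE_METRICS`, and `evaluate_pipeline` are hypothetical, chosen only to show each pipeline segment being scored with its own metric.

```python
from typing import Any, Callable, Dict, List


def retrieval_precision_recall(retrieved: List[str], relevant: List[str]) -> Dict[str, float]:
    """Score a retriever step by set overlap with the known relevant documents."""
    retrieved_set, relevant_set = set(retrieved), set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return {"precision": precision, "recall": recall}


def exact_match(answer: str, expected: str) -> Dict[str, float]:
    """Score a generator step by case-insensitive exact match."""
    matched = answer.strip().lower() == expected.strip().lower()
    return {"exact_match": float(matched)}


# Hypothetical registry: each pipeline segment is paired with its own metric.
MODULE_METRICS: Dict[str, Callable[[Any, Any], Dict[str, float]]] = {
    "retriever": retrieval_precision_recall,
    "generator": exact_match,
}


def evaluate_pipeline(outputs: Dict[str, Any], references: Dict[str, Any]) -> Dict[str, Dict[str, float]]:
    """Apply each segment's metric to that segment's output and reference."""
    return {
        name: metric(outputs[name], references[name])
        for name, metric in MODULE_METRICS.items()
    }


scores = evaluate_pipeline(
    outputs={"retriever": ["doc1", "doc2", "doc4"], "generator": "Paris"},
    references={"retriever": ["doc1", "doc2", "doc3"], "generator": "paris"},
)
print(scores)
```

Scoring the retriever and the generator separately makes it visible which segment is responsible when end-to-end quality drops. A custom metric would be one more function added to the registry.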
Project Details