ollama-grid-search

Enhance LLM Evaluation with an Automated Grid Search and A/B Testing Tool

Product Description

A Rust-based application that streamlines the evaluation of LLMs, prompts, and inference parameters by automating a grid search over candidate configurations. It supports A/B testing, concurrent evaluations, and experiment logging. Models can be retrieved from local or remote Ollama servers, and inference settings can be customized to suit different testing scenarios. Users can revisit previous experiments, view results in accessible formats, and download experiment data as JSON. Future enhancements will focus on improving data management and sharing features.
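To make the grid search idea concrete, here is a minimal sketch of how every combination of model, prompt, and parameter value could be enumerated before evaluation. It is not taken from the ollama-grid-search source: the `Config` struct, the `build_grid` function, and the model names are hypothetical and chosen only for illustration, and temperature stands in for whichever inference parameters are being varied.

```rust
// Hypothetical types for illustration; not the tool's actual data model.
#[derive(Debug, Clone, PartialEq)]
struct Config {
    model: String,
    prompt: String,
    temperature: f32,
}

/// Build every combination of model, prompt, and temperature
/// (the Cartesian product of the three lists).
fn build_grid(models: &[&str], prompts: &[&str], temperatures: &[f32]) -> Vec<Config> {
    let mut grid = Vec::with_capacity(models.len() * prompts.len() * temperatures.len());
    for model in models {
        for prompt in prompts {
            for &temperature in temperatures {
                grid.push(Config {
                    model: model.to_string(),
                    prompt: prompt.to_string(),
                    temperature,
                });
            }
        }
    }
    grid
}

fn main() {
    // Placeholder model names and prompts.
    let grid = build_grid(
        &["llama3", "mistral"],
        &["Summarize: {text}", "TL;DR: {text}"],
        &[0.2, 0.8],
    );
    // 2 models x 2 prompts x 2 temperatures = 8 configurations to evaluate.
    for (i, cfg) in grid.iter().enumerate() {
        println!("{i}: {cfg:?}");
    }
}
```

Each entry in the resulting list corresponds to one inference run. Comparing the outputs of entries that differ in a single field is what makes side-by-side A/B comparison possible.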
Project Details