vidur
Vidur facilitates efficient LLM deployment planning with minimal GPU usage. It supports diverse models and allows testing of new ideas, scheduling algorithms, and performance evaluations under various workloads. Its features include pipeline parallelism and detailed performance tracing, making it an invaluable tool for system deployment enhancement.