llama-cpp-agent
Explore a framework optimized for seamless interaction with Large Language Models, enabling features like a chat interface, structured output, function execution, retrieval augmented generation, and agentic chain processing. Utilizing guided sampling, it facilitates function calls and structured output across different servers, compatible with llama-cpp-python and TGI, suitable for varied use-cases from casual chatting to advanced function execution, ensuring integration with OpenAI tools and others.