pykoi-rlhf-finetuned-transformers
Pykoi is an open-source Python library that facilitates the optimization of large language models utilizing Reinforcement Learning with Human Feedback (RLHF). It features a unified interface for collecting user feedback in real-time, finetuning, and comparing different models. Key functionalities include a UI for chat history storage, tools for efficient model performance comparison, and RAG chatbot integration. Compatible with CPU and GPU environments, Pykoi supports models from OpenAI, Amazon Bedrock, and Huggingface, aiding in fine-tuning models with custom datasets for improved precision and relevance.