Introduction to Chameleon-LLM Project
What is Chameleon?
Chameleon is a plug-and-play compositional reasoning framework that augments large language models (LLMs) such as GPT-4 with additional capabilities, enabling them to handle complex reasoning tasks across different domains. Given a query, the framework synthesizes a program that composes tools such as LLMs, vision models, web search engines, Python functions, and rule-based modules, depending on the user's needs. Essentially, Chameleon acts as a planner, determining the sequence of tools most likely to achieve the desired outcome.
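The planner-and-executor idea can be illustrated with a minimal Python sketch. The module names below mirror those described in the Chameleon paper, but the functions are stubs invented for illustration and are not the project's actual implementations.

```python
# Minimal sketch (not the official implementation) of Chameleon's planner-executor idea:
# a planner proposes a sequence of tool modules, and each module updates a shared cache.

def knowledge_retrieval(cache):
    # Stub: in practice this might call an LLM or a search engine for background facts.
    cache["knowledge"] = f"background facts about: {cache['question']}"
    return cache

def solution_generator(cache):
    # Stub: in practice this would prompt an LLM with the question plus retrieved context.
    cache["solution"] = f"reasoned answer using {cache.get('knowledge', 'no context')}"
    return cache

def answer_generator(cache):
    # Stub: extract a final answer from the generated solution text.
    cache["answer"] = cache["solution"].split()[-1]
    return cache

MODULES = {
    "Knowledge_Retrieval": knowledge_retrieval,
    "Solution_Generator": solution_generator,
    "Answer_Generator": answer_generator,
}

def plan(question):
    # In Chameleon, an LLM planner writes this sequence; it is hard-coded here for illustration.
    return ["Knowledge_Retrieval", "Solution_Generator", "Answer_Generator"]

def execute(question):
    cache = {"question": question}
    for name in plan(question):
        cache = MODULES[name](cache)
    return cache["answer"]

print(execute("Which force pulls objects toward Earth?"))
```

In the actual framework, the plan itself is generated by an LLM from natural-language descriptions of the available modules, which is what makes the composition adaptive to each query.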
Key Achievements and Features
Chameleon has demonstrated strong effectiveness and adaptability on tasks such as Science Question Answering (ScienceQA) and Tabular Math Word Problems (TabMWP). With GPT-4 as the underlying LLM, Chameleon reaches 86.54% accuracy on ScienceQA, outperforming the previous best published result by over 11%. On TabMWP, it improves on the state of the art by 17%, reaching 98.78% accuracy. This success is attributed to GPT-4's ability, when acting as the planner, to consistently select and combine the most suitable tools, whereas other planner models tend to make less reliable choices.
Technical Requirements
To use Chameleon, users need an OpenAI API key and, optionally, a Bing Search API key for the web-search modules. Setup consists of installing the Python dependencies, such as numpy, pandas, and transformers.
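As a rough sketch of the key-related setup, assuming the classic (pre-1.0) openai Python package and environment-variable names chosen here purely for illustration:

```python
# Illustrative setup; the environment-variable names are assumptions, not the repo's exact interface.
import os
import openai

openai.api_key = os.environ["OPENAI_API_KEY"]          # required for all LLM-based modules
bing_api_key = os.environ.get("BING_SEARCH_API_KEY")   # optional, enables web-search modules

if bing_api_key is None:
    print("Bing Search key not set; web-search modules will be unavailable.")
```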
Running Chameleon
To run Chameleon on tasks like ScienceQA or TabMWP, users execute the scripts provided in the project repository. These scripts initialize Chameleon with a specific planner model, such as GPT-4 or ChatGPT, and then step through the problem-solving process, combining the language model with the other tools in the inventory. The project supports both standard chain-of-thought prompting and Chameleon's program-guided, tool-composing approach.
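The outline below is a hypothetical reconstruction of what such a run script does; the flag names, file path, data schema, and solve function are illustrative and do not reflect the repository's actual interface.

```python
# Hypothetical outline of a task run script: load problems, solve each one with the
# chosen engine, and report accuracy. All names and paths here are assumptions.
import argparse
import json

def solve(problem, engine):
    # Placeholder for Chameleon's plan-and-execute pipeline using the chosen LLM engine.
    return "A"

def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--engine", default="gpt-4", help="e.g. gpt-4 or gpt-3.5-turbo")
    parser.add_argument("--data", default="data/scienceqa/problems.json")
    args = parser.parse_args()

    # Assumes a JSON dict mapping problem IDs to records with an "answer" field.
    with open(args.data) as f:
        problems = json.load(f)

    correct = 0
    for pid, problem in problems.items():
        prediction = solve(problem, args.engine)
        correct += int(prediction == problem["answer"])

    print(f"Accuracy: {correct / len(problems):.2%}")

if __name__ == "__main__":
    main()
```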
Exploring Results
Chameleon's behavior can be explored through visualizations and through comparisons against the metrics reported for other models. Using interactive tools like Jupyter notebooks, users can inspect Chameleon's outputs and the transitions between modules in the generated programs, revealing how the planner sequences tools for each problem.
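As one illustrative way to analyze such outputs, the snippet below counts module-to-module transitions, assuming each saved record exposes its executed module sequence under a "modules" key; the file name and schema are assumptions, not the repository's actual result format.

```python
# Count how often one module is followed by another across saved results.
import json
from collections import Counter

with open("results/chameleon_gpt4_scienceqa.json") as f:  # assumed file name
    results = json.load(f)

transitions = Counter()
for record in results.values():
    modules = record.get("modules", [])          # assumed key holding the executed sequence
    transitions.update(zip(modules, modules[1:]))

for (src, dst), count in transitions.most_common(10):
    print(f"{src} -> {dst}: {count}")
```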
Developing New Tasks
Developers who want to extend Chameleon to new tasks can create new module inventories, define language model prompts for the planner, build planners, and implement data loaders and evaluation methods. This modular design allows the framework to be customized to the demands of different domains.
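A minimal sketch of what adding a new tool module might look like, with class names and the registration pattern invented for illustration rather than taken from the repository:

```python
# Hypothetical new module plus inventory registration; not the repository's actual API.
class UnitConverter:
    """Rule-based module that normalizes lengths to meters before reasoning."""

    def run(self, cache):
        value, unit = cache.get("quantity", (1.0, "km"))
        factors = {"km": 1000.0, "cm": 0.01, "m": 1.0}
        cache["quantity_in_meters"] = value * factors.get(unit, 1.0)
        return cache

# New-task setup: extend the module inventory, then describe in the planner's prompt
# when this module should be selected.
MODULE_INVENTORY = {
    "Unit_Converter": UnitConverter(),
}

cache = {"quantity": (2.5, "km")}
cache = MODULE_INVENTORY["Unit_Converter"].run(cache)
print(cache["quantity_in_meters"])  # 2500.0
```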
Staying Connected
The Chameleon project is actively discussed and developed by a community of AI enthusiasts and experts. Developers and users are encouraged to engage with the project through direct contact with the lead developer, Pan Lu, or through platforms like Twitter and GitHub to share insights, improvements, and feedback.
Citation and Acknowledgment
Users who find Chameleon useful are invited to cite the accompanying paper in their research, acknowledging the developers' contributions and supporting the wider academic discussion around compositional reasoning frameworks.
Chameleon-LLM opens exciting possibilities for leveraging AI in diverse and complex tasks, showcasing the powerful synergy that comes from combining state-of-the-art language models with a comprehensive toolkit.