Recurrent-LLM
Introduction
Recurrent-LLM is an open-source implementation that draws from the paper "RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text." The project addresses a key limitation of GPT-style models: their inability to generate texts of arbitrary length, owing to the fixed-size context window of the Transformer architecture.
RecurrentGPT introduces a language-based simulacrum of the recurrence mechanism found in Recurrent Neural Networks (RNNs). By emulating the Long Short-Term Memory (LSTM) update process in natural language, RecurrentGPT can generate and manage long texts without losing context. Built on top of large language models such as ChatGPT, it maintains two kinds of memory, both expressed in natural language: a long-term memory stored on the hard drive and a short-term memory carried in the prompt, each updated at every generation step.
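As a rough illustration of this recurrence, the sketch below shows how one generation step could carry language-based memory forward. The helper `call_llm`, the prompt format, and the recency-based retrieval are assumptions made for illustration, not the project's actual implementation.

```python
# A minimal sketch of a RecurrentGPT-style generation step, assuming a generic
# call_llm(prompt) -> str helper; the real project's prompts and parsing differ.
def call_llm(prompt: str) -> str:
    raise NotImplementedError("plug in ChatGPT or an open-source LLM here")

def generation_step(short_term_memory: str, long_term_memory: list[str], last_paragraph: str):
    # Short-term memory is a running summary kept inside the prompt;
    # long-term memory is a store of earlier passages persisted on disk.
    relevant = "\n".join(long_term_memory[-3:])  # naive recency-based retrieval, for illustration only
    prompt = (
        f"Summary so far (short-term memory):\n{short_term_memory}\n\n"
        f"Relevant earlier passages (long-term memory):\n{relevant}\n\n"
        f"Previous paragraph:\n{last_paragraph}\n\n"
        "Write the next paragraph, then an updated summary, separated by '---'."
    )
    output = call_llm(prompt)
    next_paragraph, updated_memory = output.split("---", 1)
    long_term_memory.append(next_paragraph.strip())  # persist the new passage to the long-term store
    return updated_memory.strip(), long_term_memory, next_paragraph.strip()
```

In the paper's design, long-term memory is retrieved by relevance to the current step rather than by simple recency as sketched here.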
Key Features
- Arbitrary Text Generation: The core functionality of RecurrentGPT is to produce lengthy text passages seamlessly, overcoming the contextual limitations typical of traditional Transformer-based models.
- Interactivity and Interpretability: Users can interact with the generated content, editing and observing the natural-language memories. This boosts the interpretability and interactivity of the model, making it suitable for applications requiring ongoing user engagement.
- AI As Contents (AIAC): Moving beyond conventional AI-generated content (AIGC), the project demonstrates AIAC, a novel usage form in which models not only generate content but also engage users directly, as in interactive fiction, pointing toward a new generation of content creation.
- Personalized Interactive Fiction Creation: This feature allows for crafting tailored story experiences in which the narrative interacts directly with readers.
Supported Large Language Models (LLMs)
Recurrent-LLM is designed to work with OpenAI's ChatGPT as well as several open-source LLMs:
- OpenAI ChatGPT: Users must provide an OpenAI API key to activate this integration.
- Vicuna: Requires downloading the weights and configuring them according to the model-specific instructions.
- ChatGLM, Baichuan, Aquila, and Falcon: Each requires its own tokenizer and model-loading setup suited to its architecture (a generic loading sketch follows this list).
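For the ChatGPT backend, the API key is typically supplied through an environment variable or the project's configuration. For the open-source backends, loading generally follows the standard Hugging Face transformers pattern; the snippet below is a minimal sketch, and the checkpoint name and generation settings are placeholders rather than the project's exact defaults.

```python
# Illustrative sketch: loading one of the open-source backends with Hugging Face
# transformers. The checkpoint name below is a placeholder, not the project's default.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "tiiuae/falcon-7b-instruct"  # placeholder; swap in Vicuna, ChatGLM, Baichuan, etc.
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,  # half precision to fit consumer GPUs
    device_map="auto",          # requires the accelerate package
    trust_remote_code=True,     # needed by models that ship custom modeling code
)

inputs = tokenizer("Outline the opening chapter of a mystery novel.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```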
Setup and Requirements
To get started, users need to install specific packages available on GitHub, namely `transformers`, `peft`, `accelerate`, and `bitsandbytes`. Additional tools include `flagai` and `bminf`, which further extend performance and capabilities.
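A quick way to confirm the environment is ready is to check that each dependency is importable; the snippet below is only a convenience check, not part of the project itself.

```python
# Sanity check that the packages listed above are installed and importable.
import importlib

for pkg in ["transformers", "peft", "accelerate", "bitsandbytes", "flagai", "bminf"]:
    try:
        importlib.import_module(pkg)
        print(f"{pkg}: OK")
    except ImportError:
        print(f"{pkg}: missing -- install it, e.g. `pip install {pkg}`")
```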
Configuration
The configuration involves specifying global settings such as language options (English or Chinese) and selecting the desired open-source LLM model option from the supported list.
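The option names below are hypothetical and meant only to illustrate the kind of global settings involved; the project's actual configuration variables may be named differently.

```python
# Hypothetical global configuration mirroring the options described above;
# the real project's variable names and values may differ.
LANGUAGE = "en"            # "en" for English, "zh" for Chinese
LLM_BACKEND = "chatgpt"    # one of: "chatgpt", "vicuna", "chatglm", "baichuan", "aquila", "falcon"
OPENAI_API_KEY = ""        # required only when LLM_BACKEND == "chatgpt"
MODEL_PATH = ""            # local checkpoint path for the open-source backends
```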
Usage
Running Recurrent-LLM involves starting a web server with Python, after which users can interact with the model through a WebUI.
WebUI
Recurrent-LLM provides a user-friendly web interface, enabling easy interaction with the model and generated content, suitable for various practical applications.
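The project's actual WebUI exposes richer controls (for example, editing and observing the memories described above); the snippet below is only a minimal sketch of how a Gradio-based interface of this kind is typically launched, with a placeholder generation function.

```python
# Hedged sketch of a minimal Gradio WebUI; the project's real interface is richer.
import gradio as gr

def generate_next(instruction: str) -> str:
    # Placeholder: in the real application this would call the recurrent generation step.
    return f"(next paragraph generated for: {instruction})"

with gr.Blocks(title="Recurrent-LLM") as demo:
    inp = gr.Textbox(label="Instruction / plan for the next paragraph")
    out = gr.Textbox(label="Generated paragraph")
    gr.Button("Generate").click(generate_next, inputs=inp, outputs=out)

demo.launch(server_name="0.0.0.0", server_port=7860)
```

Once launched, the interface is reachable in a browser at the configured host and port (here, 7860, Gradio's default).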
Contribution and Support
The project encourages contributions from the community and suggests means for supporters to donate if they find the tool beneficial in reducing development time.
License
Recurrent-LLM is shared under the MIT License, permitting wide use and adaptation, fostering open-source collaboration and advancement.
Recurrent-LLM marks a significant step toward improving the generative capabilities of language models, with a focus on long-form, interactive, and personalized content creation. Its blend of cognitive-science-inspired memory mechanisms with deep learning frameworks underpins its approach to working around current context-length constraints.