ChatLLM-Web: Transforming Communication with AI
Introduction
ChatLLM-Web is a groundbreaking web-based platform that allows users to interact with large language models (LLMs) such as Vicuna directly in their web browsers. The project aims to provide safe, private, and server-free AI communication, accelerated by WebGPU and built on web-llm.
Key Features
- Browser-Based Operation: ChatLLM-Web eliminates the need for server support by running entirely within the browser, powered by WebGPU for accelerated performance.
- Efficient Processing: The model runs in a separate web worker, ensuring a smooth user experience by never blocking the browser's interface (a minimal sketch of this split follows this list).
- Simple Deployment: Users can deploy their own ChatLLM-Web instance to Vercel with a single click, with setup taking less than a minute.
- Model Caching: The model needs to be downloaded only once; afterwards it loads from the browser cache (see the caching sketch below).
- Multiple Chat Support: Multiple conversations are supported, with all data stored in the local browser to preserve privacy.
- Enhanced Response Features: The platform supports markdown and streaming responses, including math and code highlighting.
- Responsive Design: ChatLLM-Web offers a user-friendly, adaptive interface, including dark mode.
- Offline Accessibility: As a Progressive Web App (PWA), the app can be installed and used entirely offline.
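The UI-thread/worker split behind the Efficient Processing bullet can be pictured with a short TypeScript sketch. The file names and message shapes ("generate", "token", "done") are illustrative assumptions, not ChatLLM-Web's actual protocol, and generateTokens() is a stand-in for the real web-llm streaming call:

```ts
// main.ts -- UI thread: hand prompts to the worker, render streamed tokens.
// File names and message shapes here are hypothetical placeholders.
const worker = new Worker(new URL("./llm.worker.ts", import.meta.url), {
  type: "module",
});

worker.onmessage = (event: MessageEvent<{ type: string; text?: string }>) => {
  if (event.data.type === "token") {
    // Append each token as it streams in; the page never blocks on inference.
    document.querySelector("#answer")!.textContent += event.data.text ?? "";
  }
};

worker.postMessage({ type: "generate", prompt: "Hello, Vicuna!" });

// llm.worker.ts -- worker thread: run inference off the UI thread.
// generateTokens() stands in for the actual web-llm generation call.
declare function generateTokens(prompt: string): AsyncIterable<string>;

self.onmessage = async (event: MessageEvent<{ type: string; prompt: string }>) => {
  if (event.data.type !== "generate") return;
  for await (const token of generateTokens(event.data.prompt)) {
    self.postMessage({ type: "token", text: token });
  }
  self.postMessage({ type: "done" });
};
```

Because postMessage is asynchronous, the page keeps handling input and rendering while the worker computes tokens.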
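Likewise, the download-once behavior behind the Model Caching bullet is the kind of pattern the standard browser Cache API supports; in this sketch the cache name and URL are placeholders, and the project's actual persistence mechanism may differ:

```ts
// Download the model weights once, then serve them from the browser cache.
// The cache name and URL are placeholders, not ChatLLM-Web's actual values.
async function fetchModelWeights(url: string): Promise<ArrayBuffer> {
  const cache = await caches.open("model-weights");
  const cached = await cache.match(url);
  if (cached) {
    // Every load after the first skips the multi-gigabyte download.
    return cached.arrayBuffer();
  }
  const response = await fetch(url);
  // Store a clone so the response body below can still be read.
  await cache.put(url, response.clone());
  return response.arrayBuffer();
}
```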
Usage Instructions
ChatLLM-Web requires a browser with WebGPU support, such as Chrome 113 or Chrome Canary; earlier Chrome versions do not expose WebGPU. For optimal speed, the app needs a GPU with roughly 6.4 GB of memory, though it still runs on GPUs with less memory at a slower response rate. On first use, the Vicuna-7b model must be downloaded (about 4 GB); subsequent sessions load it from the browser cache, which is far quicker.
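Whether a given browser qualifies can be checked at runtime with standard WebGPU feature detection; a minimal sketch (the TypeScript typings assume the @webgpu/types package, while at runtime only standard browser APIs are used):

```ts
// Feature-detect WebGPU before attempting to load the model.
async function hasWebGPU(): Promise<boolean> {
  if (!("gpu" in navigator)) return false; // API absent (e.g. Chrome < 113)
  const adapter = await navigator.gpu.requestAdapter();
  return adapter !== null; // null when no suitable GPU is available
}

hasWebGPU().then((supported) => {
  console.log(supported ? "WebGPU is available" : "WebGPU is not supported");
});
```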
Future Developments
The roadmap for ChatLLM-Web mixes completed work with planned enhancements:
- Done: LLM answer generation running in a web worker.
- Done: multi-conversation support.
- Done: Progressive Web App (PWA) features (a registration sketch follows this list).
- Planned: settings for user interface preferences, device management, cache handling, and model configuration, including support for multiple models.
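Offline operation as a PWA hinges on a service worker that caches the app shell; registration follows the standard browser pattern (the "/sw.js" path is a placeholder, not necessarily the project's file name):

```ts
// Register a service worker so the app shell can be served offline.
// "/sw.js" is a placeholder path; the real file name may differ.
if ("serviceWorker" in navigator) {
  window.addEventListener("load", () => {
    navigator.serviceWorker
      .register("/sw.js")
      .then((reg) => console.log("Service worker registered:", reg.scope))
      .catch((err) => console.error("Service worker registration failed:", err));
  });
}
```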
Deployment to Vercel
Deploying ChatLLM-Web to Vercel is a streamlined process:
- Click the Vercel deployment button.
- Follow the guided steps; deployment completes in about a minute.
Development Setup
Developing with ChatLLM-Web locally involves cloning the repository and running a few simple commands:
```bash
git clone https://github.com/Ryan-yang125/ChatLLM-Web.git
cd ChatLLM-Web
npm i
npm run dev
```
Conclusion
ChatLLM-Web harnesses WebGPU and web-llm to deliver a private, efficient AI communication tool directly in users' browsers, pointing toward a future of seamless, server-free interaction with advanced language models. For more information and updates, see the mlc.ai/web-llm page.