Discover Cortex.cpp: A Local AI API Platform
Cortex.cpp is a Local AI API Platform designed to help users run and customize large language models (LLMs) efficiently on their own hardware. Under active development, it aims to set a new standard for AI deployment across applications and devices. Here's a detailed look at what Cortex.cpp offers:
Key Features
- User-Friendly Command-Line Interface (CLI): Inspired by Ollama, Cortex.cpp provides a straightforward CLI that is approachable for beginners and experts alike.
- Comprehensive C++ Implementation: The platform is built entirely in C++, ensuring it can be seamlessly packaged into desktop and mobile apps.
- Flexible Model Access: Cortex can pull models from Hugging Face as well as from its own built-in model registry.
- Universal Model Storage: Models are stored in universal file formats rather than opaque blobs, which makes them easier to integrate with and manage.
- Swappable Engines: Users can choose between engines such as llama.cpp (the default), ONNX Runtime, and TensorRT-LLM to suit their specific needs; see the sketch after this list.
- Deployment Options: Cortex can function as a standalone API server or be embedded directly into applications like Jan.ai.
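As a quick sketch of how engine switching works in practice (the command names follow the project's CLI documentation at the time of writing, but treat the exact engine identifiers as assumptions that may differ by version):

cortex engines list                  # show installed and available engines
cortex engines install onnxruntime   # add an alternative engine (identifier assumed; check the docs)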
Installation Options
Cortex offers versatile installation options catering to different user needs:
- Local Installer: This option packages all necessary dependencies, allowing you to install without an internet connection.
- Network Installer: Alternatively, this smaller installer downloads required dependencies during installation, ensuring everything is up-to-date.
Installation is supported on Windows, macOS, and Linux, with specific executables provided for each platform.
Usability and Accessibility
Command-Line Interface (CLI)
Once installed, Cortex.cpp can be run directly from the command line. Users can pull, run, and stop models with simple commands; detailed quickstart and CLI documentation are available for guidance.
Example commands include:
cortex pull llama3.2          # download the model to local storage
cortex run llama3.2           # load the model and start an interactive session
cortex models stop llama3.2   # unload the model and free its memory
REST API
Cortex.cpp also exposes a REST API served locally, allowing integration with existing systems through standard web protocols. Comprehensive API documentation helps developers leverage this feature.
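For illustration, here is a minimal request sketch. It assumes the server is already running, uses the default local port from the current documentation (39281), and targets the OpenAI-compatible chat completions endpoint; verify both against the API docs for your version:

curl http://127.0.0.1:39281/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3.2",
    "messages": [{"role": "user", "content": "Hello from Cortex!"}]
  }'

Because the endpoint follows the OpenAI schema, existing OpenAI client libraries can typically be pointed at the local server by changing only the base URL.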
Model Management
Cortex.cpp stands out with its flexibility in pulling models from multiple hubs, including Hugging Face and its built-in registry. The documentation notes the memory required for each model size, so users can pick one that fits their hardware.
Additionally, Cortex.cpp supports multiple quantization levels, letting users trade model quality against memory and compute requirements; see the example below.
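As an illustration of both points (the built-in model name matches the earlier examples; the Hugging Face repository shown is a hypothetical placeholder, not a verified model ID):

cortex pull llama3.2                               # built-in registry; the CLI typically prompts you to pick a quantization
cortex pull bartowski/Llama-3.2-3B-Instruct-GGUF   # pull a GGUF repository directly from Hugging Face (placeholder name)

As a rule of thumb, lower-bit quantizations (e.g., Q4) shrink memory use at some cost in output quality, while higher-bit variants (e.g., Q8) preserve more quality but need more RAM.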
Advanced Installation and Updates
Cortex.cpp also caters to those who want to explore cutting-edge features through:
- Beta & Nightly Builds: These preview versions let advanced users test the latest developments and provide valuable feedback.
- Building from Source: Instructions cover the compilation process on Windows, macOS, and Linux, providing flexibility for customization and advanced configurations; see the sketch below.
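The authoritative build steps live in the repository's documentation; as a rough sketch of a typical CMake-based flow (the repository URL matches the project's GitHub organization, but the exact configure flags and targets are assumptions):

git clone https://github.com/janhq/cortex.cpp
cd cortex.cpp
cmake -B build                         # configure (actual presets/flags per the repo's build docs)
cmake --build build --config Release   # compile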
Uninstallation Process
Removing Cortex.cpp is straightforward, with procedures varying slightly by operating system to ensure complete removal from the user's system.
Support and Community
Cortex.cpp encourages active community engagement through multiple support channels:
- Report issues or feature requests via GitHub.
- Join discussions and get answers on Discord.
- Reach out through email for detailed inquiries.
In summary, Cortex.cpp is a robust, adaptable, and user-friendly platform for running AI models locally, with strong community and resource support. Its comprehensive features cater to diverse deployment needs, making it a powerful tool for AI practitioners and developers.