Introduction to the Index-1.9B Project
Overview
The Index-1.9B project is part of the Index series and serves as its lightweight member. The suite includes several variants, each tailored to a specific task while remaining competitive for its size. The main models are:
- Index-1.9B Base: This foundational model carries 1.9 billion non-embedding parameters and has been pre-trained on a corpus of roughly 2.8 trillion tokens, primarily Chinese and English. It outperforms other models of comparable size on a range of evaluation benchmarks.
- Index-1.9B Pure: A control version of the base model with all instruction-related data filtered out of its training corpus, allowing the influence of instruction data on benchmark performance to be measured.
- Index-1.9B Chat: Aimed at conversational tasks, this variant is aligned through supervised fine-tuning (SFT) and direct preference optimization (DPO). Because a large amount of internet community content was included in pre-training, it offers engaging chat abilities and strong translation across multiple languages, particularly East Asian ones (a minimal usage sketch follows this list).
- Index-1.9B Character: This model adds retrieval-augmented generation (RAG) to enable few-shot role-playing customization, helping users simulate different personas in various contexts.
- Index-1.9B-32K: Despite having only 1.9 billion parameters, this model supports a 32K-token context window, long enough to process documents of over 35,000 words in a single pass.
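For a concrete sense of how the Chat variant might be used, here is a minimal sketch built on Hugging Face transformers. The repository id `IndexTeam/Index-1.9B-Chat`, the `trust_remote_code` flag, and the generation settings are assumptions based on common practice, not the project's documented invocation; check the model card for exact details.

```python
# Minimal sketch: chatting with the Index-1.9B Chat variant via transformers.
# Repo id and generation settings are assumptions; see the model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "IndexTeam/Index-1.9B-Chat"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, trust_remote_code=True
)

# Build a chat-formatted prompt; most chat checkpoints ship a chat template.
messages = [{"role": "user", "content": "Introduce yourself in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

# Generate a reply and decode only the newly produced tokens.
outputs = model.generate(inputs, max_new_tokens=128, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```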
Recent Updates
Notable recent updates for the Index-1.9B project include:
- The release of the open-source 32K long-context model Index-1.9B-32K, accompanied by a detailed technical report.
- Adaptation to llama.cpp and Ollama, so the models can run in those inference runtimes (a brief sketch follows this list).
- Open-source release of a checkpoint saved before learning-rate decay, for research into the effects of training at a constant learning rate.
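As a rough illustration of the llama.cpp route, the sketch below uses the llama-cpp-python bindings with a GGUF conversion of the model. The file name `index-1.9b-chat.gguf` is a placeholder (you would need to convert or obtain a GGUF checkpoint yourself), and the context size shown is arbitrary.

```python
# Hedged sketch: running a GGUF conversion of Index-1.9B with llama-cpp-python.
# "index-1.9b-chat.gguf" is a placeholder path, not a file shipped by the project.
from llama_cpp import Llama

llm = Llama(model_path="index-1.9b-chat.gguf", n_ctx=4096)
result = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Say hello in Chinese and English."}],
    max_tokens=64,
)
print(result["choices"][0]["message"]["content"])
```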
Evaluation Performance
The models in the Index-1.9B series have been tested against other models across several benchmarks. Evaluation metrics include average English scores, knowledge and comprehension benchmarks (MMLU, CEVAL, CMMLU), and other standardized tests such as HellaSwag, ARC-C, and ARC-E. The Index-1.9B variants often outperform larger models on specific benchmarks, demonstrating their effectiveness despite their smaller size.
Model Accessibility and Usage
These models are available on platforms like Hugging Face and ModelScope, where they can be downloaded for various uses, including chat, role-playing, and long-context processing (a download sketch follows below). For those who want to explore further, the project provides environment setup instructions, examples of interacting with the models through different tools, and options for fine-tuning custom versions.
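As one hedged example of fetching a checkpoint, the snippet below uses `huggingface_hub` to download a variant locally. The repository id is an assumption; substitute whichever model (Base, Pure, Chat, Character, or 32K) you actually need.

```python
# Minimal sketch: downloading an Index-1.9B checkpoint from Hugging Face.
# The repo id is assumed; replace it with the variant you want.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(repo_id="IndexTeam/Index-1.9B-Chat")
print(f"Model files downloaded to: {local_dir}")
```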
A Note on Limitations
The developers of Index-1.9B caution users about potential inaccuracies or biases in generated content, emphasizing the importance of independent verification and responsible usage. The open-source license supports academic and free commercial use, provided users comply with outlined licensing agreements.
Model Significance
The Index-1.9B project shows that compact models can deliver strong performance while remaining versatile across languages and use cases. Its long-context handling and dialogue capabilities point to broader applications for small language models in processing long documents and building engaging conversational systems.