Maix-Speech - Compact AI Speech Library for Embedded Systems with ASR and TTS Support

Introduction to Maix-Speech

Overview

Maix-Speech is an artificial intelligence library designed for speech processing. It is a fast and compact library that is optimized to run on embedded devices as well as traditional PCs. The project incorporates various functionalities such as Automatic Speech Recognition (ASR), conversational capabilities (chat), and Text-to-Speech (TTS) synthesis.

Currently, Maix-Speech only supports the Chinese language. For more information and usage guidelines in Chinese, there is a dedicated 中文 guide available.

Building the Project

Cloning the Repository

To get started with the Maix-Speech project, you need to clone the repository from GitHub. This can be done using the following command:

git clone https://github.com/sipeed/Maix-Speech

Compilation Process

Maix-Speech can be compiled for different platforms such as x86x64 and R329. Here are the steps for each:

x86x64:

Navigate to the projects/asr directory:
```
cd projects/asr
```

Clean the configuration and set up the configuration menu:

python project.py clean_conf
python project.py menuconfig

Build the project:

python project.py build
# For more detailed output during the build process you can use:
# python project.py build --verbose

Run the compiled project:
```
./build/asr
```

If needed, clean up by running:

python project.py clean
python project.py distclean
# python project.py clean_conf

R329:

Navigate to the projects/asr directory:
```
cd projects/asr
```

Configure the toolchain:

python project.py --toolchain /opt/toolchain/bin --toolchain-prefix aarch64-openwrt-linux- config

Set up the configuration menu:
```
python project.py menuconfig
```
Build the project with the R329 toolchain:
```
python project.py build
```

Additional Resources

For more details on project structure and usage, the developers recommend visiting this GitHub project framework. This repository provides a robust framework for working with C and C++ projects, which is beneficial for expanding or customizing the Maix-Speech library.

Licensing

Maix-Speech is licensed under the Apache 2.0 License, which is a widely used open-source license that allows users to use, modify, and distribute the software with minimal restrictions.