ollama-voice-mac - Completely Offline Mac Voice Assistant Integrating Mistral 7b and Whisper

Ollama Voice Mac Project Introduction

Overview

The Ollama Voice Mac project offers an offline voice assistant designed to operate on Mac systems. It utilizes two powerful tools: the Mistral 7b model through Ollama and Whisper speech recognition models. Originally inspired by the foundational work from another project, this version enhances its compatibility with Mac computers, incorporating several upgrades to improve functionality and user experience.

Key Features

Offline Functionality: Ollama Voice Mac operates completely offline, ensuring privacy and independence from internet connectivity.
Compatibility: Specifically designed for Mac users, offering a seamless integration into the Apple ecosystem.
Customizable Language Support: Users have the option to configure the assistant to recognize and operate in various languages by modifying configuration settings.

Installation and Setup

Embarking on the journey to bring Ollama Voice Mac to life involves a straightforward installation process:

Install Ollama: Begin by downloading Ollama on your Mac, which serves as the foundational platform for the assistant.
Mistral 7b Model: Retrieve this AI model using the command ollama pull mistral.
Whisper Model: Obtain a compatible OpenAI Whisper Model, like the base.en version, which provides robust speech recognition capabilities.
Repo Cloning: Clone the project repository to your local machine.
Model Placement: Place the downloaded Whisper model within a directory labeled /whisper at the root of the cloned repository.
Essential Tools: Ensure the installation of Python and Pip, which are vital for running the assistant's software.
Apple Silicon Compatibility: For those using Apple silicon, install Homebrew and execute brew install portaudio to facilitate the PyAudio library.
Dependencies Installation: Run pip install -r requirements.txt to secure necessary dependencies.
Launching the Assistant: Use python assistant.py to start the voice assistant.

Enhancing Voice Quality

To achieve a more refined auditory experience, users can upgrade the default voice settings:

Navigate to System Settings, then Accessibility, followed by Spoken Content.
Opt for Manage Voices and locate "Zoe (Premium)" among the English voices. Download and select this option as the system voice for a richer output quality.

Multilingual Capabilities

Users looking to expand the assistant’s language ability can modify the assistant.yaml configuration file. By adjusting the modelPath and incorporating a Whisper model in the desired language, the assistant can effectively communicate in languages other than English.

This project empowers users with a robust, private, and fully offline voice assistant experience tailored for Mac users, with flexibility for language customization and improved voice outputs.