embedditor - Open Source Editor for Efficient Embedding and Vector Search Management

Introducing Embedditor: Revolutionizing Vector Search

Overview of Embedditor

Embedditor positions itself as the open-source equivalent of MS Word, tailored for embedding tasks. It is a pre-reprocessing editor that enables users to manipulate GPT / LLM embeddings as though they are working with text documents. Consequently, users can optimize their vector searches while significantly minimizing costs associated with embedding and vector storage.

Community and Support

Embedditor encourages users to join its thriving community for support and interaction. The platform can be accessed via various channels, including its website, Discord, Twitter, and extensive documentation.

Key Features

Rich Editor Interface

Effortlessly join and split text chunks with a few clicks.
Manipulate embedding metadata and tokens effectively.
Exclude unnecessary words or sentences from embedding.
Choose specific parts of a chunk for embedding.
Augment embeddings with additional information such as URLs or images.
Generate aesthetically pleasing HTML-markup for AI search outcomes.
Save pre-processed embedding files in convenient formats like .veml or .json.

Pre-processing Automation

Automate noise filtering by excluding punctuations and stop-words.
Utilize TF-IDF to remove insignificant, frequently used words from embeddings.
Normalize embedding tokens to enhance vectorization.

Benefits of Using Embedditor

With a user-friendly spreadsheet-like interface, Embedditor enhances the user's experience by:

Ensuring content relevancy retrieved from vector databases.
Boosting efficiency and accuracy in AI or LLM-related applications.
Offering visually engaging search results with images and links.
Delivering cost efficiency, potentially reducing embedding and vector storage expenses by up to 30%.
Providing full control over data with the ability to deploy Embedditor locally.
Facilitating the use of pre-processed or ready embeddings in various vector databases like LangChain and Chromat.

Quick Start

Prospective users can try Embedditor by signing up for free on IngestAI. The platform also provides an accessible dashboard at http://localhost:8080/.

Installation Guide

Duplicate the .env.example file and rename it to .env.
Enter the required OPENAI_API_KEY in the .env file.
Set up the project using the following commands:
- php artisan migrate
- php artisan db:seed
- php artisan storage:link

Embedditor promises to be a revolutionary tool in the realm of embedding and vector search, providing users with powerful features to manage and optimize their data while reducing associated costs.