AI-Auto-Video-Generator Project Overview
AI-Auto-Video-Generator is an innovative tool that revolutionizes the way users create storytelling videos by utilizing advanced artificial intelligence technologies. This project seamlessly combines various AI-powered elements such as text generation, image creation, voiceover synthesis, and video editing into a singular, streamlined process.
Key Features
-
Story Generation with GPT-3: Users start by providing a story prompt. This could be a simple idea or a more detailed narrative. The project employs OpenAI's GPT-3, a powerful AI model, to develop these prompts into fully fleshed-out stories.
-
Image Creation with DALL-E: To accompany the story, AI-Auto-Video-Generator uses OpenAI's DALL-E to generate stunning visuals. This AI model is capable of creating detailed images from textual descriptions, enriching the storytelling experience.
-
Voiceover Through ElevenLabs API: The project ensures that each video has a professional-sounding voiceover. Using the ElevenLabs API, it converts the generated text story into a natural-sounding voice, further enhancing the video’s appeal.
-
Video Compilation: Finally, all elements—story, images, and voiceover—are combined into a cohesive video, creating an engaging storytelling medium ready for sharing with audiences.
Getting Started
Prerequisites
To run this project on a local machine, certain software and tools are needed:
- Python: Version 3.6 or higher is required.
- Pip: Python’s package manager is essential for installing required Python libraries.
- FFmpeg: A powerful tool that is necessary for video processing.
Setup Instructions
- Cloning the Repository: Start by cloning the project's GitHub repository to access the source code and resources.
- Virtual Environment: It's crucial to set up a virtual environment to manage dependencies and avoid version conflicts.
- Install Required Packages: Execute the installation script to load all essential Python packages.
- FFmpeg Installation: Ensure FFmpeg is installed correctly by using platform-specific package managers or downloading it from its official website.
- Download SpaCy Language Model: SpaCy is used for natural language processing, and the English language model needs to be downloaded.
Usage Instructions
- API Keys Configuration: The project requires access to OpenAI and ElevenLabs services. Users must insert their API keys into the configuration file named
.env
. - Font Customization: Users can personalize video captions by specifying font type and location.
- Running the Generator: By executing the script
main.py
, users can interact with the program to input a story prompt and generate their video. - Output: The completed video will be saved in the project directory.
Troubleshooting
To ensure smooth operation, a few common issues and solutions are highlighted:
- If dependencies are missing, rerun the package installation command.
- Ensure FFmpeg is installed and accessible via the system's command line interface.
- Visit the respective websites for OpenAI and ElevenLabs to manage API keys and account settings.
Customization and Advanced Usage
The project includes instructions for customizing output, such as generating haikus or specific factual content (like bee facts), altering voiceover settings, and changing visual elements of the video. Users can adjust model settings, text overlays, and even the duration of displayed images to tailor the video to their preferences.
Overall, AI-Auto-Video-Generator offers a versatile and innovative platform for creating AI-enhanced storytelling videos, making advanced AI technologies accessible and useful for creative endeavors.