autotab-starter - AI-Driven Automation Tool for Browser Tasks

Discover autotab-starter: A Creative Approach to Browser Automation

The autotab-starter project offers a novel way to simplify browser automations by utilizing artificial intelligence. This amazing tool allows users to seamlessly transition from simple point-and-click actions within a browser to fully functional and executable code for those actions in mere seconds. The development team is continuously enhancing autotab, bringing fresh features to improve user experience. Although still in its alpha phase, autotab is growing steadily, promising interesting new updates.

Getting Started with autotab

Setting up autotab is quick and easy, typically taking only 5-10 minutes. Users need to have the Chrome browser installed and it is advised to set up a Python virtual environment for optimal use. Here's a quick overview to get you started:

Clone the autotab-starter repository from GitHub and navigate to the directory.

Install the necessary dependencies using:

make install
brew install --cask chromedriver

Configuration

Configuration is straightforward. Users need to create a .autotab.yaml file to manage credentials, following the guide provided in the .example.autotab.yaml file. This step usually takes about three minutes.

Running the Tool

To begin recording browser automations, users can simply run autotab record. This command opens a new browser window, enabling users to log in using an autotab_api_key, which can be obtained for free from the autotab dashboard. This key is essential for unlocking the full potential of the autotab extension.

Using autotab

Recording Automations

Initiating a new automation involves running the autotab record command. For users who prefer specificity, an optional --agent <agent_name> argument can be provided to open a controlled Chrome session. If the extension's sidepanel does not open automatically, it can be accessed using the shortcut Command - Shift - Y.

Users can toggle between record mode and select mode using Command - E and Command I respectively. Record mode focuses on capturing clicks and typing, while select mode allows users to select elements for interaction, like hovering or copying text.

Once recording is complete, ensure all code is copied and saved. The tool will generate a Python file in the agents/ directory with boilerplate code for the automation.

Playing Automations

To execute a recorded automation, users can use the command autotab play --agent <agent_name>. If the agent name is omitted, it defaults to running agents/agent.py. Playing the automation simply runs a Python script, which can be debugged similarly to any other Python execution.

Setting Up Environment

Installation

The installation process detailed in the "Quickstart" section above also includes dependencies for executing the autotab record and autotab play commands effectively.

Managing Secrets

A critical initial step is creating a .autotab.yaml file, populated according to the .example.autotab.yaml. This setup includes handling cookies for logging into Google, with careful management to ensure security.

Important Points to Note

This repository is offered "as-is," and potential users should thoroughly review all code. If using autotab for scraping, it’s vital to adhere to the target website’s Terms of Service and guidelines. The tool should not be used for unauthorized or illegal activities. Additionally, while using autotab record, user data such as application state, DOM, and model responses are logged, although users can opt-out of this via settings.

Join the autotab community on their Discord channel to stay updated with the latest developments and connect with fellow autotab enthusiasts. With the exciting evolution of autotab, there is much to anticipate in this dynamic area of browser automation!