Introduction to Google-Images-Download Project
The Google-Images-Download project, revamped and maintained by Ultralytics, offers a robust image scraping tool focusing on Bing rather than Google. This useful tool is perfect for those who need to gather images for applications such as machine learning, data analysis, or personal projects. Originally derived from the work at hardikvasa/google-images-download, it is continually updated to ensure efficiency and usability.
Docker Execution
For those interested in running the tool through Docker, simple instructions are available. By pulling the Docker image and running a basic command, users can quickly set up their image searcher, as detailed in the Docker GitHub repository.
$ docker run -d -p 80:80 --name image_searcher saitamatechno/google_images_download:v1.0
Requirements
To get started with the project, ensure that Python 3.8 or a later version is installed. Acquiring necessary dependencies is straightforward; these can be installed using the provided requirements.txt
file, which lists essential packages, including Selenium.
$ pip install -r requirements.txt
Installation Steps
Setting up the image scraper involves a few easy steps:
- Clone the repository to your local machine.
- Install the required dependencies.
$ git clone https://github.com/ultralytics/google-images-download
$ cd google-images-download
$ pip install -r requirements.txt
How to Run the Scraper
To run the image scraper, make sure you have Google Chrome installed. You'll also need the appropriate version of Chromedriver that corresponds to your Chrome version.
Execute the Python script using either a URL to fetch images or search terms for broader image collection. Specify parameters such as limit and download path, and if necessary, detailed logs will guide you through the process, showing both progress and errors.
Example with a URL:
$ python3 bing_scraper.py --url 'https://www.bing.com/images/search?q=flowers' --limit 10 --download --chromedriver /path/to/your/chromedriver
Example with search terms:
$ python3 bing_scraper.py --search 'honeybees on flowers' --limit 10 --download --chromedriver /path/to/your/chromedriver
Images are saved in the ./images
directory by default.
Citing the Project
When using this tool in academic work or publications, users are encouraged to credit the original project, which is available on its GitHub repository.
Contributing
Community contributions are welcomed and invaluable for this open-source project. Whether it's reporting issues, suggesting features, or contributing code, interested parties can consult the Contributing Guide to get involved. Feedback is also appreciated via the Ultralytics Survey.
Licensing
The project offers two types of licenses:
- AGPL-3.0 License: Best suited for non-commercial use, fostering open collaboration.
- Enterprise License: Provides commercial users more flexibility away from the restrictions of AGPL-3.0. Contact Ultralytics for more information on enterprise solutions.
Contact and Support
For issues, questions, or to suggest features, users can use the GitHub Issues page. For broader discussions and community interaction, Ultralytics offers a supportive Discord community.