news-please
News-please is an open-source tool designed for efficient extraction of news articles from websites, supporting recent and archived content. Utilizing advanced libraries such as Scrapy and Newspaper, it enables precise extraction and offers functionality as both a command-line tool and a Python library. The tool supports storage in JSON, PostgreSQL, and ElasticSearch, facilitating the management of extensive news datasets. Additionally, explore its related projects for sentiment analysis and event extraction to enhance your news analysis.