Scrapy
Scrapy is a free, open-source, high-level Python framework for fast web crawling and web scraping. It can be used for a variety of tasks, from extracting structured data from web pages to automated testing.
Scrapy is popular with development teams of all sizes, from small groups to enterprises, that need to crawl and scrape web pages for use cases ranging from indexing to data analytics.
Scrapy’s “batteries-included” philosophy makes it easy to work with: developers do not have to worry about link gathering, data export, or persistence, so they can focus on the logic of what data to gather and how to process it.
Scrapy was initially developed by Zyte [1], which still maintains it together with many other contributors who continue to develop the framework.
Authors
Master's student at TU Delft, following the Artificial Intelligence Technology Track