Tag: web scraping

July 26, 2019

Introducing Common Crawler

By Juan Soldi Miscellaneous 0 Comments

Common Crawler is a free version of Helium Scraper that, instead of loading pages from the web, it loads them from the Common Crawl database. Aimed at both developers and non-developers, it makes it easy to query the common crawl data and then create selectors and actions that extract structured data from the target HTML

December 11, 2018

The Web Scraping Dilemma

By Juan Soldi Miscellaneous 0 Comments

The web scraping community seems to be divided into two sub-worlds. One is the world of programmers, who would often use Python or JavaScript to carefully craft their agents down to the details in a time consuming but ultimately rewarding process. And the other is the world of layman users, who must choose between a