Anyone who tries to scrape some data from the web sooner or later faces the need to use proxies to bypass certain restrictions and make the process more streamlined and smooth. Moreover, if you’re dealing with large volumes of information, you absolutely must use a proxy. Otherwise, you simply won’t be able to reach your
There are many operators in Helium Scraper, but the plus (+) operator deserves its own tutorial, given the number of uses it has. This is because it doesn’t just represent addition, but also concatenations of strings and sequences. Simple Cases Helium Scraper will treat the operator differently, depending on the type of data that is
Common Crawler is a free version of Helium Scraper that, instead of loading pages from the web, it loads them from the Common Crawl database. Aimed at both developers and non-developers, it makes it easy to query the common crawl data and then create selectors and actions that extract structured data from the target HTML
The web scraping community seems to be divided into two sub-worlds. One is the world of programmers, who would often use Python or JavaScript to carefully craft their agents down to the details in a time consuming but ultimately rewarding process. And the other is the world of layman users, who must choose between a
Note: This post was written for an older version of Helium Scraper. Gatherers are the eyes of Helium Scraper. And JavaScript gatherers are its user-customized eyes. Let me give you a quick example. I had a user having troubles with a kind that was supposed to select a “next” button in a page. It worked
Note: This post was written for an older version of Helium Scraper. I’m assuming you already have a little JavaScript knowledge. If not, here is a quick JavaScript tutorial that covers all you need to know for the purpose of this tutorial. I’m also assuming you have some experience working with Helium Scraper. In Helium
This is a quick javascript tutorial for total non programmers. I won’t focus on javascript as applied to webpages, which is the case for most tutorials, because I’m mainly considering Helium Scraper users. So this tutorial comes handy if you want to learn javascript without necessarily caring about how to design web pages. First off,