Basic Question

Posts: 4
Joined: Sat Aug 13, 2011 10:00 am

Basic Question

Post by tipud » Fri Oct 21, 2011 3:11 pm

Hi, I am a newbie here.

I am at the beginning of the learning curve, so please be patient with me.

I am trying to scrape a school listing site.
1. All the schools are listed by County / State.
2. When a County has been selected, I get a list of all the schools in that County.
3. The data I want to extract is on each school's detail page.

I know how to extract the data.
What I need to know is how to get HS to go from the search list to a detail page, extract the data, and then move on to the next item in the search list.
If someone can point me to a tutorial or an answer to this, I would be grateful.

Site Admin
Posts: 501
Joined: Mon Dec 06, 2010 8:39 am

Re: Basic Question

Post by webmaster » Sat Oct 22, 2011 12:11 am


If your list is a list of links, you can just use a Navigate Each action, put an Extract action inside this Navigate Each action, and then put underneath your Navigate Each action a Navigate action that navigates to the next page. Then you would set your Repeat action to repeat this whole process as many times as the number of pages you want to go through. This is how it would look:
[Attachment: Sample.jpg]
This is precisely what the video on our homepage shows how to do.
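For anyone who finds it easier to read as code, here is a rough Python sketch of the control flow that this action tree describes. It is not Helium Scraper's API or how the program is implemented; the in-memory "site" and the names LIST_PAGES, DETAIL_PAGES, extract and scrape are all made up for illustration.

```python
# Hypothetical in-memory "site" (an assumption for illustration only):
# each list page has links to detail pages and a pointer to the next list page.
LIST_PAGES = {
    "county?page=1": {"links": ["school/a", "school/b"], "next": "county?page=2"},
    "county?page=2": {"links": ["school/c"], "next": None},
}
DETAIL_PAGES = {
    "school/a": {"name": "School A"},
    "school/b": {"name": "School B"},
    "school/c": {"name": "School C"},
}


def extract(detail_url):
    """Stand-in for the Extract action: read the data on one detail page."""
    return DETAIL_PAGES[detail_url]


def scrape(start_url, repeat_times):
    """Mimic Repeat -> (Navigate Each -> Extract), then Navigate to next page."""
    rows = []
    current = start_url
    for _ in range(repeat_times):        # Repeat action
        page = LIST_PAGES[current]
        for link in page["links"]:       # Navigate Each: visit every link in the list
            rows.append(extract(link))   # Extract: runs once per detail page
        if page["next"] is None:         # no further list page, so stop early
            break
        current = page["next"]           # Navigate: go to the next list page
    return rows


if __name__ == "__main__":
    for row in scrape("county?page=1", repeat_times=2):
        print(row["name"])
```

The point of the sketch is the nesting: the extraction sits inside the loop over links, and the move to the next page happens once per pass of the outer repeat.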

You could also use the Go Through All Pages premade, which you can access from the Execute actions tree -> More item in the New action button. In this case, this is how your actions would look:
[Attachment: Sample2.jpg]
The Go Through All Pages premade would be set to navigate through your "Links" kind. Notice that the Repeat action is set to repeat only one time in this last example.
Juan Soldi
The Helium Scraper Team

Re: Basic Question

Post by tipud » Sat Oct 22, 2011 5:24 pm

Hi Juan

Thank you for that, it worked.

I tried the video but I didn't get it.

Now I think I've got the hang of it.

Thanks again
