Click Next

Questions and answers about anything related to Helium Scraper
Christos
Posts: 9
Joined: Tue Apr 24, 2012 10:07 am

Click Next

Post by Christos » Tue Apr 24, 2012 10:37 am

How do I click "Next" when there is no Next button?
1. When the pages are listed as [1-10][11-20][21-30][31-40].
2. When the page uses tabs, as at http://stuffgate.com/wordpress.com, where three tabs appear under the title "WHOIS Information".

webmaster
Site Admin
Posts: 521
Joined: Mon Dec 06, 2010 8:39 am
Contact:

Re: Click Next

Post by webmaster » Wed Apr 25, 2012 3:16 am

Hi,

I couldn't find these links at the URL you provided. Anyway, you could try importing the Turn pages without a "Next" button premade from File -> Online Premades. Then create a kind that selects the link that takes you to the next page, no matter which page you are on (there is some more info in the project's description), and use it as your "Next" button. Think of it as a "Next" button whose text changes on each page.
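
To illustrate what "the link that takes you to the next page" means for a pager such as [1-10][11-20][21-30][31-40], here is a minimal Python sketch of the selection rule. It is not Helium Scraper code. The names `PagerItem` and `next_link_index` are hypothetical, and the sketch assumes the current page is rendered as plain text (not a link) while every other entry is a link, which may not hold on every site.

```python
from typing import Optional, Sequence, Tuple

# Hypothetical model of one pager entry: (label, is_link).
# Assumption: the entry for the page you are on is NOT a link.
PagerItem = Tuple[str, bool]


def next_link_index(items: Sequence[PagerItem]) -> Optional[int]:
    """Return the index of the first link after the current entry.

    Returns None when there is no current entry or no link after it,
    which is how the last page is detected in this sketch.
    """
    current = None
    for i, (_, is_link) in enumerate(items):
        if not is_link:
            current = i  # first non-link entry is taken as the current page
            break
    if current is None:
        return None
    for i in range(current + 1, len(items)):
        if items[i][1]:
            return i
    return None


# Example: the user is on [11-20], so the link to follow is [21-30].
pager = [("[1-10]", True), ("[11-20]", False), ("[21-30]", True), ("[31-40]", True)]
target = next_link_index(pager)
```

The rule depends only on position relative to the current entry, so it keeps working even though the label of the "next" link changes on every page.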

Regarding the tabs, try creating a kind that selects the links on those tabs (you might need to select a tab and then press the Select Parent button in the selection panel until the OuterHTML starts with "<A"), and then use a Navigate Each with Simulate Click turned on to navigate through them.
Juan Soldi
The Helium Scraper Team

sjunk111
Posts: 4
Joined: Wed Nov 21, 2012 6:51 am

Re: Click Next

Post by sjunk111 » Wed Nov 21, 2012 7:00 am

How do I scrape pages whose pager looks like this:

1 2 3 4 5 6 7 8 9 10 ...

Here the "..." is a link to another set of ten, which is shown as ... 11 12 13 14 15 16 17 18 19 20 ...

In that set, the leading "..." goes back to the previous set of ten and the trailing "..." goes to the next set of ten.

My data set has over 10,000 records with a maximum of 10 records per page, so there are a lot of pages.

http://kepler.sos.ca.gov/cbs.aspx

Do a search under Corporations - you can use the term "Avenue" as a test. It should return 3,309 records...

I tried importing Go Through All Pages, but it stops after reaching the 11th page, that is, after following the last link in 1 2 3 4 5 6 7 8 9 10 ..., the one shown as "...".

Thanks.

sjunk111
Posts: 4
Joined: Wed Nov 21, 2012 6:51 am

Re: Click Next

Post by sjunk111 » Thu Nov 22, 2012 5:39 am

Never mind, I figured this out. I had to create another kind that selects only the "..." link (to act as the Next button), run Go Through All Pages using that kind, and then navigate through each of the numbered pages.
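
The two-level approach above can be sketched in Python as an outer loop that follows the trailing "..." and an inner loop over the numbered links in each set. This is only a model of the traversal order, not Helium Scraper code. The functions `pager_labels` and `visit_order` are hypothetical, and the sketch assumes that following the trailing "..." lands on the first page of the next set (as described for page 11 above).

```python
from typing import Iterator, List


def pager_labels(current: int, total: int, set_size: int = 10) -> List[str]:
    """Labels shown in the pager while page `current` is open (hypothetical model)."""
    first = ((current - 1) // set_size) * set_size + 1
    last = min(first + set_size - 1, total)
    labels = [str(n) for n in range(first, last + 1)]
    if first > 1:
        labels.insert(0, "...")  # leading "..." goes back to the previous set
    if last < total:
        labels.append("...")  # trailing "..." goes on to the next set
    return labels


def visit_order(total: int, set_size: int = 10) -> Iterator[int]:
    """Yield page numbers in the order the two nested loops would visit them."""
    current = 1
    while True:
        labels = pager_labels(current, total, set_size)
        # Inner loop: every numbered page in the current set.
        for label in labels:
            if label != "...":
                yield int(label)
        # Outer loop: follow only the trailing "...", never the leading one.
        if labels[-1] != "...":
            return  # no trailing "...", so this was the last set
        current = int(labels[-2]) + 1  # assumed landing page of the next set


pages = list(visit_order(25))
```

The detail that matters is that the "Next" kind must match only the trailing "...". If it also matched the leading one, the outer loop could step back to the previous set instead of forward.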
