Manta
Posted: Thu Sep 29, 2011 11:10 pm
I'm attempting to scrape Manta, but can't seem to get past the way the data is given from Manta.
I specify the data I want at Manta and it returns a page of links. Each link contains the data I want and I can scrape it - but none of these pages has a next link. This is not a problem since I'm working off the original page of links. However, when I get to the last link, there is nowhere to go. If I select Kind:Next and specify at least one, Helium stops because it's not there. If I go back to the starting page and select kind Next, I simply collect the 2nd page of data over and over. (At least I think that's whats going on...) I tried to use the PageNumberProperties, but haven't figured out how to incorporate that - I got it imported, but have not found documentation on how to use it yet. If I manually select page 2, page 3 and so on and then run the actions (w/o the got starting page) it works - but that's a bit tedious. Help would be appreciated!
Thanks.
I specify the data I want at Manta and it returns a page of links. Each link contains the data I want and I can scrape it - but none of these pages has a next link. This is not a problem since I'm working off the original page of links. However, when I get to the last link, there is nowhere to go. If I select Kind:Next and specify at least one, Helium stops because it's not there. If I go back to the starting page and select kind Next, I simply collect the 2nd page of data over and over. (At least I think that's whats going on...) I tried to use the PageNumberProperties, but haven't figured out how to incorporate that - I got it imported, but have not found documentation on how to use it yet. If I manually select page 2, page 3 and so on and then run the actions (w/o the got starting page) it works - but that's a bit tedious. Help would be appreciated!
Thanks.