scrape data tiered

Questions and answers about anything related to Helium Scraper
Post Reply
crookedleaf
Posts: 38
Joined: Tue Dec 11, 2012 6:44 pm

scrape data tiered

Post by crookedleaf » Fri Jan 18, 2013 1:27 am

i'm running into a bit of a problem programming a scrape. the scrape extracts data about reservations people have made with us. i think the problem is when people have more than one room reserved. when there is only one room, the data extracts perfectly. a single room reservation is as seen here:
Capturesingle.JPG
Capturesingle.JPG (106.91 KiB) Viewed 6803 times
here is an example of a reservation that has multiple rooms booked:
Capturemulti.JPG
Capturemulti.JPG (146.88 KiB) Viewed 6803 times
here is what it looks like when the data is extracted though on a reservation with multiple rooms:
output.jpg
output.jpg (54.82 KiB) Viewed 6803 times
as you can see, the data ends up being tiered. is there any way to get it to show up in one row based on occurrence, or some other way?

m1231e
Posts: 5
Joined: Wed Jun 06, 2012 6:50 am

Re: scrape data tiered

Post by m1231e » Sat Jan 19, 2013 12:33 pm

I have run into same problem. In my case it is something about how I have defined the kind. Scrape worked fine until I reset the kind 'address' and 'country'.
When I ran the scrape originally it wasn't pulling data in any fields past these fields in some records. After I noticed that, I went to those records and added the values of those fields to the kinds. Then when I ran the scrape, the data table was tired.

When I run the scrape without these kinds, table is filled correctly.

May not be the same problem, but I;d love to know what I've done wrong and hat I need to change. Thanks!

m1231e
Posts: 5
Joined: Wed Jun 06, 2012 6:50 am

Re: scrape data tiered

Post by m1231e » Sat Jan 19, 2013 12:36 pm

I can't attach the scrape, exceeds file size. ???

webmaster
Site Admin
Posts: 499
Joined: Mon Dec 06, 2010 8:39 am
Contact:

Re: scrape data tiered

Post by webmaster » Wed Jan 23, 2013 3:26 am

Use the Force elements into same row premade at File -> Online Premades. As the Heading kind you would use the Room 1, Room 2, etc. items (this will make sense after you read the project's description). Remember that you'll need to recreate all the kinds you use inside these pages (the ones the give you broken data), except the Heading kind, after running the Do Wrap actions tree (again, this will make sense once you read the project's description) and also perform the extraction after running this tree. What the Wrap tree does (and hence the Do Wrap tree does) is modify the HTML so that all elements under each Heading kind are under the same HTML element so that Helium Scraper knows they belong in the same row.

@m1231e: I believe this should apply to your case too. If you need to send a file to big try using this service.
Juan Soldi
The Helium Scraper Team

crookedleaf
Posts: 38
Joined: Tue Dec 11, 2012 6:44 pm

Re: scrape data tiered

Post by crookedleaf » Wed Jan 23, 2013 3:48 pm

Thank you so much for the response. When running the "force elements into same row" action tree and setting the Kind to the header (Room 1, Rom 2, etc.), I get the following error message: "Message from webpage: Unknown Runtime Error."
Last edited by crookedleaf on Wed Jan 23, 2013 4:28 pm, edited 1 time in total.

eden
Posts: 2
Joined: Mon Oct 22, 2012 12:05 am

Re: scrape data tiered

Post by eden » Wed Jan 23, 2013 4:04 pm

I get the message 'Invalid target element"

http://www.sendspace.com/file/m0hidx

webmaster
Site Admin
Posts: 499
Joined: Mon Dec 06, 2010 8:39 am
Contact:

Re: scrape data tiered

Post by webmaster » Thu Jan 24, 2013 7:49 am

I've updated the premade and shouldn't cause those errors now. Just delete the "Wrap" actions tree and import it again.
Juan Soldi
The Helium Scraper Team

crookedleaf
Posts: 38
Joined: Tue Dec 11, 2012 6:44 pm

Re: scrape data tiered

Post by crookedleaf » Thu Jan 24, 2013 6:26 pm

thank you so much for the update. after running the action, it causes almost the whole page to go blank. here is a pastebin of the HTML of the page as normal:

http://pastebin.com/pKJgdCPP

crookedleaf
Posts: 38
Joined: Tue Dec 11, 2012 6:44 pm

Re: scrape data tiered

Post by crookedleaf » Thu Jan 24, 2013 7:36 pm

actually it seems be working now! thank you so so much!

Post Reply