Please Help

Questions and answers about anything related to Helium Scraper
Post Reply
Katedsimmons
Posts: 11
Joined: Fri Jun 17, 2011 4:44 pm

Please Help

Post by Katedsimmons » Fri Jun 17, 2011 4:52 pm

I am trying to scrape data from website to a simple list. But when I run the scrape it is copying every thing see below. I have checked each Kind and it only selects the appropriate category, name, address, ext. Then when I run it I end up with what is below. I have tried to change the extractions and It does not matter I end up with the same thing. I know it is something I am doing. Any help would be greatly appreciated.

Thank you!!

Bolnicka 25 Sarajevo Bolnicka 25 Bolnicka 25 Bolnicka 25
Sarajevo Sarajevo Sarajevo
Alija AGINCIC Alija AGINCIC Alija AGINCIC Alija AGINCIC
University of Sarajevo University of Sarajevo University of Sarajevo
71000 71000 71000
BOSNIA-HERZEGOVINA BOSNIA-HERZEGOVINA BOSNIA-HERZEGOVINA
387-33-213-403 387-33-213-403 387-33-213-403
387-33-213-403 387-33-213-403 387-33-213-403
alijaagincic@yahoo.com alijaagincic@yahoo.com alijaagincic@yahoo.com alijaagincic@yahoo.com
"Hutt Valley DHB
Lower Hutt Hospital High Street
Private Bag 31-907 " Wellington "Hutt Valley DHB
Lower Hutt Hospital High Street
Private Bag 31-907 " "Hutt Valley DHB
Lower Hutt Hospital High Street
Private Bag 31-907 " "Hutt Valley DHB
Lower Hutt Hospital High Street
Private Bag 31-907 "
Wellington Wellington Wellington
Hakan AGIR Hakan AGIR Hakan AGIR Hakan AGIR
Wellington Regional Plastic, Maxillofacial & Burns Unit Wellington Regional Plastic, Maxillofacial & Burns Unit Wellington Regional Plastic, Maxillofacial & Burns Unit
5040 5040 5040
NEW ZEALAND NEW ZEALAND NEW ZEALAND
90-262-303-8677 90-262-303-8677 90-262-303-8677
90-262-303-8003 90-262-303-8003 90-262-303-8003
agirhakan@yahoo.com agirhakan@yahoo.com agirhakan@yahoo.com agirhakan@yahoo.com

webmaster
Site Admin
Posts: 495
Joined: Mon Dec 06, 2010 8:39 am
Contact:

Re: Please Help

Post by webmaster » Fri Jun 17, 2011 7:22 pm

Hi,

Did you just copied and pasted that from your extraction results table? If so, there must be something wrong with the kinds. It seems to be extracting phones, emails, addresses, etc, all to the same column, and repeating this data among multiple columns. Make sure you are extracting the "InnerText" from every kind.

To see what's going on in detail, create an actions tree that only contains an "Extract" action and try extracting from a single page. Use only a couple of kinds first. Also, to see exactly what your kinds will extract, select a kind in the browser, and look at the selection panel at the bottom. If you are extracting a property that is not listed there, you can make this property visible by clicking on the "Choose visible properties" button.

If you wish, send me your project or part of it and I'll take a look at it.
Juan Soldi
The Helium Scraper Team

Katedsimmons
Posts: 11
Joined: Fri Jun 17, 2011 4:44 pm

Re: Please Help

Post by Katedsimmons » Sat Jun 18, 2011 12:25 am

I did just copy and past from the table. I have rechecked and yes on everything but name and email it comes up all of the items on the page. I am going to try it again, and If I still cant figure it out I will send you the project. Thank you so much for offering to help!!!!!!

Katedsimmons
Posts: 11
Joined: Fri Jun 17, 2011 4:44 pm

Re: Please Help

Post by Katedsimmons » Sat Jun 18, 2011 3:27 am

Juan,
I was going to send you a sample on PM but I am new so not allowed to do that. If you would pm me or email me I can send you a sample, if you don't mind.
Thanks!
Katie

Post Reply