Scraping Manta
Posted: Tue May 22, 2012 7:38 pm
I put together a simple scrape to gather information from Manta.com, but I have found that the site is running exceptionally slowly (I was getting about one entry every five minutes or so). I also noticed that when you load the site in the Helium Scraper Browser, the browser indicates that the page never stops loading: the "stop" button at the top of the browser never changes to the "refresh" button, as it normally does once a page has completely finished loading.
I ran the SAVE URL feature for the pages I am interested in, and I would like to run a process to batch everything once I can figure out a way to speed up each scrape in "Main".
I attached the document to help you visualize what I am doing. What advice do you have to speed up the scraping?
Thanks!