Question regarding cached files...
-
vandigroup
- Posts: 7
- Joined: Sun Mar 25, 2012 5:20 am
Question regarding cached files...
Quick question. How can I scrape a site but only download images that are missing? This is on a fresh project that I have not scraped yet but have most of the images already. Thx.
Re: Question regarding cached files...
Hi,
Helium Scraper, by default, will take images from the browser's cache (which is not the same as the downloads folder) if they are available. It will still copy them to the downloads folder though, so the result will look like it has downloaded them. If you downloaded this images some other way, then I don't think there is a way to tell Helium Scraper which image in your folder correspond to which image in the web. This would require having the original URL of the picture.
Helium Scraper, by default, will take images from the browser's cache (which is not the same as the downloads folder) if they are available. It will still copy them to the downloads folder though, so the result will look like it has downloaded them. If you downloaded this images some other way, then I don't think there is a way to tell Helium Scraper which image in your folder correspond to which image in the web. This would require having the original URL of the picture.
Juan Soldi
The Helium Scraper Team
The Helium Scraper Team