Hi all,
I need to get what's behind <TITLE> tag of my targets webpages. How do I do ?
Thanx !
getting <TITLE>
Re: getting <TITLE>
Hi,
The attached project is an example of how to do this. If you press play in the "Actions tree 1", it will extract TITLE and URL from whatever page you are at. Take a look at the JS_TITLE JavaScript gatherer at Project -> JavaScript Gatherers.
The BODY kind is a kind that simply selects the BODY of the page. I use this kind because every HTML page will have a BODY. Also, see how the "Extract to table: 'Results'" action is defined by double clicking it.
Let me know if you have any question.
The attached project is an example of how to do this. If you press play in the "Actions tree 1", it will extract TITLE and URL from whatever page you are at. Take a look at the JS_TITLE JavaScript gatherer at Project -> JavaScript Gatherers.
The BODY kind is a kind that simply selects the BODY of the page. I use this kind because every HTML page will have a BODY. Also, see how the "Extract to table: 'Results'" action is defined by double clicking it.
Let me know if you have any question.
- Attachments
-
- ExtractTitle.hsp
- (297.3 KiB) Downloaded 622 times
Juan Soldi
The Helium Scraper Team
The Helium Scraper Team
Re: getting <TITLE>
so simple ... thank you for the quick and efficient reply !
And by the way, it could be interesting to have the total time the project ran, in the message 'project completed'.
Thanx again !
And by the way, it could be interesting to have the total time the project ran, in the message 'project completed'.
Thanx again !