New features in 2.3.1.2

Helium Scraper's news and announcements
Post Reply
webmaster
Site Admin
Posts: 491
Joined: Mon Dec 06, 2010 8:39 am
Contact:

New features in 2.3.1.2

Post by webmaster » Mon Feb 27, 2012 1:15 am

Among many bug fixes and code optimization, here are the main new features in version 2.3.1.2:
  • A project can be linked to a database file (.mdb) instead of using the default embedded database. This can be done by clicking on the Connect to Database File button in the database panel.
  • Helium Scraper can be run from the command line and consequently can run from batch files and from the Windows Task Scheduler. For more information see Command Line in the documentation.
  • A new action called Start Processes has been added. This action will allow you to run other instances of Helium Scraper passing a different starting URL to each of them. This allows you to have many instances running at the same time making the the extraction process significantly faster, as well as preventing excessive memory consumption when the site being scraped has a memory leak. More info can be found at Actions -> Actions List -> Start Processes in the documentation.
  • Other instances can also be run from an Execute JavaScript action by using the ProcessObject object. See Action -> Actions List -> Execute JavaScript -> Class List -> ProcessObject for more information on this.
  • You can create "Set Kinds" from the kinds panel. These kinds perform set operations (Union, Intersection, Except) between existing kinds (which can be also Set Kinds).
Feel free to post any questions you have!
Juan Soldi
The Helium Scraper Team

Post Reply