Web Content Extractor Documentation

New Project Wizard

To create a new project, click the New Project button on the application toolbar or use the File --> New Project menu item. This will launch the New Project Wizard.

In the first step of the wizard, enter the starting URLs. These are the URLs of the pages from which the program begins crawling and extracting data. If necessary, you can also specify search or login form data.

In the second step of the wizard, specify the crawling rules. Crawling rules determine which links the program follows.
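
To illustrate the general idea of crawling rules, here is a minimal Python sketch of link filtering. It is not the program's own implementation or rule syntax: the function names, the include/exclude regular expressions, and the example.com URLs are all assumptions made for illustration.

```python
import re
from urllib.parse import urljoin, urlparse


def should_follow(url, include_patterns, exclude_patterns):
    """Return True if url matches an include pattern and no exclude pattern.

    Hypothetical rule model: both lists hold regular expressions.
    """
    if any(re.search(p, url) for p in exclude_patterns):
        return False
    return any(re.search(p, url) for p in include_patterns)


def filter_links(base_url, hrefs, include_patterns, exclude_patterns):
    """Resolve hrefs against base_url and keep only the links to follow."""
    seen = set()
    result = []
    for href in hrefs:
        absolute = urljoin(base_url, href)
        # Skip non-web links such as mailto: or javascript:
        if urlparse(absolute).scheme not in ("http", "https"):
            continue
        # Skip links that were already seen on this page
        if absolute in seen:
            continue
        seen.add(absolute)
        if should_follow(absolute, include_patterns, exclude_patterns):
            result.append(absolute)
    return result


# Example: follow only catalog pages, never the login page (assumed URLs).
links = filter_links(
    "http://example.com/catalog/",
    ["page2.html", "/catalog/item/5", "/login", "mailto:a@b.c", "page2.html"],
    include_patterns=[r"/catalog/"],
    exclude_patterns=[r"/login"],
)
```

In this sketch, exclude patterns take priority over include patterns, so a link is followed only when it is explicitly allowed and not explicitly blocked.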

In the third step of the wizard, specify the extraction pattern. The extraction pattern determines where specific data is located on the page and the parameters used to extract it.
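
The concept of an extraction pattern can be sketched in Python as follows. This is only an illustration under stated assumptions, not the way the program defines or stores its patterns: the HTML markup, the field names (name, price), and the use of a regular expression are all hypothetical.

```python
import re

# Hypothetical pattern: each named group marks the position of one data field
# relative to the surrounding markup.
PATTERN = re.compile(
    r'<div class="product">\s*'
    r'<h2>(?P<name>.*?)</h2>\s*'
    r'<span class="price">(?P<price>.*?)</span>',
    re.DOTALL,
)


def extract_records(html):
    """Return one dict per match, keyed by the field names in PATTERN."""
    return [m.groupdict() for m in PATTERN.finditer(html)]


# Assumed sample page fragment with two records.
sample_html = (
    '<div class="product"><h2>Widget</h2>'
    '<span class="price">$9.99</span></div>'
    '<div class="product">\n<h2>Gadget</h2>\n'
    '<span class="price">$19.50</span></div>'
)
records = extract_records(sample_html)
```

Each record found on the page becomes one row of extracted data, with one value per field defined in the pattern.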

In the last step of the wizard, you can change the project file name and the image folder and, if necessary, specify an image naming convention.