Web Content Extractor Documentation

About

Web Content Extractor is an easy-to-use web scraping application that extracts text and images from websites. It is designed for accurate, efficient web data extraction.

Web Content Extractor offers a friendly, wizard-driven interface that walks you through creating an extraction pattern and crawling rules in a simple point-and-click manner. Not a single line of code is required! Once the pattern and rules are set up, web data extraction is completely automatic.

Features

  • Template-based web data extraction with an easy-to-use configuration wizard;
  • Customizable web crawler (web spider) with crawling rules and multithreaded downloading (up to 10 threads);
  • Exports the extracted data to Microsoft Excel, text (TXT), HTML, and XML files, SQL and MySQL script files, Microsoft Access databases, and any ODBC data source;
  • Uploads the output file to an FTP server;
  • Extracts data from password-protected websites;
  • Supports a number of command-line options that can be used to automate the program;
  • Built-in scheduler that runs scraping tasks automatically at a specified time;
  • Uses multiple proxy servers, automatically switching between proxies to rotate your IP address;
  • Simple to use, with a short learning curve.
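
For readers curious about what multithreaded downloading with proxy rotation involves, the idea can be pictured with a short Python sketch. It is purely illustrative and is not Web Content Extractor's own code; the program itself is configured through its wizard without any programming. The names crawl, fake_fetch, and MAX_THREADS are hypothetical, and fake_fetch is a stand-in for a real HTTP download so that the example performs no network access.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import cycle

MAX_THREADS = 10  # mirrors the 10-thread limit stated in the feature list


def fake_fetch(url, proxy):
    """Hypothetical stand-in for a real download; performs no network I/O."""
    return f"{url} via {proxy}"


def crawl(urls, proxies, fetch=fake_fetch, threads=MAX_THREADS):
    """Fetch every URL with a bounded thread pool, assigning proxies round-robin."""
    if not proxies:
        raise ValueError("at least one proxy is required")
    # Keep the worker count between 1 and MAX_THREADS.
    workers = max(1, min(threads, MAX_THREADS))
    # Pair each URL with the next proxy in rotation: p1, p2, ..., then p1 again.
    jobs = list(zip(urls, cycle(proxies)))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() returns results in the same order as the submitted jobs.
        return list(pool.map(lambda job: fetch(*job), jobs))


if __name__ == "__main__":
    pages = ["http://example.com/1", "http://example.com/2", "http://example.com/3"]
    for line in crawl(pages, ["proxy-a:8080", "proxy-b:8080"]):
        print(line)
```

In this sketch the proxies are assigned before the downloads start, so each URL's proxy is predictable and no locking is needed between threads; a real crawler would also have to handle failed proxies and retries, which are left out here.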