C Web Scraping Library

From Software Infocard Wiki
Jump to: navigation, search
Infocard

Target Platform: Windows
Version: 4.0.4.2
Release Date: February 28, 2017
License: Shareware
Price: USD 299
Publisher: The Iron Web Scraper Development Team
Product Web Site: [External Link]
Web-Scraping Framework C#/.Net/Mono
1001 Kb

Description by the Publisher

The web-scraper for C# allows .Net developers to create logical that extract content from web applications and turn it into JSON, spreadsheets, C# objects or even SQL using simple C# and Linq code.

Iron WebScraper is a web scraping library for the .Net 4.5 and Core platform which allows developers to use clean, simple logic to reverse any web resource back into C# objects or SQL. It can extract pages using set-by-step (if-this-then-that) workflows, effortlessly scraping and parsing html, javascript, xml, RSS, pdfs and office documents on the internet or local intranets back into useful structured data.

This leaves the developer with clean, efficient web-scraping applications which are easy to understand and debug.

The C# Web Scraping Library is extremely polite, ensuring that no domain or IP address has too many concurrent requests. It intelligently throttles both client and server side looking for excessive CPU usage and slowing to an appropriate pace. In addition, it can obey robots.txt directives including bot specific crawl rates and limitation. The exact urls and content types to be strapped can be set using logical workflows and regex/wildcard rules.

Screen-scraping is made easier with identity control, automatically managing threads, rate limits, urls, duplicates, retries, proxies, headers and cookies into a an army of virtual browser which can mimic human behavior and even client buttons, fill in forms or log in behind security walls. This is useful for migrating legacy systems, populating enterprise search facilities and for statistical competitive analysis

Full documentation, support and downloadable DLLS for the C# Web Scraper are available from https://ironsoftware.com/csharp/webscraper/ , in addition to links to a .Net 4.5+ Nuget package with full Azure and Mono compatibility.

Limitations in the Downloadable Version

Free C# developer license for testing and evaluation before deployment: https://ironsoftware.com/csharp/webscraper/licensing/iron-webscraper-eula-license.html

Product Identity

Unique Product ID: PID-F90074B71E87

Unique Publisher ID: BID-88001F048887

[C# Web Scraping Library PAD XML File]

Category