DEiXTo (or ΔEiXTo) is a powerful web data extraction tool that is based on the W3C Document Object Model (DOM). It allows users to create highly accurate “extraction rules” (wrappers) that describe what pieces of data to scrape from a website.
Web Scraping Tools
Monitors prices of competition
Build alerting web services
Transforms contents of digital library into suitable formats
Graphic friendly interface
Effective extraction of data
Free most of the time unless the data extraction is more complex. Contact for pricing
Small (<50 employees), Medium (50 to 1000 Enterprise (>1001 employees)
DEiXTo is a powerful web data extraction tool that is based on the W3C Document Object Model (DOM). It allows users to create highly accurate extraction rules that describe what pieces of data to scrape from a website. DEiXTo consists of three separate components to help users.
GUI DEiXTo is an MS Windows application implementing a friendly graphical user interface that is used to manage extraction rules (build, test, fine-tune, save and modify).
This is all that a user needs for small scale extraction tasks. DEiXToBot is a Perl module implementing a flexible and efficient Mechanize agent capable of extracting data of interest using GUI DEiXTo generate patterns. It contains the best of breed Perl technology and allows extensive customization which facilitates for tailor-made solutions.
Lastly, the DEiXTo CLE (Command Line Executor) is a stand-alone, DEiXToBot-based, cross-platform utility that can massively apply an extraction rule on multiple target pages and produce structured output in a variety of formats. DEiXTo can contend with a wide range of websites with high precision and recall. It provides the user with an arsenal of features aiming at the construction of well-engineered extraction rules.
Wrappers built with GUI DEiXTo can be scheduled to run automatically providing automated access to resources of interest and saves users a lot of time, energy and repetitive effort. This means that the user does not have to contribute a lot of their resources to sorting through data but can instead schedule the search so that the data is always up to date and current. Using DEiXTo will revolutionise any business as it effectively simplifies a wide variety of data.