Sign in to see all reviews and comparisons. It's Free!
By clicking Sign In with Social Media, you agree to let PAT RESEARCH store, use and/or disclose your Social Media profile and email address in accordance with the PAT RESEARCH Privacy Policy and agree to the Terms of Use.
DataCleaner is a strong data profiling engine for discovering and analyzing the quality of user’s data. DataCleaner offers features such as support adding HDFS datastores, support for simple fixed-width mainframe/EBCDIC files.
Category
Data Analysis Software
Features
• Data Quality Analysis & Profiling • Duplicate Detection • Data Standardization & Cleansing • Data Health Monitoring • Data Quality Eco-System
License
Proprietary
Pricing
Subscription
Free Trial
Available
Users Size
Small (<50 employees), Medium (50 to 1000 Enterprise (>1001 employees)
• Data Quality Analysis & Profiling • Duplicate Detection • Data Standardization & Cleansing • Data Health Monitoring
What are the benefits?
• Analyze the quality of your data • Discover the quality of your data • Avoid operational issues and bad customer experiences • Ease of configuration • Improved inferential matching
PAT Rating™
Editor Rating
Aggregated User Rating
Rate Here
Ease of use
7.6
6.4
Features & Functionality
7.6
8.2
Advanced Features
7.6
7.3
Integration
7.6
8.7
Performance
7.6
7.0
Customer Support
7.6
9.1
Implementation
4.1
Renew & Recommend
8.0
Bottom Line
DataCleaner lets users find the patterns, missing values, character sets and other characteristics of the users’ data values.
7.6
Editor Rating
7.4
Aggregated User Rating
7 ratings
You have rated this
DataCleaner is a strong data profiling engine for discovering and analyzing the quality of user’s data. DataCleaner offers features such as support adding HDFS datastores, support for simple fixed-width mainframe/EBCDIC files, extended the monitor REST job trigger, news channel into DataCleaner.
Monitor: Improve scheduling page load times, Monitor: Old repository backed up when new repository is loaded, Monitor: Reference data can now be configured in UI, IBM's ICU library used for better Unicode standard compatibility, Hashing component added, Fixed Pentaho plugin, Extended dictionary matcher (added possibility to ignore diacritic signs), Added scroll bar for value distribution (result) page and made connection lines lighter in desktop graphical job representation.
DataCleaner is built to handle data both big and small from CSV files, Excel spreadsheets to Relational Databases (RDBMs) and NoSQL databases. DataCleaner allows users to build their own cleansing rules and compose them into several use scenarios or target databases whether it is simple search/replace rules, regular expressions, and pattern matching or completely custom transformations.
DataCleaner’s Monitoring establishes the starting point and goals, and to ensure a process of following up on data quality issues. The monitoring server of DataCleaner enables users to not just make point-in-time profiles of users’ data, but to schedule periodic data quality checks and receive notifications if quality KPIs get out of control.
DataCleaner’s Data Quality Eco-System delivers not only out-of-the-box functionality, but also hosts an eco-system of community driven application extensions, integrations and shared content. Datacleaner’s duplicate detection feature, builds on Machine Learning principles for ease of configuration and improved inferential matching.
By clicking Sign In with Social Media, you agree to let PAT RESEARCH store, use and/or disclose your Social Media profile and email address in accordance with the PAT RESEARCH Privacy Policy and agree to the Terms of Use.