Sign in to see all reviews and comparisons. It's Free!
DataCleaner is a strong data profiling engine for discovering and analyzing the quality of user’s data. DataCleaner offers features such as support adding HDFS datastores, support for simple fixed-width mainframe/EBCDIC files.
Data Analysis Software
• Data Quality Analysis & Profiling • Duplicate Detection • Data Standardization & Cleansing • Data Health Monitoring • Data Quality Eco-System
Small (<50 employees), Medium (50 to 1000 Enterprise (>1001 employees)
• Data Quality Analysis & Profiling • Duplicate Detection • Data Standardization & Cleansing • Data Health Monitoring
What are the benefits?
• Analyze the quality of your data • Discover the quality of your data • Avoid operational issues and bad customer experiences • Ease of configuration • Improved inferential matching
Aggregated User Rating
Ease of use
Features & Functionality
Renew & Recommend
DataCleaner lets users find the patterns, missing values, character sets and other characteristics of the users’ data values.
Aggregated User Rating
You have rated this
DataCleaner is a strong data profiling engine for discovering and analyzing the quality of user’s data. DataCleaner offers features such as support adding HDFS datastores, support for simple fixed-width mainframe/EBCDIC files, extended the monitor REST job trigger, news channel into DataCleaner.
Monitor: Improve scheduling page load times, Monitor: Old repository backed up when new repository is loaded, Monitor: Reference data can now be configured in UI, IBM's ICU library used for better Unicode standard compatibility, Hashing component added, Fixed Pentaho plugin, Extended dictionary matcher (added possibility to ignore diacritic signs), Added scroll bar for value distribution (result) page and made connection lines lighter in desktop graphical job representation.
DataCleaner is built to handle data both big and small from CSV files, Excel spreadsheets to Relational Databases (RDBMs) and NoSQL databases. DataCleaner allows users to build their own cleansing rules and compose them into several use scenarios or target databases whether it is simple search/replace rules, regular expressions, and pattern matching or completely custom transformations.
DataCleaner’s Monitoring establishes the starting point and goals, and to ensure a process of following up on data quality issues. The monitoring server of DataCleaner enables users to not just make point-in-time profiles of users’ data, but to schedule periodic data quality checks and receive notifications if quality KPIs get out of control.
DataCleaner’s Data Quality Eco-System delivers not only out-of-the-box functionality, but also hosts an eco-system of community driven application extensions, integrations and shared content. Datacleaner’s duplicate detection feature, builds on Machine Learning principles for ease of configuration and improved inferential matching.
PAT RESEARCH is a B2B discovery platform which provides Best Practices, Buying Guides, Reviews, Ratings, Comparison, Research, Commentary, and Analysis for Enterprise Software and Services. We provide Best Practices, PAT Index™ enabled product reviews and user review comparisons to help IT decision makers such as CEO’s, CIO’s, Directors, and Executives to identify technologies, software, service and strategies.