Data Preparation Platform

Data Preparation Platform is used to manipulate data until it reaches an existential state that is befitting for further processing and analysis. A lot of efforts go into the initial stages of data manipulation, most of which are labour-intensive and difficult to fully automate, however with data preparation platforms, businesses can simplify these complex and manual cleansing, allowing them focus on the analysis and business applications. Data preparation (or data processing) involves the collection, cleaning and consolidation of data into a single file or data table for the purpose of analysis or mining, as the case may be. It is an avenue for users who are technically qualified to analyse datasets of any type or specific composition. During data preparation, extensive auditing takes place until available data is of extremely high quality and fit for intended use. Skipping this level of auditing is dangerous, and can terribly affect the quality of prepared data, as well as data mining results. More often than not, Data Preparation Platforms have embedded adaptors that allow structured and semi-structured sources (e.g. spread sheets, database tables and XML / JSON content) to be equally explored and analysed.

PAT Grid™ (Beta) for Data Preparation Platform

Upcoming
Challengers
Leaders
RapidMiner

Trifacta’s Visual Data Profiling

Microsoft Power Query for Excel

Informatica Rev

Platfora

Waterline Data

FICO Big Data Analyzer

Tamr

SAP Lumira

IBM Predictive Analytics

Looker

Teradata Loom

PAT Index
Measures how well the product or service is performing.
Rating Index
Measures how the product or service is rated in comparison to other products.
Data Preparation Platform
PAT Index™
 
Read More
95
 
Read More
95
 
Read More
91
 
Read More
89
 
Read More
87
 
Read More
78
 
Read More
78
 
Read More
75
 
Read More
67
 
Read More
64
 
Read More
64
 
Read More
60
 
Read More
57
 
Read More
52
 
Read More
51
 
Read More
51
 
Read More
48
 
Read More
48
 
Read More
46
 
Read More
46
 
Read More
77
 
Read More
77
 
Read More
75
 
Read More
72
 
Read More
66
 
Read More
66
 
Read More
6
 
Read More
52
 
Read More
47
 
Read More
46
Top Five
PAT Index™
 
1
RapidMiner Studio
 
2
Datameer
 
3
KNIME Analytics Platform
 
4
Trifacta
 
5
ClearStory Data
Compare
Go