Platfora : Platfora is an end-to-end big data analytics platform with a native-Hadoop infrastructure that enables analysts, business professionals and data scientists to instantly access and drill down into the rawest forms of petabyte-scale data without the need for IT support. Platfora analyze all of data to answer the toughest questions with no code required including data preparation, data warehousing and business analytics are included. Platfora Big Data Analytics includes significant enhancements to the visual analysis capabilities and processing engine, including interactivity at Big Data Scale, advanced visualizations and geo analytics. Platfora [...]
Datawatch : Datawatch provides a platform for visual analytics to acquire, prepare, and transform data from structured and multi-structured sources such as PDF and log files, as well as real-time streaming data, into visually rich analytic applications. This allows users to dynamically discover key factors that impact any operational aspect of their business. Datawatch Managed Analytics Platform deliver an enterprise solution for self-service data preparation and visual data discovery. The capabilities delivered with the Datawatch Managed Analytics Platform include self-service data preparation, advanced data enrichment, automation without scripting, [...]
ClearStory Data : ClearStory Data infers what’s in data to speed data preparation and converge disparate data on the fly. Internal and external data access requires no pre modeling or skills that mandate data specialists. ClearStory’s Intelligent Data Harmonization identifies data relationships across disparate data sources and converges data on-the-fly, to reach holistic, interactive answers faster. ClearStory’s advanced data harmonization platform is powered by an inference and profiling engine to extract metadata in real-time, using Apache Spark’s fast in-memory processing. Data dimensions including dates, time, currencies, geographical entities, and [...]
Waterline Data is an automated data discovery platform that helps Data architects inventory all data in Hadoop automatically at scale, and provision data to business users securely and to make the data ready for analysis automatically without having to explore every file manually. Waterline Data also helps to discover lineage and business metadata automatically, as well as manage metadata.
Waterline Data Inventory automatically profiles and catalogs all the files in Hadoop, detects when the contents of files have changed and notifies users and inspects each field in a file to infer its meaning, tags the field [...]
Trifacta : Trifacta’s Visual Data Profiling features provide immediate visibility into unique elements of the data set like data distributions and outliers to inform the transformation and analysis process.Trifacta uses data inference techniques to introspect the data and automatically apply initial shaping and metadata recommendations for the user. This greatly accelerates the transformation process. Users can quickly un-nest and iterate on the shape of their data in preparation for the dataset’s downstream use.
Trifacta’s data enrichment features make standardizing data, joining datasets and aggregating data outputs [...]
Teradata Loom : Teradata Loom enables data analysts and data scientists to easily find, access, and understand data in Hadoop. Loom quickly start with data analysis to accelerate the time from data acquisition to delivering business insights and enables highly exploratory, iterative interactions with the datasets to quickly prepare the data for meaningful statistical analysis. The Loom workbench is a simple browser based, intuitive user interface accessible in a self service fashion by multiple users in the organization.
Features include single, unified integrated platform from discovery to metadata management to data [...]
Tamr : Tamr’s data unification platform catalogues, connects and curates hundreds or thousands of internal and external data sources through a combination of machine learning algorithms and human expert guidance reducing the cost, time and effort of preparing data for analysis. Tamr, catalogs, connects and curates the vast reserves of underutilized internal and external data using a combination of machine learning with human guidance so enterprises can use all their data for analytics.
Tamr dynamically catalogs the organization’s information assets with their crawlers, entity tagging, and metadata visualization features [...]
Paxata : Paxata is self-service Adaptive Data Preparation platform that lets business analysts rapidly collect, explore, transform and combine data with the same freedom they are used to in their analytic discovery. Paxata’s solution lets business people make data sets ready for ad-hoc analytics without going through the painful and manual steps they traditionally dealt with. Paxata platform was built with a data management layer that persists data inside the Hadoop Distributed File System (HDFS) and a real-time columnar parallelized in-memory pipeline data prep engine powered by Intellifusion. The data prep engine wraps Apache Spark v1.1 with additional [...]