Talend Big Data Sandbox to Accelerate Adoption of Big Data
Talend Big Data Sandbox is a preconfigured virtual environment designed to get big data projects off the ground quickly through real-world use cases and interactive learning tools. Big data projects often start with a “sandbox” or proof-of-concept project. Throughout these projects, challenges abound that derail plans and prevent companies from delivering an effective return on data. The Big Data Sandbox virtual image includes an evaluation installation of Talend Platform for Big Data, an Apache Hadoop distribution from Cloudera, Hortonworks, or MapR, and a step-by-step Big Data Insights Cookbook with five ready-to-run big data scenarios: clickstream analysis, sentiment analysis with social media data, log stream analysis using Apache weblogs, ETL offloading with Hadoop, and movie recommendation modeling using Apache Spark on Cloudera.
The Big Data Sandbox provides a one-stop shop for big data integration, data quality, and Hadoop, saving developers weeks of installation and configuration time, as well as time spent building and integrating their first big data prototype. Preconfigured with real-world use cases, it helps users quickly evaluate their big data needs. The included Big Data Insights Cookbook is a step-by-step guide with working big data examples and video tutorials, including ETL offloading, clickstream analysis, Twitter sentiment analysis, and Apache weblog analysis. This environment helps users shorten their big data integration learning curve.
With the Big Data Sandbox, developers can start prototyping their project using the fully featured Talend Platform for Big Data, along with big data documentation, online video tutorials, a large open online community, and connectivity to any data source, including big data distributions and NoSQL databases.
Talend’s Big Data Sandbox for Cloudera offers a real-time Apache Spark scenario. It gives users first-hand experience with Talend’s Spark components interacting with Cloudera’s built-in Spark engine running in YARN client mode, without the otherwise complicated and lengthy installation and configuration process. The Spark scenario reduces setup time from weeks to minutes and showcases the ease with which Talend can connect to the Spark engine to unlock its significant potential.
Talend Big Data Sandboxes require 8 to 10 GB of disk space and a minimum of 8 GB of RAM. They run in the latest versions of VMware Player (Windows), VMware Fusion (Mac), or Oracle VirtualBox.
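As a rough pre-flight check before downloading a sandbox image, the requirements above can be turned into a small script. This is only a sketch and not part of the Talend tooling: the function names and the choice of 10 GB (the upper end of the quoted range) as the disk threshold are assumptions, and the host RAM figure is supplied by hand rather than detected.

```python
import shutil

# Thresholds taken from the stated requirements.
# Assumption: use the upper end of the 8 to 10 GB disk range to be safe.
MIN_DISK_GB = 10
MIN_RAM_GB = 8


def free_disk_gb(path="."):
    """Return the free space, in GiB, on the volume that holds `path`."""
    return shutil.disk_usage(path).free / 1024**3


def meets_requirements(disk_gb, ram_gb,
                       min_disk_gb=MIN_DISK_GB, min_ram_gb=MIN_RAM_GB):
    """Return True if both the disk and RAM figures reach the minimums."""
    return disk_gb >= min_disk_gb and ram_gb >= min_ram_gb


if __name__ == "__main__":
    disk = free_disk_gb(".")
    ram = 8.0  # hypothetical value: replace with the RAM installed on the host
    verdict = "OK" if meets_requirements(disk, ram) else "below the stated minimums"
    print(f"Free disk: {disk:.1f} GiB, RAM: {ram:.1f} GiB -> {verdict}")
```

Run it from the directory where the virtual image will be stored, so that the disk figure refers to the right volume.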