IBM Open Platform with Apache Hadoop builds the platform for big data projects and provides the most current Apache Hadoop open source content.
IBM Open Platform with Apache Hadoop provides native support for rolling upgrades for Hadoop services. Support for long-running applications within YARN for enhanced reliability & security. Provides heterogeneous storage in HDFS for in-memory, SSD in addition to HDD.
Spark in-memory distributed compute engine for dramatic performance increases over MapReduce and simplifies developer experience, leveraging Java, Python & Scala [...]
Hortonworks Sandbox is a personal, portable Apache Hadoop environment that comes with dozens of interactive Hadoop and it’s ecosystem tutorials and the most exciting developments from the latest HDP distribution.
Hortonworks Sandbox provides performance gains up to 10 times for applications that store large datasets such as state management, through a revamped Spark Streaming state tracking API. It provides seamless Data Access to achieve higher performance with Spark. Also provides dynamic Executor Allocation to utilize cluster resources efficiently through Dynamic Executor Allocation functionality that [...]
MapR Converged Data Platform integrates the power of Hadoop and Spark with global event streaming, real-time database capabilities, and enterprise storage for developing and running innovative data applications. Modules include MapR-FS, MapR-DB, and MapR Streams. Its enterprise- friendly design provides a familiar set of file and data management services, including a global namespace, high availability, data protection, self-healing clusters, access control, real-time performance, secure multi-tenancy, and management and monitoring.
MapR tests and integrates open source ecosystem projects such as Hive, Pig, Apache HBase [...]