Apache Hadoop

Overview
Synopsis

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.

Category

Bigdata

Sub Category

Hadoop Platform

PAT Rating™

Criterion                  Editor Rating   Aggregated User Rating
Ease of use                8.3             9.6
Features & Functionality   8.3             8.7
Advanced Features          8.3             9.2
Integration                8.3             8.7
Performance                8.3             8.0
Training                   -               8.9
Customer Support           8.3             5.1
Implementation             -               8.4
Renew & Recommend          -               8.8

Bottom Line

The core of Apache Hadoop consists of a storage part, known as Hadoop Distributed File System (HDFS), and a processing part called MapReduce. Hadoop splits files into large blocks and distributes them across nodes in a cluster.

Editor Rating: 8.3
Aggregated User Rating: 8.4 (2 ratings)

Hadoop is designed to scale up from single servers to thousands of machines, each offering local computation and storage. The project comprises four modules: Hadoop Common, Hadoop Distributed File System (HDFS), Hadoop YARN, and Hadoop MapReduce.
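
To make the idea of splitting files into large blocks and spreading them across nodes more concrete, here is a minimal Python sketch. It is not Hadoop code: the function names, the node names, and the round-robin placement are assumptions made for illustration (real HDFS uses a rack-aware placement policy). The 128 MiB block size and replication factor of 3 reflect commonly cited HDFS defaults.

```python
# Illustrative sketch only: mimics the idea of HDFS block splitting and
# replica placement. It does not call any Hadoop API.

BLOCK_SIZE = 128 * 1024 * 1024  # 128 MiB, a commonly cited HDFS default
REPLICATION = 3                 # commonly cited default replication factor


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return (offset, length) pairs that cover a file of file_size bytes."""
    return [
        (offset, min(block_size, file_size - offset))
        for offset in range(0, file_size, block_size)
    ]


def place_blocks(blocks, nodes, replication=REPLICATION):
    """Assign each block index to `replication` distinct nodes.

    Round-robin placement is a simplification chosen for this sketch.
    """
    if replication > len(nodes):
        raise ValueError("replication factor exceeds number of nodes")
    return {
        index: [nodes[(index + r) % len(nodes)] for r in range(replication)]
        for index, _ in enumerate(blocks)
    }


if __name__ == "__main__":
    # A hypothetical 300 MiB file on a hypothetical four-node cluster.
    blocks = split_into_blocks(300 * 1024 * 1024)
    for index, replicas in place_blocks(blocks, ["node1", "node2", "node3", "node4"]).items():
        print(index, blocks[index], replicas)
```

Because every block lives on several machines, losing one node does not lose data, and computation can be scheduled on a node that already holds the block it needs.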

Hadoop Common includes the common utilities that support the other Hadoop modules. Hadoop Distributed File System (HDFS) is a distributed file system that provides high-throughput access to application data. Hadoop YARN is a framework for job scheduling and cluster resource management. Hadoop MapReduce is a YARN-based system for parallel processing of large data sets.
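
The MapReduce programming model mentioned above can be sketched in a few lines of plain Python. This is a single-process illustration of the map, shuffle, and reduce steps using the classic word-count example. It does not use the Hadoop API, and the function names are hypothetical.

```python
# Illustrative sketch only: a single-process imitation of the MapReduce
# model (map -> shuffle -> reduce). Hadoop runs these phases in parallel
# across a cluster; here everything runs locally.
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of input."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "hadoop processes data"]))
```

Each call to the map function depends on only one line and each call to the reduce function depends on only one key, which is what lets a framework such as Hadoop MapReduce spread the work over many machines.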

The wider Hadoop ecosystem also includes additional software packages that can be installed on top of or alongside Hadoop, such as Apache Pig, Apache Hive, Apache HBase, Apache Phoenix, Apache Spark, Apache ZooKeeper, Apache Impala (originally Cloudera Impala), Apache Flume, Apache Sqoop, Apache Oozie, and Apache Storm.
