Bigdata
Now Reading
Apache Hbase
0
Review

Apache Hbase

Overview
Synopsis

HBase is a non-relational database meant for massively large tables of data that is implicitly distributed across clusters of commodity hardware. HBase provides “linear and modular scalability” and a variety of robust administration and data management features for your tables, all hosted atop Hadoop’s underlying HDFS file system.

Category

Column-oriented DBMS

Features

• Automatic and configurable sharding of tables
• Strictly consistent reads and writes
• Query predicate push down via server side Filters
• Extensible jruby-based (JIRB) shell
• Automatic failover support between RegionServers
• Convenient base classes for backing Hadoop MapReduce jobs with Apache HBase tables

License

Proprietary

Pricing

Subscription

Free Trial

Available

Users Size

Small (<50 employees), Medium (50 to 1000 employees), Enterprise (>1001 employees)

Company

Apache Hbase

What is best?

• Automatic and configurable sharding of tables
• Strictly consistent reads and writes
• Query predicate push down via server side Filters
• Extensible jruby-based (JIRB) shell

What are the benefits?

• Multi-dimensional: Data is addressable in several dimensions: tables, rows, columns, versions
• Persistent: It stores data on disk with a log, so it sticks around
• Sparse:  If a row has no value in a column, it doesn’t use any space
• Functional: Aggregate and analyze billions of rows present in the HBase tables

PAT Rating™
Editor Rating
Aggregated User Rating
Rate Here
Ease of use
7.6
7.7
Features & Functionality
7.6
9.4
Advanced Features
7.6
9.4
Integration
7.6
9.8
Performance
7.6
8.2
Customer Support
7.6
7.8
Implementation
5.0
Renew & Recommend
10
Bottom Line

Apache HBase™ is the Hadoop® database, a distributed, scalable, big data store.

7.6
Editor Rating
8.4
Aggregated User Rating
7 ratings
You have rated this

HBase is a non-relational database meant for massively large tables of data that is implicitly distributed across clusters of commodity hardware. HBase provides “linear and modular scalability” and a variety of robust administration and data management features for your tables, all hosted atop Hadoop’s underlying HDFS file system. Designed to support queries of massive data sets, HBase is optimized for read performance. For writes, HBase seeks to maintain consistency. A write operation in HBase first records the data to a commit log (a "write-ahead log"), then to an internal memory structure called a MemStore. When the MemStore fills, it is flushed to disk as an entity called an HFile. HFiles are stored as a sequence of data blocks, with an index appended to the file's end. Another index, kept in memory, speeds searches for data in HFiles. HBase shards rows by regions, which are defined by a range of row keys. Every region in an HBase cluster is managed by a RegionServer process. Typically, there is a single RegionServer process per HBase node. As the amount of data grows, HBase splits regions and migrates the associated data to different nodes in the cluster for balancing purposes. HBase itself includes some built-in Web-based monitoring tools. An HBase master node serves a Web interface on port 60010. HBase offers a command shell, Java APIs, and REST APIs for managing everything, along with a variety of other integrations with popular big data storage, management, and processing / manipulation packages. HBase is an option on Amazon’s EMR, and is also available as part of Microsoft’s Azure offerings.

Filter reviews
User Ratings





User Company size



User role





User industry





Ease of use
Features & Functionality
Advanced Features
Integration
Performance
Customer Support
Implementation
Renew & Recommend

What's your reaction?
Love It
0%
Very Good
0%
INTERESTED
0%
COOL
0%
NOT BAD
0%
WHAT !
0%
HATE IT
0%