Sign in to see all reviews and comparisons. It's Free!
By clicking Sign In with Social Media, you agree to let PAT RESEARCH store, use and/or disclose your Social Media profile and email address in accordance with the PAT RESEARCH Privacy Policy and agree to the Terms of Use.
HBase is a non-relational database meant for massively large tables of data that is implicitly distributed across clusters of commodity hardware. HBase provides “linear and modular scalability” and a variety of robust administration and data management features for your tables, all hosted atop Hadoop’s underlying HDFS file system.
Category
Column-oriented DBMS
Features
• Automatic and configurable sharding of tables • Strictly consistent reads and writes • Query predicate push down via server side Filters • Extensible jruby-based (JIRB) shell • Automatic failover support between RegionServers • Convenient base classes for backing Hadoop MapReduce jobs with Apache HBase tables
License
Proprietary
Pricing
Subscription
Free Trial
Available
Users Size
Small (<50 employees), Medium (50 to 1000 Enterprise (>1001 employees)
Company
Apache Hbase
What is best?
• Automatic and configurable sharding of tables • Strictly consistent reads and writes • Query predicate push down via server side Filters • Extensible jruby-based (JIRB) shell
What are the benefits?
• Multi-dimensional: Data is addressable in several dimensions: tables, rows, columns, versions • Persistent: It stores data on disk with a log, so it sticks around • Sparse: If a row has no value in a column, it doesn’t use any space • Functional: Aggregate and analyze billions of rows present in the HBase tables
PAT Rating™
Editor Rating
Aggregated User Rating
Rate Here
Ease of use
7.6
7.7
Features & Functionality
7.6
9.5
Advanced Features
7.6
9.4
Integration
7.6
9.8
Performance
7.6
6.3
Customer Support
7.6
7.8
Implementation
5.0
Renew & Recommend
10
Bottom Line
Apache HBase™ is the Hadoop® database, a distributed, scalable, big data store.
7.6
Editor Rating
8.2
Aggregated User Rating
7 ratings
You have rated this
HBase is a non-relational database meant for massively large tables of data that is implicitly distributed across clusters of commodity hardware. HBase provides “linear and modular scalability” and a variety of robust administration and data management features for your tables, all hosted atop Hadoop’s underlying HDFS file system. Designed to support queries of massive data sets, HBase is optimized for read performance. For writes, HBase seeks to maintain consistency. A write operation in HBase first records the data to a commit log (a "write-ahead log"), then to an internal memory structure called a MemStore. When the MemStore fills, it is flushed to disk as an entity called an HFile. HFiles are stored as a sequence of data blocks, with an index appended to the file's end. Another index, kept in memory, speeds searches for data in HFiles. HBase shards rows by regions, which are defined by a range of row keys. Every region in an HBase cluster is managed by a RegionServer process. Typically, there is a single RegionServer process per HBase node. As the amount of data grows, HBase splits regions and migrates the associated data to different nodes in the cluster for balancing purposes. HBase itself includes some built-in Web-based monitoring tools. An HBase master node serves a Web interface on port 60010. HBase offers a command shell, Java APIs, and REST APIs for managing everything, along with a variety of other integrations with popular big data storage, management, and processing / manipulation packages. HBase is an option on Amazon’s EMR, and is also available as part of Microsoft’s Azure offerings.
By clicking Sign In with Social Media, you agree to let PAT RESEARCH store, use and/or disclose your Social Media profile and email address in accordance with the PAT RESEARCH Privacy Policy and agree to the Terms of Use.