Bigdata

Now Reading

Apache Cassandra

Next
Prev

Review

Apache Cassandra

Overview

Synopsis

Apache Cassandra is a free, open source NoSQL database designed to manage very large data sets (think petabytes) across large clusters of commodity servers.

Category

Virtual Machine Deployment No SQL Data Model

Features

• Symmetric architecture makes it relatively easy to create and scale large clusters
• SQL-like Cassandra Query Language eases developers' transition from RDBMS
• Hinted handoffs - Consistency repair feature
• Solr based integrated search
• External batch analytics powered by Hadoop and Cloudera/Hortonworks
• Operational Simplicity: This big data tool ensures that all the nodes in the cluster are the same, ensuring easy administration and configuration management

License

• Open source

Price

• Open source

Pricing

Subscription

Free Trial

Available

Users Size

Small (<50 employees), Medium (50 to 1000 Enterprise (>1001 employees)

Company

Apache Cassandra

What is best?

What are the benefits?

• Decentralized: Every node in the cluster is identical; there are no network bottlenecks
• Durable: Suitable for applications that can’t afford to lose data
• Elastic Scalability: Cassandra allows businesses to easily add to their online capacity to accommodate more customers and data whenever required
• Allows users to tune for performance or consistency or a balance of both

PAT Rating™

Editor Rating

Aggregated User Rating

Rate Here

Ease of use

7.6

8.6

Features & Functionality

7.6

8.9

Advanced Features

7.6

8.8

Integration

7.6

8.8

Performance

7.6

—

Customer Support

7.6

—

Implementation

—

Renew & Recommend

—

Bottom Line

Apache Cassandra is a free, distributed, high performance, extremely scalable & fault tolerant post-relational database solution.

7.6

Editor Rating

8.8

Aggregated User Rating

1 rating

You have rated this

Apache Cassandra is a free, open source NoSQL database designed to manage very large data sets (think petabytes) across large clusters of commodity servers. Among many distinguishing features, Cassandra excels at scaling writes as well as reads, and its "master-less" architecture makes creating and expanding clusters relatively straightforward. Cassandra's impressive hierarchy of caching mechanisms and carefully orchestrated disk I/O ensures speed and data safety. Its storage architecture is similar to a log-structured merge tree: Write operations are sent first to a persistent commit log (ensuring a durable write), then to a write-back cache called a memtable. When the memtable fills, it is flushed to an SSTable (sorted string table) on disk. All disk writes are appends -- large sequential writes, not random writes -- and therefore very efficient. A Cassandra cluster is organized as a ring, and it uses a partitioning strategy to distribute data evenly. The preferred partitioner is the RandomPartitioner, which generates a 128-bit consistent hash to determine data placement. The partitioner is assisted by another component called a "snitch," which maps between a node's IP address and its physical location in a rack or data center. When Cassandra writes data, that data is written to multiple nodes so that it remains available in the event of node failure. The nodes to which a given data element is written are called "replica nodes." Cassandra uses the snitch to ensure that the replica nodes for any particular piece of information are not in the same rack. Cassandra's support for replicating across multiple datacenters is best-in-class, providing lower latency for users and the peace of mind of knowing that you can survive regional outages.

Filter reviews