Cassandra-distributions-and-variants

Apache Cassandra Lunch #11 : Different Cassandra Distributions and Variants

In case you missed Cassandra Lunch #11, we discussed 10 different Cassandra distributions and variants. In this blog, we will give a quick introduction to those 10 distributions and variants. We have also linked a video of Cassandra Lunch #11 if you want a more in-depth discussion about these 10 Cassandra distributions and variants.

The Cassandra distributions and variants that we will be introducing range over the following categories:

  • Cassandra / Cassandra Compliant Databases on JVM
  • Cassandra Compliant Databases on C++
  • Cassandra as a Service / Managed Cassandra Based on Open Source Cassandra
  • Cassandra as a Service / Managed Cassandra Based on Proprietary Technology

Cassandra / Cassandra Compliant Databases on JVM:

  • Apache Cassandra
    • The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra’s support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages.
  • Datastax Enterprise
    • Most widely used commercial distribution of Apache Cassandra, integrated with Apache Spark (for SparkSQL, analytics), Apache Solr (for secondary index), Apache TinkerPop based Graph stored in Cassandra, and OpsCenter.
  • Elassandra
    • The open source database and search engine for multi-cloud applications. Elassandra simplifies your data stack by integrating Elasticsearch® into Apache Cassandra®.

Cassandra Compliant Databases on C++:

  • Scylla
    • Scylla is a drop-in Apache Cassandra alternative big data database that powers your applications with ultra-low latency and extremely high throughput, while reducing TCO to a fraction of most NoSQL databases. Reimplemented Apache Cassandra from scratch using C++ instead of Java to increase the raw performance and utilization of modern multi-core servers and, through self-tuning and improved uptime, minimize the overhead to DevOps. Scylla provides the NoSQL database platform your applications require to scale out and up linearly.
  • Yugabyte
    • YugaByteDB is a transactional, high-performance database for building distributed cloud services. It supports Cassandra-compatible and Redis-compatible APIs, with PostgreSQL in Beta.

Cassandra as a Service / Managed Cassandra Based on Open Source Cassandra:

  • Datastax Astra
    • DataStax Astra simplifies cloud-native Cassandra application development. It reduces deployment time from weeks to minutes, removing the biggest obstacle to using Cassandra, which is behind many of the most heavily used applications in the world.
  • Instaclustr
    • Instaclustr provides a fully managed and SOC 2 certified hosted & managed service for Apache Cassandra® on AWS, Azure, GCP and IBM Cloud.
  • Aiven
    • Aiven for Apache Cassandra is a managed and hosted distributed NoSQL database providing scalability, high availability, and excellent fault tolerance. Cassandra as a Service is available on Google Cloud Platform, Amazon Web Services, Microsoft Azure, DigitalOcean, and UpCloud.

Cassandra as a Service / Managed Cassandra Based on Proprietary Technology:

  • Microsoft Cosmos DB
    • Azure Cosmos DB is a fully managed NoSQL database service for modern app development with guaranteed single-digit millisecond response times and 99.999-percent availability backed by SLAs, automatic and instant scalability, and open source APIs for MongoDB and Cassandra. Enjoy fast writes and reads anywhere in the world with turnkey multi-master global distribution.
  • Amazon Keyspaces
    • Amazon Keyspaces (for Apache Cassandra) is a scalable, highly available, and managed Apache Cassandra–compatible database service. With Amazon Keyspaces, you can run your Cassandra workloads on AWS using the same Cassandra application code and developer tools that you use today. You don’t have to provision, patch, or manage servers, and you don’t have to install, maintain, or operate software. Amazon Keyspaces is serverless, so you pay for only the resources you use and the service can automatically scale tables up and down in response to application traffic. You can build applications that serve thousands of requests per second with virtually unlimited throughput and storage. Data is encrypted by default and Amazon Keyspaces enables you to back up your table data continuously using point-in-time recovery. Amazon Keyspaces gives you the performance, elasticity, and enterprise features you need to operate business-critical Cassandra workloads at scale.

If you would like a more in-depth discussion about these Cassandra distributions and variants, you can watch this video below:

Here is an accompanied slide deck as well:

Cassandra.Link

Cassandra.Link is a knowledge base that we created for all things Apache Cassandra. Our goal with Cassandra.Link was to not only fill the gap of Planet Cassandra, but to bring the Cassandra community together. Feel free to reach out if you wish to collaborate with us on this project in any capacity.

We are a technology company that specializes in building business platforms. If you have any questions about the tools discussed in this post or about any of our services, feel free to send us an email!