Apache Cassandra is an open source distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. Cassandra offers robust support for clusters spanning multiple datacenters, with asynchronous masterless replication allowing low latency operations for all clients. Cassandra originated at Facebook as a project based on Amazon’s Dynamo and Google’s BigTable, and has since matured into a widely adopted open source system.
Why use a Telegraf plugin for Cassandra?
Apache Cassandra databases are used to store large amounts of data, and this is one of the reasons why these databases are fault-tolerant and highly scalable. Because the data is important to your applications, monitoring your Cassandra cluster for availability, health, and performance can be done with the Cassandra Telegraf Plugin.
Please note: This Cassandra Telegraf Plugin was deprecated in version 1.7, and we recommend using the jolokia2 Telegraf Plugin with the cassandra.conf example configuration.
How to monitor Cassandra using the Telegraf plugin
Configuring the Cassandra Telegraf plugin is simple. Configure it with the context root used for jolokia url, the list of servers with the format "user:[email protected]:port", and the list of Jmx paths that identify mbeans attributes. The Cassandra plugin collects Cassandra 3 / JVM metrics exposed as MBean's attributes through jolokia REST endpoint. All metrics are collected for each server configured.
Key Cassandra Metrics to use for monitoring
Some of the important Cassandra metrics that you should proactively monitor include:
javaGarbageCollector(collection time, collection count)
cassandraCache(cache hits, requests, entries, size, capacity)
cassandraClientRequest(Total latency, latency, timeouts, unavailable, failures)
cassandraCommitLog(pending tasks, total commit log size)
cassandraCompaction(Completed tasks, pending tasks, total compaction completed, bytes compacted)
cassandraTable(Live Disk Space Used, Total Disk Space Used, Read Latency, Coordinator Read Latency, Write Latency, Read Total Latency, Write Total Latency)
cassandraThreadPools(Active tasks, pending tasks, currently blocked tasks)