
Amazon ECS Telegraf Plugin

Amazon Elastic Container Service (Amazon ECS) is a fully managed container orchestration service that makes running your containerized environment secure, reliable, and available. It supports serverless options like AWS Fargate and is integrated with a number of Amazon services like Amazon SageMaker, AWS Batch, Amazon Lex, and AWS App Mesh.
Why use a Telegraf plugin for Amazon ECS?
In order to maintain a reliable, available and performant instance of Amazon ECS and your other AWS solutions, you need to collect metrics and events from all components of your AWS solution. This will allow you to easily pinpoint the area that may be causing a failure. The Amazon ECS Telegraf Input Plugin helps you easily pull metrics that let you know how well your Amazon ECS is performing. It collects metrics about the cluster, the tasks, memory and cpu consumption and more. Pair this with one of the many Telegraf plugins to monitor the application in the containers, and you will gain full visibility into your stack.
How to monitor Amazon ECS using the Telegraf plugin
The Amazon ECS Telegraf plugin is Amazon ECS and Amazon Fargate compatible and uses the Amazon ECS metadata and stats v2 or [v3][task-metadata-endpoint-v3] API endpoints to gather metrics on the running containers in a Task. Please note that the telegraf container must be run in the same Task as the workload it is inspecting. This is similar to the Docker input plugin, with some ECS specific modifications for AWS metadata and stats formats.
Key Amazon ECS metrics to use for monitoring
Some of the important Amazon ECS metrics that you should proactively monitor include:
- ECS task metrics
- Tags:
cluster
,task_arn
,family
,revision
,id
,name
- Fields:
revision (string)
,desired_status (string)
,known_status (string)
,limit_cpu (float)
,limit_mem (float)
- Tags:
- ECS container metrics
- Tags:
cluster
,task_arn
,family
,revision
,id
,name
- Fields:
container_id
,active_anon
,active_file
,cache
,hierarchical_memory_limit
,inactive_anon
,Inactive_file
,mapped_file
,pgfault
Pgmajfault
,pgpgin
,pgpgout
,rss
,rss_huge
Total_active_anon
,total_active_file
,total_cache
,total_inactive_anon
,total_inactive_file
,Total_mapped_file
,Total_pgfault
,Total_pgmajfault
,total_pgpgin
,total_pgpgout
,Total_rss
,Total_rss_huge
,total_unevictable
,Total_writeback
,Unevictable
,writeback
,Fail_count
,limit
,max_usage
,usage
,usage_percent
- Tags:
- ECS container cpu metrics
- Tags:
cluster
,task_arn
,family
,revision
,id
,name
,usage_total
,usage_in_usermode
,usage_in_kernelmode
Usage_system
,throttling_periods
,throttling_throttled_periods
,throttling_throttled_time
,usage_percent
,usage_total
- Fields:
container_id
- Tags:
- ECS container net metrics
- Tags:
cluster
,task_arn
,family
,revision
,id
,name
- Fields:
container_id
,rx_packets
,rx_dropped
,rx_bytes
,rx_errors
,tx_packets
,tx_dropped
,tx_bytes
,tx_errors
- Tags:
- ECS container blkio metrics
- Tags:
cluster
,task_arn
,family
,revision
,id
,name
- Fields:
container_id
,io_service_bytes_recursive_async
,io_service_bytes_recursive_read
,io_service_bytes_recursive_sync
,io_service_bytes_recursive_total
,io_service_bytes_recursive_write
,io_serviced_recursive_async
,io_serviced_recursive_read
,io_serviced_recursive_sync
,io_serviced_recursive_total
,io_serviced_recursive_write
- Tags:
- ECS container meta metrics
- Tags:
cluster
,task_arn
,family
,revision
,id
,name
- Fields:
container_id
,docker_name
,image
,Image_id
,desired_status
,known_status
,Limit_cpu
,limit_mem
,created_at
,started_at
,type
- Tags:
For more information, please check out the documentation.
Related Resources
AWS & InfluxData partnership
Learn how you can get more with this powerful partnership.
Customer Case Study: Mulesoft
Learn how MuleSoft uses InfluxDB Cloud to help power its monitoring and diagnostic solutions.
AWS Cloudwatch InfluxDB Template
Get started with your own AWS CloudWatch monitoring solutions with this template.