Why does my Amazon EC2 instance exceed its network limits when average utilization is low?

Amazon Web Services · Beginner ·☁️ DevOps & Cloud ·6mo ago

Key Takeaways

Amazon EC2 instance network limits, micro bursts, and monitoring using CloudWatch and ENA metrics

Full Transcript

[music] Hello, I'm Rakkesh, a cloud support engineer here at the AWS office in Bangalore. Today I'm going to explain why your Amazon EC2 instance may exceed its network limits even when the average utilization appears low. Let's get started. When you query network performance metrics in real time on instances that support enhanced networking through the elastic network adapter or ENA, you might notice the following metrics. Bandwidth inbound allowance exceeded. The number of packets ceued or dropped due to inbound bandwidth exceeding the maximum. Bandwidth outbound allowance exceeded. The number of packets skewed or dropped due to the outbound bandwidth exceeding the maximum. Packets per second allowance exceeded. The number of packets skewed or dropped due to the packets per second or PPS exceeding the maximum. Connection tracking allowance exceeded the number of packets skewed or dropped due to connections exceeding the maximum trackable. Even if the average bandwidth or PPS shown in the Amazon Cloudatch appears low, you might still see packets being ceued or dropped. The most common reason for this is micro bursts. Microws are short spikes in network demand followed by periods of low activity. Towardatch metrics are sampled every 60 seconds and aggregated over 5 minute periods. So micro burst lasting just seconds or milliseconds might not be fully captured. Let's look at an example. Your instance has 10 Gbps network bandwidth limit. In one 60-cond sample, you transfer 20 GB of outbound data and use up all available bandwidth for around 20 seconds before going idle. While the 5 minute average is low at around.5 Gbps, you still reach the 10 Gbps limit during that micro burst. To properly monitor micro bus, use operating system tools to monitor network statistics more granularly, such as S, network load, or interface top on Linux or performance monitor on Windows. The Cloudatch agent can also publish custom metrics at 1 second resolution. However, this option incurs higher charges. It's also best practice to monitor the ENA metrics directly. The driver version must be 2.2.10 or higher on Linux or 2.2.2 or higher on Windows. Cloudatch agent can publish these ENA metrics as well. There are two ways to monitor ENA metrics. Turn on detailed monitoring. Publish ENA metrics to Cloudatch. To turn on detailed monitoring, follow these steps. Open the Amazon EC2 console. In the navigation pane, choose instances, select the instance, choose actions, select monitor and troubleshoot, and then choose manage detailed monitoring. On the detailed monitoring page, do one of the following. For detailed monitoring, select enable. For basic monitoring, clear enable. Choose confirm. Now, let's publish ENA metrics to Cloudatch. Create new IM role with Amazon SSM managed instance core and Cloudatch agent server policy attached. Attach the newly created IM role to the instance. Log to the EC2 Windows instance. Create a config file config.json at the location C program files Amazon Amazon Cloudatch agent. Restart the Cloudatch agent with the new configuration. To do this, run the a fetch config command in PowerShell as an administrator. To view network performance metrics in the Cloudatch console, follow these steps. Open the Cloudatch console. In the navigation pane, choose metrics. Choose the name space for the metrics collected by the agent. By default, this is Cloudatch agent. Choose a metric dimension such as per instance metrics. To prevent microw traffic at the sender side to avoid exceeding throughput or packet rate limits. This requires application changes and possible OS support. If limits are frequently reached, then consider scaling up to a larger instance size with higher network limits or scaling out across multiple instances. For Linux, some additional mitigations include the following. Using the socket option maximum pacing rate to pace individual sockets. Using queuing disciplines such as fade queueing to smooth traffic bursts. Reducing the kernel transmission Q length from 1024 packets default. Thanks for watching and happy cloud computing from all of us here at AWS. [music]

Original Description

For more details on this topic, visit the AWS Knowledge Center on AWS re:Post and read the full article associated with this video: https://repost.aws/knowledge-center/ec2-instance-exceeding-network-limits The AWS Knowledge Center contains trusted, expert-reviewed answers to frequently asked questions across AWS services — including EC2, S3, IAM, Lambda, Bedrock, and more. Rakesh shows you how to troubleshoot Amazon EC2 instances that exceed network limits when average utilization appears low. 0:00 Introduction 0:34 Understanding Network Performance Metrics 1:32 Microbursts Explained 3:02 Monitoring Setup 5:12 Prevention Strategies 5:50 Closing Subscribe: More AWS videos: https://go.aws/3m5yEMW More AWS events videos: https://go.aws/3ZHq4BK Do you have technical AWS questions? Ask the community of experts on AWS re:Post: https://go.aws/3lPaoPb ABOUT AWS Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers — including the fastest-growing startups, largest enterprises, and leading government agencies — are using AWS to lower costs, become more agile, and innovate faster. #AWS #AmazonWebServices #CloudComputing #awsknowledgecentervideos #AWSCloud #AmazonAWS #KnowledgeCenterVideos #AWSrePost
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Amazon Web Services · Amazon Web Services · 0 of 60

← Previous Next →
1 Agentic AI Design Patterns Introduction and walkthrough | Amazon Web Services
Agentic AI Design Patterns Introduction and walkthrough | Amazon Web Services
Amazon Web Services
2 Galileo on modernizing on banking infrastructure | Amazon Web Services
Galileo on modernizing on banking infrastructure | Amazon Web Services
Amazon Web Services
3 Alliander Speeds Innovation and Energy Transition Using AWS | Amazon Web Services
Alliander Speeds Innovation and Energy Transition Using AWS | Amazon Web Services
Amazon Web Services
4 AWS and Scuderia Ferrari HP streamline F1 power unit assembly | Amazon Web Services
AWS and Scuderia Ferrari HP streamline F1 power unit assembly | Amazon Web Services
Amazon Web Services
5 How AWS machine learning supports Scuderia Ferrari HP pit stops | Amazon Web Services
How AWS machine learning supports Scuderia Ferrari HP pit stops | Amazon Web Services
Amazon Web Services
6 Nasdaq Builds Market Infrastructure of the Future with AWS | Amazon Web Services
Nasdaq Builds Market Infrastructure of the Future with AWS | Amazon Web Services
Amazon Web Services
7 AWS Security Hub Exposure Findings | Amazon Web Services
AWS Security Hub Exposure Findings | Amazon Web Services
Amazon Web Services
8 How do I use Session Manager port forwarding to connect to my EC2 instance through RDP?
How do I use Session Manager port forwarding to connect to my EC2 instance through RDP?
Amazon Web Services
9 How do I extend an EBS volume with LVM partitions?
How do I extend an EBS volume with LVM partitions?
Amazon Web Services
10 AWS Graviton makes it easy to optimize performance, cost, and sustainability | Amazon Web Services
AWS Graviton makes it easy to optimize performance, cost, and sustainability | Amazon Web Services
Amazon Web Services
11 Run Cloud Adoption Framework workshops with Miro | Amazon Web Services
Run Cloud Adoption Framework workshops with Miro | Amazon Web Services
Amazon Web Services
12 Getting Started with AWS Cost Optimization Hub | Amazon Web Services
Getting Started with AWS Cost Optimization Hub | Amazon Web Services
Amazon Web Services
13 Why did my Amazon SQS messages get sent to a dead-letter queue?
Why did my Amazon SQS messages get sent to a dead-letter queue?
Amazon Web Services
14 Declarative Policies for EC2 | Amazon Web Services
Declarative Policies for EC2 | Amazon Web Services
Amazon Web Services
15 How do I troubleshoot IAM permission issues for the Billing and Cost Management console?
How do I troubleshoot IAM permission issues for the Billing and Cost Management console?
Amazon Web Services
16 Integrity at Scale: Inside the Flo Health Mission | Amazon Web Services
Integrity at Scale: Inside the Flo Health Mission | Amazon Web Services
Amazon Web Services
17 Fueling Success: Small shifts, powerful performance | Amazon Web Services
Fueling Success: Small shifts, powerful performance | Amazon Web Services
Amazon Web Services
18 WEX enhances customer experience with AI-powered chatbot | Amazon Web Services
WEX enhances customer experience with AI-powered chatbot | Amazon Web Services
Amazon Web Services
19 Accelerate troubleshooting with Amazon CloudWatch investigations | Amazon Web Services
Accelerate troubleshooting with Amazon CloudWatch investigations | Amazon Web Services
Amazon Web Services
20 Why is my Windows WorkSpace stuck in the starting, rebooting, or stopping status?
Why is my Windows WorkSpace stuck in the starting, rebooting, or stopping status?
Amazon Web Services
21 Telemetry Pipelines for AI | Amazon Web Services
Telemetry Pipelines for AI | Amazon Web Services
Amazon Web Services
22 Getting Control over Security and Observability Data | Amazon Web Services
Getting Control over Security and Observability Data | Amazon Web Services
Amazon Web Services
23 The Problem with Telemetry Data Volume | Amazon Web Services
The Problem with Telemetry Data Volume | Amazon Web Services
Amazon Web Services
24 Telemetry Pipelines on AWS | Amazon Web Services
Telemetry Pipelines on AWS | Amazon Web Services
Amazon Web Services
25 What are Telemetry Pipelines? | Amazon Web Services
What are Telemetry Pipelines? | Amazon Web Services
Amazon Web Services
26 Using AI for RegEx on Telemetry Pipelines | Amazon Web Services
Using AI for RegEx on Telemetry Pipelines | Amazon Web Services
Amazon Web Services
27 Multi-Session Support in the AWS Console | Amazon Web Services
Multi-Session Support in the AWS Console | Amazon Web Services
Amazon Web Services
28 How CloudHedge delivers assessment with AWS ISV Tooling Program at no cost?
How CloudHedge delivers assessment with AWS ISV Tooling Program at no cost?
Amazon Web Services
29 How customers speed up migration and modernization to AWS with CloudHedge | Amazon Web Services
How customers speed up migration and modernization to AWS with CloudHedge | Amazon Web Services
Amazon Web Services
30 Chaos Experiment with Amazon ElastiCache | Amazon Web Services
Chaos Experiment with Amazon ElastiCache | Amazon Web Services
Amazon Web Services
31 Amazon S3 Access Points: Easily manage access for shared datasets on S3 | Amazon Web Services
Amazon S3 Access Points: Easily manage access for shared datasets on S3 | Amazon Web Services
Amazon Web Services
32 ElastiCache Valkey 8.0 - Savings and Efficiency | Amazon Web Services
ElastiCache Valkey 8.0 - Savings and Efficiency | Amazon Web Services
Amazon Web Services
33 Pennymac scales document processing with AWS | Amazon Web Services
Pennymac scales document processing with AWS | Amazon Web Services
Amazon Web Services
34 AWS | Next Level Innovation | Amazon Web Services
AWS | Next Level Innovation | Amazon Web Services
Amazon Web Services
35 Driving Cloud Innovation: Mindtickle's Partnership with AWS Enterprise Support | Amazon Web Services
Driving Cloud Innovation: Mindtickle's Partnership with AWS Enterprise Support | Amazon Web Services
Amazon Web Services
36 A Leader's Edge from Executive Insights | Amazon Web Services
A Leader's Edge from Executive Insights | Amazon Web Services
Amazon Web Services
37 How do I create a custom Amazon WorkSpaces image?
How do I create a custom Amazon WorkSpaces image?
Amazon Web Services
38 Charles Leclerc tests his AI-generated race track | Amazon Web Services
Charles Leclerc tests his AI-generated race track | Amazon Web Services
Amazon Web Services
39 Redington Scales India’s Cloud Access with AWS Partnership | Amazon Web Services
Redington Scales India’s Cloud Access with AWS Partnership | Amazon Web Services
Amazon Web Services
40 How do I prevent the resources in my CloudFormation stack from getting deleted or updated?
How do I prevent the resources in my CloudFormation stack from getting deleted or updated?
Amazon Web Services
41 How do I troubleshoot authentication errors when I use RDP to connect to an EC2 Windows instance?
How do I troubleshoot authentication errors when I use RDP to connect to an EC2 Windows instance?
Amazon Web Services
42 Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services
Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services
Amazon Web Services
43 Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services
Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services
Amazon Web Services
44 AWS at the FORMULA 1 AWS GRAN PREMIO DELL'EMILIA-ROMAGNA 2025 | Amazon Web Services
AWS at the FORMULA 1 AWS GRAN PREMIO DELL'EMILIA-ROMAGNA 2025 | Amazon Web Services
Amazon Web Services
45 What's new in RCPs | Amazon Web Services
What's new in RCPs | Amazon Web Services
Amazon Web Services
46 API Caching using Amazon ElastiCache | Amazon Web Services
API Caching using Amazon ElastiCache | Amazon Web Services
Amazon Web Services
47 Pendula: Amazon Nova Customer Testimonial | Amazon Web Services
Pendula: Amazon Nova Customer Testimonial | Amazon Web Services
Amazon Web Services
48 InDebted : Amazon Nova Customer Testimonial | Amazon Web Services
InDebted : Amazon Nova Customer Testimonial | Amazon Web Services
Amazon Web Services
49 Amazon DynamoDB global tables with multi-Region strong consistency | Amazon Web Services
Amazon DynamoDB global tables with multi-Region strong consistency | Amazon Web Services
Amazon Web Services
50 Siemens Mobility uses AWS to operate securely, efficiently on a global scale | Amazon Web Services
Siemens Mobility uses AWS to operate securely, efficiently on a global scale | Amazon Web Services
Amazon Web Services
51 How do I reuse a knowledge base session in Amazon Bedrock?
How do I reuse a knowledge base session in Amazon Bedrock?
Amazon Web Services
52 EP5: MBZUAI, CMU : Causal AI, Answering The “Why“ and “What if“ Questions | AWS for AI Podcast
EP5: MBZUAI, CMU : Causal AI, Answering The “Why“ and “What if“ Questions | AWS for AI Podcast
Amazon Web Services
53 Hema scales time to market developing a data mesh on AWS (Technical) - Cloud Adventures
Hema scales time to market developing a data mesh on AWS (Technical) - Cloud Adventures
Amazon Web Services
54 Hema scales time to market developing a data mesh on AWS (Business) - Cloud Adventures
Hema scales time to market developing a data mesh on AWS (Business) - Cloud Adventures
Amazon Web Services
55 How Langfuse Scaled Their AI Platform with AWS: From Open-Source to Enterprise | Amazon Web Services
How Langfuse Scaled Their AI Platform with AWS: From Open-Source to Enterprise | Amazon Web Services
Amazon Web Services
56 SLMs and LLMs: What’s the Difference? | Amazon Web Services
SLMs and LLMs: What’s the Difference? | Amazon Web Services
Amazon Web Services
57 SLMs and LLMs: When to use them? | Amazon Web Services
SLMs and LLMs: When to use them? | Amazon Web Services
Amazon Web Services
58 SLMs on CPU | Amazon Web Services
SLMs on CPU | Amazon Web Services
Amazon Web Services
59 Intelligent Model Routing | Amazon Web Services
Intelligent Model Routing | Amazon Web Services
Amazon Web Services
60 SLMs, LLMs, and Model Routing in Agents | Amazon Web Services
SLMs, LLMs, and Model Routing in Agents | Amazon Web Services
Amazon Web Services

This video explains why Amazon EC2 instances may exceed network limits despite low average utilization, and provides steps to monitor and troubleshoot network performance using CloudWatch and ENA metrics.

Key Takeaways
  1. Query network performance metrics in real-time
  2. Monitor for micro bursts using operating system tools or CloudWatch agent
  3. Turn on detailed monitoring
  4. Publish ENA metrics to CloudWatch
  5. Create a new IAM role with necessary policies
  6. Attach the IAM role to the instance
  7. Configure the CloudWatch agent
  8. View network performance metrics in the CloudWatch console
💡 Micro bursts can cause network limits to be exceeded even if average utilization is low, and monitoring at a more granular level is necessary to detect and troubleshoot these issues.

Related Reads

📰
What Is an MCP Registry? (And the NxM Problem It Solves)
Learn about MCP registries and how they solve the NxM problem by providing a centralized catalog of MCP servers
Dev.to · Sahajmeet Kaur
📰
Built a suite of client-side dev tools to fix the "production data" privacy gap
Learn how to build client-side dev tools to address production data privacy gaps and improve development efficiency
Dev.to · Rayan Ahmad
📰
5 Best BrowserStack Alternatives to Optimize Your Testing Infrastructure
Discover the top 5 BrowserStack alternatives to optimize testing infrastructure for better execution speed, pricing, and test management
Medium · DevOps
📰
️ The Lifecycle Symphony: A Senior SRE’s Deep Dive into Init and Sidecar Containers
Learn how to optimize container initialization and sidecar containers for resilient multi-cloud platforms
Medium · DevOps

Chapters (6)

Introduction
0:34 Understanding Network Performance Metrics
1:32 Microbursts Explained
3:02 Monitoring Setup
5:12 Prevention Strategies
5:50 Closing
Up next
Containers on Amazon ECS with Mama J
AWS Developers
Watch →