Why does my Amazon EC2 instance exceed its network limits when average utilization is low?

Amazon Web Services · Beginner ·☁️ DevOps & Cloud ·6mo ago

Skills: Cloud Fundamentals80%

Key Takeaways

Amazon EC2 instance network limits, micro bursts, and monitoring using CloudWatch and ENA metrics

Full Transcript

[music] Hello, I'm Rakkesh, a cloud support engineer here at the AWS office in Bangalore. Today I'm going to explain why your Amazon EC2 instance may exceed its network limits even when the average utilization appears low. Let's get started. When you query network performance metrics in real time on instances that support enhanced networking through the elastic network adapter or ENA, you might notice the following metrics. Bandwidth inbound allowance exceeded. The number of packets ceued or dropped due to inbound bandwidth exceeding the maximum. Bandwidth outbound allowance exceeded. The number of packets skewed or dropped due to the outbound bandwidth exceeding the maximum. Packets per second allowance exceeded. The number of packets skewed or dropped due to the packets per second or PPS exceeding the maximum. Connection tracking allowance exceeded the number of packets skewed or dropped due to connections exceeding the maximum trackable. Even if the average bandwidth or PPS shown in the Amazon Cloudatch appears low, you might still see packets being ceued or dropped. The most common reason for this is micro bursts. Microws are short spikes in network demand followed by periods of low activity. Towardatch metrics are sampled every 60 seconds and aggregated over 5 minute periods. So micro burst lasting just seconds or milliseconds might not be fully captured. Let's look at an example. Your instance has 10 Gbps network bandwidth limit. In one 60-cond sample, you transfer 20 GB of outbound data and use up all available bandwidth for around 20 seconds before going idle. While the 5 minute average is low at around.5 Gbps, you still reach the 10 Gbps limit during that micro burst. To properly monitor micro bus, use operating system tools to monitor network statistics more granularly, such as S, network load, or interface top on Linux or performance monitor on Windows. The Cloudatch agent can also publish custom metrics at 1 second resolution. However, this option incurs higher charges. It's also best practice to monitor the ENA metrics directly. The driver version must be 2.2.10 or higher on Linux or 2.2.2 or higher on Windows. Cloudatch agent can publish these ENA metrics as well. There are two ways to monitor ENA metrics. Turn on detailed monitoring. Publish ENA metrics to Cloudatch. To turn on detailed monitoring, follow these steps. Open the Amazon EC2 console. In the navigation pane, choose instances, select the instance, choose actions, select monitor and troubleshoot, and then choose manage detailed monitoring. On the detailed monitoring page, do one of the following. For detailed monitoring, select enable. For basic monitoring, clear enable. Choose confirm. Now, let's publish ENA metrics to Cloudatch. Create new IM role with Amazon SSM managed instance core and Cloudatch agent server policy attached. Attach the newly created IM role to the instance. Log to the EC2 Windows instance. Create a config file config.json at the location C program files Amazon Amazon Cloudatch agent. Restart the Cloudatch agent with the new configuration. To do this, run the a fetch config command in PowerShell as an administrator. To view network performance metrics in the Cloudatch console, follow these steps. Open the Cloudatch console. In the navigation pane, choose metrics. Choose the name space for the metrics collected by the agent. By default, this is Cloudatch agent. Choose a metric dimension such as per instance metrics. To prevent microw traffic at the sender side to avoid exceeding throughput or packet rate limits. This requires application changes and possible OS support. If limits are frequently reached, then consider scaling up to a larger instance size with higher network limits or scaling out across multiple instances. For Linux, some additional mitigations include the following. Using the socket option maximum pacing rate to pace individual sockets. Using queuing disciplines such as fade queueing to smooth traffic bursts. Reducing the kernel transmission Q length from 1024 packets default. Thanks for watching and happy cloud computing from all of us here at AWS. [music]

Original Description

For more details on this topic, visit the AWS Knowledge Center on AWS re:Post and read the full article associated with this video: https://repost.aws/knowledge-center/ec2-instance-exceeding-network-limits The AWS Knowledge Center contains trusted, expert-reviewed answers to frequently asked questions across AWS services — including EC2, S3, IAM, Lambda, Bedrock, and more. Rakesh shows you how to troubleshoot Amazon EC2 instances that exceed network limits when average utilization appears low. 0:00 Introduction 0:34 Understanding Network Performance Metrics 1:32 Microbursts Explained 3:02 Monitoring Setup 5:12 Prevention Strategies 5:50 Closing Subscribe: More AWS videos: https://go.aws/3m5yEMW More AWS events videos: https://go.aws/3ZHq4BK Do you have technical AWS questions? Ask the community of experts on AWS re:Post: https://go.aws/3lPaoPb ABOUT AWS Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers — including the fastest-growing startups, largest enterprises, and leading government agencies — are using AWS to lower costs, become more agile, and innovate faster. #AWS #AmazonWebServices #CloudComputing #awsknowledgecentervideos #AWSCloud #AmazonAWS #KnowledgeCenterVideos #AWSrePost

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Amazon Web Services · Amazon Web Services · 0 of 60

← Previous Next →

Agentic AI Design Patterns Introduction and walkthrough | Amazon Web Services

Agentic AI Design Patterns Introduction and walkthrough | Amazon Web Services

Amazon Web Services

Galileo on modernizing on banking infrastructure | Amazon Web Services

Galileo on modernizing on banking infrastructure | Amazon Web Services

Amazon Web Services

Alliander Speeds Innovation and Energy Transition Using AWS | Amazon Web Services

Alliander Speeds Innovation and Energy Transition Using AWS | Amazon Web Services

Amazon Web Services

AWS and Scuderia Ferrari HP streamline F1 power unit assembly | Amazon Web Services

AWS and Scuderia Ferrari HP streamline F1 power unit assembly | Amazon Web Services

Amazon Web Services

How AWS machine learning supports Scuderia Ferrari HP pit stops | Amazon Web Services

How AWS machine learning supports Scuderia Ferrari HP pit stops | Amazon Web Services

Amazon Web Services

Nasdaq Builds Market Infrastructure of the Future with AWS | Amazon Web Services

Nasdaq Builds Market Infrastructure of the Future with AWS | Amazon Web Services

Amazon Web Services

AWS Security Hub Exposure Findings | Amazon Web Services

AWS Security Hub Exposure Findings | Amazon Web Services

Amazon Web Services

How do I use Session Manager port forwarding to connect to my EC2 instance through RDP?

How do I use Session Manager port forwarding to connect to my EC2 instance through RDP?

Amazon Web Services

How do I extend an EBS volume with LVM partitions?

How do I extend an EBS volume with LVM partitions?

Amazon Web Services

AWS Graviton makes it easy to optimize performance, cost, and sustainability | Amazon Web Services

AWS Graviton makes it easy to optimize performance, cost, and sustainability | Amazon Web Services

Amazon Web Services

Run Cloud Adoption Framework workshops with Miro | Amazon Web Services

Run Cloud Adoption Framework workshops with Miro | Amazon Web Services

Amazon Web Services

Getting Started with AWS Cost Optimization Hub | Amazon Web Services

Getting Started with AWS Cost Optimization Hub | Amazon Web Services

Amazon Web Services

Why did my Amazon SQS messages get sent to a dead-letter queue?

Why did my Amazon SQS messages get sent to a dead-letter queue?

Amazon Web Services

Declarative Policies for EC2 | Amazon Web Services

Declarative Policies for EC2 | Amazon Web Services

Amazon Web Services

How do I troubleshoot IAM permission issues for the Billing and Cost Management console?

How do I troubleshoot IAM permission issues for the Billing and Cost Management console?

Amazon Web Services

Integrity at Scale: Inside the Flo Health Mission | Amazon Web Services

Integrity at Scale: Inside the Flo Health Mission | Amazon Web Services

Amazon Web Services

Fueling Success: Small shifts, powerful performance | Amazon Web Services

Fueling Success: Small shifts, powerful performance | Amazon Web Services

Amazon Web Services

WEX enhances customer experience with AI-powered chatbot | Amazon Web Services

WEX enhances customer experience with AI-powered chatbot | Amazon Web Services

Amazon Web Services

Accelerate troubleshooting with Amazon CloudWatch investigations | Amazon Web Services

Accelerate troubleshooting with Amazon CloudWatch investigations | Amazon Web Services

Amazon Web Services

Why is my Windows WorkSpace stuck in the starting, rebooting, or stopping status?

Why is my Windows WorkSpace stuck in the starting, rebooting, or stopping status?

Amazon Web Services

Telemetry Pipelines for AI | Amazon Web Services

Telemetry Pipelines for AI | Amazon Web Services

Amazon Web Services

Getting Control over Security and Observability Data | Amazon Web Services

Getting Control over Security and Observability Data | Amazon Web Services

Amazon Web Services

The Problem with Telemetry Data Volume | Amazon Web Services

The Problem with Telemetry Data Volume | Amazon Web Services

Amazon Web Services

Telemetry Pipelines on AWS | Amazon Web Services

Telemetry Pipelines on AWS | Amazon Web Services

Amazon Web Services

What are Telemetry Pipelines? | Amazon Web Services

What are Telemetry Pipelines? | Amazon Web Services

Amazon Web Services

Using AI for RegEx on Telemetry Pipelines | Amazon Web Services

Using AI for RegEx on Telemetry Pipelines | Amazon Web Services

Amazon Web Services

Multi-Session Support in the AWS Console | Amazon Web Services

Multi-Session Support in the AWS Console | Amazon Web Services

Amazon Web Services

How CloudHedge delivers assessment with AWS ISV Tooling Program at no cost?

How CloudHedge delivers assessment with AWS ISV Tooling Program at no cost?

Amazon Web Services

How customers speed up migration and modernization to AWS with CloudHedge | Amazon Web Services

How customers speed up migration and modernization to AWS with CloudHedge | Amazon Web Services

Amazon Web Services

Chaos Experiment with Amazon ElastiCache | Amazon Web Services

Chaos Experiment with Amazon ElastiCache | Amazon Web Services

Amazon Web Services

Amazon S3 Access Points: Easily manage access for shared datasets on S3 | Amazon Web Services

Amazon S3 Access Points: Easily manage access for shared datasets on S3 | Amazon Web Services

Amazon Web Services

ElastiCache Valkey 8.0 - Savings and Efficiency | Amazon Web Services

ElastiCache Valkey 8.0 - Savings and Efficiency | Amazon Web Services

Amazon Web Services

Pennymac scales document processing with AWS | Amazon Web Services

Pennymac scales document processing with AWS | Amazon Web Services

Amazon Web Services

AWS | Next Level Innovation | Amazon Web Services

AWS | Next Level Innovation | Amazon Web Services

Amazon Web Services

Driving Cloud Innovation: Mindtickle's Partnership with AWS Enterprise Support | Amazon Web Services

Driving Cloud Innovation: Mindtickle's Partnership with AWS Enterprise Support | Amazon Web Services

Amazon Web Services

A Leader's Edge from Executive Insights | Amazon Web Services

A Leader's Edge from Executive Insights | Amazon Web Services

Amazon Web Services

How do I create a custom Amazon WorkSpaces image?

How do I create a custom Amazon WorkSpaces image?

Amazon Web Services

Charles Leclerc tests his AI-generated race track | Amazon Web Services

Charles Leclerc tests his AI-generated race track | Amazon Web Services

Amazon Web Services

Redington Scales India’s Cloud Access with AWS Partnership | Amazon Web Services

Redington Scales India’s Cloud Access with AWS Partnership | Amazon Web Services

Amazon Web Services

How do I prevent the resources in my CloudFormation stack from getting deleted or updated?

How do I prevent the resources in my CloudFormation stack from getting deleted or updated?

Amazon Web Services

How do I troubleshoot authentication errors when I use RDP to connect to an EC2 Windows instance?

How do I troubleshoot authentication errors when I use RDP to connect to an EC2 Windows instance?

Amazon Web Services

Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services

Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services

Amazon Web Services

Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services

Exploring the Possibilities of Digital Twin & AI at the Edge | Amazon Web Services

Amazon Web Services

AWS at the FORMULA 1 AWS GRAN PREMIO DELL'EMILIA-ROMAGNA 2025 | Amazon Web Services

AWS at the FORMULA 1 AWS GRAN PREMIO DELL'EMILIA-ROMAGNA 2025 | Amazon Web Services

Amazon Web Services

What's new in RCPs | Amazon Web Services

What's new in RCPs | Amazon Web Services

Amazon Web Services

API Caching using Amazon ElastiCache | Amazon Web Services

API Caching using Amazon ElastiCache | Amazon Web Services

Amazon Web Services

Pendula: Amazon Nova Customer Testimonial | Amazon Web Services

Pendula: Amazon Nova Customer Testimonial | Amazon Web Services

Amazon Web Services

InDebted : Amazon Nova Customer Testimonial | Amazon Web Services

InDebted : Amazon Nova Customer Testimonial | Amazon Web Services

Amazon Web Services

Amazon DynamoDB global tables with multi-Region strong consistency | Amazon Web Services

Amazon DynamoDB global tables with multi-Region strong consistency | Amazon Web Services

Amazon Web Services

Siemens Mobility uses AWS to operate securely, efficiently on a global scale | Amazon Web Services

Siemens Mobility uses AWS to operate securely, efficiently on a global scale | Amazon Web Services

Amazon Web Services

How do I reuse a knowledge base session in Amazon Bedrock?

How do I reuse a knowledge base session in Amazon Bedrock?

Amazon Web Services

EP5: MBZUAI, CMU : Causal AI, Answering The “Why“ and “What if“ Questions | AWS for AI Podcast

EP5: MBZUAI, CMU : Causal AI, Answering The “Why“ and “What if“ Questions | AWS for AI Podcast

Amazon Web Services

Hema scales time to market developing a data mesh on AWS (Technical) - Cloud Adventures

Hema scales time to market developing a data mesh on AWS (Technical) - Cloud Adventures

Amazon Web Services

Hema scales time to market developing a data mesh on AWS (Business) - Cloud Adventures

Hema scales time to market developing a data mesh on AWS (Business) - Cloud Adventures

Amazon Web Services

How Langfuse Scaled Their AI Platform with AWS: From Open-Source to Enterprise | Amazon Web Services

How Langfuse Scaled Their AI Platform with AWS: From Open-Source to Enterprise | Amazon Web Services

Amazon Web Services

SLMs and LLMs: What’s the Difference? | Amazon Web Services

SLMs and LLMs: What’s the Difference? | Amazon Web Services

Amazon Web Services

SLMs and LLMs: When to use them? | Amazon Web Services

SLMs and LLMs: When to use them? | Amazon Web Services

Amazon Web Services

SLMs on CPU | Amazon Web Services

SLMs on CPU | Amazon Web Services

Amazon Web Services

Intelligent Model Routing | Amazon Web Services

Intelligent Model Routing | Amazon Web Services

Amazon Web Services

SLMs, LLMs, and Model Routing in Agents | Amazon Web Services

SLMs, LLMs, and Model Routing in Agents | Amazon Web Services

Amazon Web Services

This video explains why Amazon EC2 instances may exceed network limits despite low average utilization, and provides steps to monitor and troubleshoot network performance using CloudWatch and ENA metrics.

Key Takeaways

Query network performance metrics in real-time
Monitor for micro bursts using operating system tools or CloudWatch agent
Turn on detailed monitoring
Publish ENA metrics to CloudWatch
Create a new IAM role with necessary policies
Attach the IAM role to the instance
Configure the CloudWatch agent
View network performance metrics in the CloudWatch console

💡 Micro bursts can cause network limits to be exceeded even if average utilization is low, and monitoring at a more granular level is necessary to detect and troubleshoot these issues.

🔒 Pro feature: Ask AI to explain this lesson →

More on: Cloud Fundamentals

View skill →

GKE Backup and Restore

HTTPS Content-Based Load Balancer with Terraform

Managing SMB Workloads and Optimizing Storage Usage with NetApp Cloud Manager & Cloud Volumes ONTAP

Set Up Network and HTTP Load Balancers

GKE Workload Optimization

Google AppSheet: Getting Started

Related Reads

Beyond Auto-Restart: Building a True Self-Healing Architecture with Runbooks and AI Agents

Learn to build a self-healing architecture using runbooks and AI agents to minimize downtime and automate recovery

Medium · DevOps

55 Percent of Enterprise Storage Is Dead Weight

55% of enterprise storage is unused, resulting in $350 billion waste, and optimizing storage can help reduce costs

Medium · DevOps

Kiro CLI Gets Worse After an Hour. Here's How I Fixed It.

Improve Kiro CLI performance in long sessions with context management techniques for a smoother AWS workflow

Dev.to · Sarvar Nadaf

Scarab Diagnostic Field Test #038 - pnpm GitHub Actions Setup State Boundary

Learn to set up pnpm with GitHub Actions for efficient package management and automated workflows

Dev.to · Scarab Systems

Chapters (6)

Introduction

0:34 Understanding Network Performance Metrics

1:32 Microbursts Explained

3:02 Monitoring Setup

5:12 Prevention Strategies

5:50 Closing

Containers on Amazon ECS with Mama J