Using AI for RegEx on Telemetry Pipelines | Amazon Web Services
Learn how AI-powered pipelines can recommend the right regular expressions for large-volume telemetry data to save time.
Learn more at - http://go.aws/45ECtNJ
What You'll Learn
The video demonstrates how AI-powered pipelines can recommend regular expressions for large-volume telemetry data, saving time and effort. It showcases Edge Delta's solution, which feeds small samples of data to AI and returns recommendations for filtering, masking, or hashing sensitive information.
Full Transcript
Hello there. I'm Nolan Chen, partner solutions architect for AWS.
And I'm Ozan Unlu, founder at Edge Delta.
Ozan, earlier in my career, when I was a Linux programmer, I remember I often had to write complicated and tedious regex expressions to make sure we collected and filtered the right logs, traces, and metrics from our Linux systems. Can you tell us how AI today can now make that job easier?
Absolutely. Let's take a standard log file, or even a log message within a file. Regex and logs are as old as time. So if we think about a regular log file, it's going to have lots of different messages, and each of those messages is going to have different fields or attributes within that log. And once you can identify the various attributes, one thing that you typically did, as you probably have a lot of experience with, is write regular expressions to either filter, parse, or transform that data. So let's take an IP address, for example. Maybe you want to cut out that IP address, whether you're masking or hashing that data, because you don't need it for a downstream system. That is a very typical thing that you might do, and you're going to have to write a regular expression to match that piece of data. The other thing you might want to do is pre-aggregate. So you might want to look at what a specific error message is and what metadata or attributes are associated with it, and you might want to pull out that error message as a different field as well. The challenge ends up being that you have potentially billions of log messages coming through. Those could be in hundreds or thousands of different formats. And for a human to have to sit there and manually write all these regular expressions has been quite difficult. Your team moves on, new systems are onboarded, new sources are onboarded, and new destinations have various different requirements.
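To make the manual approach described above concrete, here is a minimal Python sketch of the three tasks mentioned: masking an IP address, hashing it, and pulling the message out of a log line as its own field. It is an illustration only, not Edge Delta's implementation. The log layout ("<timestamp> <LEVEL> <message>"), the function names, and the placeholder text are all assumptions made for the example, and the IPv4 pattern is deliberately simplified.

```python
import hashlib
import re
from typing import Dict, Optional

# Simplified IPv4 matcher: four dot-separated groups of 1-3 digits.
# It does not reject out-of-range octets such as 999.
IPV4 = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")

# Assumed (hypothetical) log layout: "<timestamp> <LEVEL> <message>".
LOG_LINE = re.compile(r"^(?P<ts>\S+) (?P<level>[A-Z]+) (?P<message>.*)$")


def mask_ips(line: str, placeholder: str = "[REDACTED_IP]") -> str:
    """Replace every IPv4-looking token with a fixed placeholder."""
    return IPV4.sub(placeholder, line)


def hash_ips(line: str) -> str:
    """Replace every IPv4-looking token with a truncated SHA-256 digest.

    The same address always maps to the same token, so downstream systems
    can still group by address without seeing the raw value.
    """
    def digest(match: "re.Match[str]") -> str:
        return hashlib.sha256(match.group(0).encode("utf-8")).hexdigest()[:12]

    return IPV4.sub(digest, line)


def parse_line(line: str) -> Optional[Dict[str, str]]:
    """Split a line into ts, level, and message fields, or None if no match."""
    match = LOG_LINE.match(line)
    return match.groupdict() if match else None


if __name__ == "__main__":
    sample = "2024-05-01T12:00:00Z ERROR connection from 10.1.2.3 refused"
    print(mask_ips(sample))
    print(hash_ips(sample))
    print(parse_line(sample))
```

Each new log format needs its own version of LOG_LINE, which is the maintenance burden the speakers describe once formats number in the hundreds or thousands.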
And so one of the things that pipelines can actually do is give you a tremendous amount of value in automating this process. One thing that Edge Delta has done is take small samples of your data and feed them into AI. What AI can do with those samples is recommend certain things, like masks or filters or hashing certain information, and it will actually apply that within the pipeline as a recommendation for you as a user to accept. So in one of those instances, going back to the first example, it can be an IP address that the system has identified and said, this might potentially be PII data. Do you want to filter it out? You can apply that filter right here, and it automatically applies, so in all the data going out, that information is masked at that point. This is useful if you're going to send this data to S3, or you could be sending it into your SIEM, or into an alerting system or Slack or PagerDuty or ServiceNow. There could be a lot of different systems you're sending this data to. To be able to have these AI insights that work in stream, in real time, to give you recommendations for how you want to be transforming or filtering or optimizing your data sets, that's one thing that we've seen to be extremely valuable for engineers like yourself who might have been writing regex for decades but might not want to do that anymore.
Thanks, Ozan. The hard part wasn't actually writing it. It was trying to analyze all that data to figure out what to write in the first place. So if I understand correctly, with AI-powered pipelines in the middle, you can now automatically get those recommendations. I can look at those recommendations. I still have to manually implement it myself, but now I've been saved who knows how many hours of having to pore through thousands and thousands of lines of telemetry data to figure out the right regex expressions.
Absolutely.
And I think today you have to manually implement it. Maybe tomorrow we automate that part as well. But I think that's the key for a user such as yourself. If you're the system administrator, if you're the person in charge and it's your responsibility to ensure that all this data gets from source to destination in the right format, then of course you want to be able to use a tool like AI that can recommend certain things, and then you get to be the master of your data. You get to choose any source, any destination, and specifically any format.
Thank you, Ozan. I appreciate you telling us today about the power of AI and how it can help you make better regex expressions for your telemetry pipelines.
Thank you.
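The recommend-then-accept workflow described in the transcript can be sketched roughly as follows. This is a hedged illustration under stated assumptions, not how Edge Delta works internally: the AI step is replaced by a small fixed catalog of candidate patterns (CANDIDATES), and the names Recommendation, recommend, and apply_accepted are hypothetical. The sketch only shows the shape of the flow: sample a few lines, propose rules, let the user choose, then apply the accepted rules to the stream.

```python
import re
from dataclasses import dataclass
from typing import Iterable, Iterator, List

# Hypothetical catalog of candidate patterns. In the workflow described in
# the transcript an AI model proposes the rules; this catalog is a stand-in.
CANDIDATES = {
    "ipv4": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
}


@dataclass
class Recommendation:
    name: str            # label of the candidate rule
    pattern: re.Pattern  # compiled regex the rule would apply
    hits: int            # number of sampled lines containing a match


def recommend(sample: List[str]) -> List[Recommendation]:
    """Return one recommendation per candidate that matches any sampled line."""
    recs = []
    for name, pattern in CANDIDATES.items():
        hits = sum(1 for line in sample if pattern.search(line))
        if hits:
            recs.append(Recommendation(name, pattern, hits))
    return recs


def apply_accepted(lines: Iterable[str],
                   accepted: List[Recommendation]) -> Iterator[str]:
    """Mask matches of every accepted recommendation in a stream of lines."""
    for line in lines:
        for rec in accepted:
            line = rec.pattern.sub("[" + rec.name.upper() + "]", line)
        yield line


if __name__ == "__main__":
    sample = ["GET /health from 10.0.0.7", "login ok for alice@example.com"]
    recs = recommend(sample)
    # The user stays in control: here only the IPv4 rule is accepted.
    accepted = [r for r in recs if r.name == "ipv4"]
    for out in apply_accepted(["conn 192.168.1.9 closed"], accepted):
        print(out)
```

Keeping acceptance as an explicit step mirrors the point made in the conversation: the tool proposes, and the person responsible for the data decides what is applied.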