Future of AI

AI Safety & Ethics

Alignment, interpretability, AI risks, and building safe AI systems

6,155
lessons
Skills in this topic
View full skill map →
AI Alignment Basics
beginner
Explain the alignment problem
AI Ethics & Policy
beginner
Identify types of bias in ML systems
AI Safety Engineering
intermediate
Implement input and output guardrails
All Reads (1,346) Articles (522)Blog Posts (170)Tutorials (532)Research Papers (53)News (69)
Principles of Privacy by Design: Embedding Ethics and Trust into Every System
Dev.to · Jayita Gulati 🛡️ AI Safety & Ethics 8mo ago
Principles of Privacy by Design: Embedding Ethics and Trust into Every System
In a world increasingly defined by data, privacy is no longer a luxury—it is a fundamental right and...
AI's Blind Spot: A Simple Filter for Unlearning Bias
Dev.to · Arvind SundaraRajan 🛡️ AI Safety & Ethics 8mo ago
AI's Blind Spot: A Simple Filter for Unlearning Bias
AI's Blind Spot: A Simple Filter for Unlearning Bias Imagine your AI is a painter,...
Federated Learning Unleashed: Balancing Bias and Variance in Wireless AI by Arvind Sundararajan
Dev.to · Arvind SundaraRajan 🛡️ AI Safety & Ethics 8mo ago
Federated Learning Unleashed: Balancing Bias and Variance in Wireless AI by Arvind Sundararajan
Federated Learning Unleashed: Balancing Bias and Variance in Wireless AI Imagine training...
📈 A Key Metric for Measuring AI Ethics Success: Bias Reducti
Dev.to · Dr. Carlos Ruiz Viquez 🛡️ AI Safety & Ethics 8mo ago
📈 A Key Metric for Measuring AI Ethics Success: Bias Reducti
📈 A Key Metric for Measuring AI Ethics Success: Bias Reduction Rate (BRR) In the pursuit of...
AI Ethics in Action: How We Ensure Fairness, Bias Mitigation, and Explainability
Dev.to · CapeStart 🛡️ AI Safety & Ethics 8mo ago
AI Ethics in Action: How We Ensure Fairness, Bias Mitigation, and Explainability
Like many challenges, it began with a single student who continued receiving the wrong videos on her...
Bias and Fairness in AI Models: Why Responsible AI Matters
Dev.to · Manognya Lokesh Reddy 🛡️ AI Safety & Ethics 8mo ago
Bias and Fairness in AI Models: Why Responsible AI Matters
Hi Dev Community! 👋 I’m Manognya Lokesh Reddy, an AI researcher and engineer currently pursuing my...
Can We Really Trust AI? Lies, Poison, and the Need for Responsible AI
Dev.to · Santosh Shelar 🛡️ AI Safety & Ethics 8mo ago
Can We Really Trust AI? Lies, Poison, and the Need for Responsible AI
Technical, practical, and a little bit skeptical – just the way we like it. ...
Puzzle Piece Challenge 🧩: Design a Bias Mitigation Strategy
Dev.to · Dr. Carlos Ruiz Viquez 🛡️ AI Safety & Ethics 8mo ago
Puzzle Piece Challenge 🧩: Design a Bias Mitigation Strategy
Puzzle Piece Challenge 🧩: Design a Bias Mitigation Strategy for Rare Disease Treatment...
AGI: Beyond the Checklist - Evaluating for Sustained Performance by Arvind Sundararajan
Dev.to · Arvind SundaraRajan 🛡️ AI Safety & Ethics 8mo ago
AGI: Beyond the Checklist - Evaluating for Sustained Performance by Arvind Sundararajan
AGI: Beyond the Checklist - Evaluating for Sustained Performance Imagine a brilliant...
⚠️ Warning: "Optimism Bias" in Reinforcement Learning 🚨 Whe
Dev.to · Dr. Carlos Ruiz Viquez 🛡️ AI Safety & Ethics 8mo ago
⚠️ Warning: "Optimism Bias" in Reinforcement Learning 🚨 Whe
⚠️ Warning: "Optimism Bias" in Reinforcement Learning 🚨 When designing reinforcement learning...
LLMs: Decoding the Geometry of Alignment
Dev.to · Arvind SundaraRajan 🛡️ AI Safety & Ethics 8mo ago
LLMs: Decoding the Geometry of Alignment
LLMs: Decoding the Geometry of Alignment Imagine a world where AI not only answers your...
**AI Bias Alert: A Wake-Up Call for Fair Hiring Practices**
Dev.to · Dr. Carlos Ruiz Viquez 🛡️ AI Safety & Ethics 8mo ago
**AI Bias Alert: A Wake-Up Call for Fair Hiring Practices**
AI Bias Alert: A Wake-Up Call for Fair Hiring Practices Did you know that Amazon's AI recruitment...
The Cultural Iceberg: Unmasking Bias in Video AI
Dev.to · Arvind SundaraRajan 🛡️ AI Safety & Ethics 8mo ago
The Cultural Iceberg: Unmasking Bias in Video AI
The Cultural Iceberg: Unmasking Bias in Video AI Imagine an AI designed to analyze video...
🧩 Can you spot the hidden bias? A chatbot is designed to pro
Dev.to · Dr. Carlos Ruiz Viquez 🛡️ AI Safety & Ethics 9mo ago
🧩 Can you spot the hidden bias? A chatbot is designed to pro
🧩 Can you spot the hidden bias? A chatbot is designed to provide personalized job recommendations. To...
The Perilous Pursuit of Superintelligence: Heeding Mustafa Suleyman's AI Safety Warning
Dev.to · Yathin Chandra 🛡️ AI Safety & Ethics 9mo ago
The Perilous Pursuit of Superintelligence: Heeding Mustafa Suleyman's AI Safety Warning
Mustafa Suleyman, a co-founder of DeepMind and now CEO of Inflection AI, stands as a pivotal voice in...
AI Agents + Judge + Cron Job + Self-Learning Loop = The Pathway to AGI ?
Dev.to · Adil Maqsood 🛡️ AI Safety & Ethics 10mo ago
AI Agents + Judge + Cron Job + Self-Learning Loop = The Pathway to AGI ?
Introduction Artificial General Intelligence (AGI) has long been the holy grail of the AI world — a...
Cracking the Code of Generalization: Cross-Modal Alignment Meets Cross-Domain Learning
Dev.to · Dechun Wang 🛡️ AI Safety & Ethics 11mo ago
Cracking the Code of Generalization: Cross-Modal Alignment Meets Cross-Domain Learning
Why Cross-Modal + Cross-Domain = Smarter AI In an age where AI needs to not just recognize...
Respect to Ashkan Rajaee for prioritizing ethics over desperation. That’s not always easy.
Dev.to · Dan 🛡️ AI Safety & Ethics 1y ago
Respect to Ashkan Rajaee for prioritizing ethics over desperation. That’s not always easy.
How Ashkan Rajaee Handled a $250K Client Betrayal With Real...
How I got AWS to update a managed IAM policy :)
Dev.to · Paul SANTUS 🛡️ AI Safety & Ethics 1y ago
How I got AWS to update a managed IAM policy :)
Amazon/AWS people speak a lot on how they're customer obsessed, and have a bias for action. And...
Averaging Our Way to AGI
Dev.to · Will Vincent 🛡️ AI Safety & Ethics 1y ago
Averaging Our Way to AGI
I’ve been thinking a lot recently about this LinkedIn post (shout out to my colleague, Michelle...
Running deepseek locally, Ollama and langchain
Dev.to · shrey vijayvargiya 🛡️ AI Safety & Ethics 1y ago
Running deepseek locally, Ollama and langchain
Hello people, Welcome to iHateReading new blog Well AI is close to bringing AGI later on in 2025 or...
Rights for human and AI minds are needed to prevent a dystopia
Dev.to · Aram Panasenco 🛡️ AI Safety & Ethics 1y ago
Rights for human and AI minds are needed to prevent a dystopia
UPDATE: My thinking on the issue has changed a lot since doing more research on AI safety, and I now...
Misalignment of Agile with Product and Design needs (feat. Jessica Hall)
Dev.to · Andrew Park 🛡️ AI Safety & Ethics 1y ago
Misalignment of Agile with Product and Design needs (feat. Jessica Hall)
I recently interviewed Product Management expert Jessica Hall. In this snippet, Jessica articulates...
Composing Python: Beyond Inherited Code
Dev.to · Justin L Beall 🛡️ AI Safety & Ethics 2y ago
Composing Python: Beyond Inherited Code
Unlock the potential of Python development by embracing composition over inheritance. Through practical examples and insights, explore the benefits of modularit
Cooeee World! Passenger 🗞️🐦
Dev.to · Adam Crockett 🌀 🛡️ AI Safety & Ethics 2y ago
Cooeee World! Passenger 🗞️🐦
I have written extensively about a new category of tools known as Code Alignment Assistants. Both...
Stop procrastinating, start User Story Mapping
Dev.to · Matej Konrad 🛡️ AI Safety & Ethics 3y ago
Stop procrastinating, start User Story Mapping
The alignment of requirements is one of the biggest challenges a team has to face when working on a...