Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives

📰 ArXiv cs.AI

Average reward reinforcement learning for omega-regular and mean-payoff objectives is explored as a principled alternative to manual reward function design

advanced Published 23 Mar 2026

Action Steps

Specify behavioral requirements in omega-regular languages
Compile these requirements into learning objectives
Use average reward reinforcement learning to optimize agent behavior
Evaluate the performance of the learned policy using mean-payoff objectives

Who Needs to Know This

Machine learning researchers and engineers working on reinforcement learning and formal verification can benefit from this research to improve agent behavior and automate reward function design

Key Insight

💡 Omega-regular languages can be used to specify behavioral requirements and automatically compile them into learning objectives for reinforcement learning