Multi-action Tangled Program Graphs for Multi-task Reinforcement Learning with Continuous Control

📰 ArXiv cs.AI

arXiv:2604.25369v1. Abstract: Over the past few decades, machine learning has been widely used to learn complex tasks. Reinforcement Learning (RL), inspired by human behavior, is a great example, as it involves developing specific behaviors for specific tasks. To further challenge algorithms, Multi-Task RL (MTRL) environments have been introduced, requiring a single model to learn multiple behaviors. The Tangled Program Graph (TPG) algorithm is a Genetic Programming (GP) algor

Published 29 Apr 2026