SCIEPublish

Dogfight Simulation of Autonomous Swarm UAVs Based on Multi-Agent Deep Reinforcement Learning

Article Open Access

Dogfight Simulation of Autonomous Swarm UAVs Based on Multi-Agent Deep Reinforcement Learning

Author Information
Department of Computer Engineering, Faculty of Technology, Gazi University, Ankara 06560, Türkiye
*
Authors to whom correspondence should be addressed.

Received: 30 December 2025 Revised: 06 February 2026 Accepted: 23 March 2026 Published: 08 April 2026

Creative Commons

© 2026 The authors. This is an open access article under the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Views:7
Downloads:3
Drones Auton. Veh. 2026, 3(2), 10011; DOI: 10.70322/dav.2026.10011
ABSTRACT: The operational utility of Unmanned Aerial Vehicles (UAVs) has evolved from passive surveillance to active engagement in disputed environments, where autonomous control must operate under highly dynamic and adversarial conditions. Hand-crafted heuristics often exhibit limited robustness when facing stochastic opponent behavior and non-stationary interactions. To address these challenges, we propose a Multi-Agent Deep Reinforcement Learning (MADRL) framework implemented in a Unity 6–based, physics-driven simulation that models flight dynamics and weapon kinematics. Agents are trained using Proximal Policy Optimization (PPO) with a composite reward function designed to encourage cooperative behaviors (e.g., coordinated target engagement) while enforcing safety constraints such as collision avoidance. In empirical evaluations, the learned policies achieve an 85% win rate against a heuristic baseline under the tested scenarios, exhibiting coordinated maneuvers and adaptive engagement strategies. These results indicate that multi-agent learning with decentralized execution can reduce operator workload and improve swarm effectiveness and survivability in conflict zone.
Keywords: Deep reinforcement learning (DRL); Multi-agent systems (MAS); Unmanned aerial vehicles (UAV); Proximal policy optimization (PPO); Autonomous combat simulation; Unity ML-agents
TOP