Transformer-based Multi-agent Reinforcement Learning for Multiple Unmanned Aerial Vehicle Coordination in Air Corridors
submiteed to IEEE International Conference on Communications
D3MOVE_v4.py for visualization of UAVs coordination in air corridors.
UAVs need to traverse several air corridors to reach their destinations. Air corridors are modelled as cylinder and partial torus.
- H(), embedding layer, normalizes the input values and standardize the input dimensions.
- G(), transformer layer, deals with stochastic neighbors information
- F(), actor-critic network combined.
related package can be found in environment.yml