Document Type : Original Article

Authors

1 Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran

2 Faculty of Computer Science and Engineering Shahid Beheshti University Tehran, Iran

Abstract

Abstract - - Abstract - - Abstract - This paper proposes a master-slave approach to simultaneously control two drones with the aim of carrying an object toward a goal. The proposed method utilizes the Double Deep Q-Learning (DDQN) technique to train a master agent to be able to carry an object toward a goal with the help of a slave agent. This procedure is implemented such that the master agent gathers the observations and specifies the actions to be made by itself and the slave agent. Indeed, the slave agent just applies a predefined action and does not process any input for producing the output. This manner of learning, leads to a unified convergence to an optimal solution compared to the situation in which each agent is trained separately. To verify the functionality of the proposed method, the algorithm is examined in the webots simulation environment. The simulations show that the introduced method has a good performance when controlling the drones to reach to the goal. The introduced method, other than algorithmic benefits which leads to a faster convergence of the model, suggests some reduction in the processing demand. The reason is that the learning procedure is guided by one of the agents and consequently only one of the agents is responsible for doing the calculations that lead to choosing the action. In this scenario, the slave agent does not require any processing resources for choosing the action and just simply applies a predefined action dictated by the master agent.

Keywords

Main Subjects