The First International Conference on Artificial Intelligence
A Master-Slave Approach for Simultaneously Controlling Two Drones when Carrying an Object
Authors:
Seyyed Mohammad Ali Ardehali (1)
Amin Faraji (2)
Monireh Abdoos (3)
Armin Salimi-Badr (4)
1- Shahid Beheshti University
2- Shahid Beheshti University
3- Shahid Beheshti University
4- Shahid Beheshti University
Keywords:
Reinforcement Learning, Double Deep Q-Learning, master-slave approach
Abstract:
This paper proposes a master-slave approach for simultaneously controlling two drones that carry an object toward a goal. The proposed method uses the Double Deep Q-Learning (DDQN) technique to train a master agent to carry the object toward the goal with the help of a slave agent. The master agent gathers the observations and specifies the actions to be taken both by itself and by the slave agent. The slave agent does not process any input to produce its output. Compared with training each agent separately, this manner of learning leads to a unified convergence toward an optimal solution. To verify the proposed method, the algorithm is evaluated in the Webots simulation environment. The simulations show that the method performs well when controlling the drones to reach the goal. Besides the algorithmic benefit of faster model convergence, the method also reduces the processing demand: because learning is guided by one of the agents, only that agent performs the calculations that lead to choosing an action. The slave agent requires no processing resources for action selection and simply applies the predefined action dictated by the master agent.
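The control scheme described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the per-drone action set, the encoding of a joint action as a single index, the function names, and the use of plain Python lists in place of neural-network outputs. The sketch shows the master selecting one joint action for both drones, and the standard Double DQN target, in which the online network selects the next action and the target network evaluates it.

```python
import random

# Hypothetical per-drone action set (the abstract does not list the actions).
DRONE_ACTIONS = ["forward", "backward", "left", "right", "hover"]
N_PER_DRONE = len(DRONE_ACTIONS)


def argmax(values):
    """Index of the largest value (first one on ties)."""
    return max(range(len(values)), key=values.__getitem__)


def decode_joint_action(index):
    """Split one joint-action index into (master_action, slave_action).

    Assumed encoding: index = master_idx * N_PER_DRONE + slave_idx.
    """
    master_idx, slave_idx = divmod(index, N_PER_DRONE)
    return DRONE_ACTIONS[master_idx], DRONE_ACTIONS[slave_idx]


def select_joint_action(q_values, epsilon, rng=random):
    """Epsilon-greedy choice over the joint action space.

    Only the master runs this; the slave just receives its half of the result.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return argmax(q_values)


def double_dqn_target(reward, done, gamma, q_online_next, q_target_next):
    """Double DQN target for a single transition.

    The online network's Q-values pick the next action; the target
    network's Q-values supply the value of that action.
    """
    if done:
        return reward
    best_next = argmax(q_online_next)
    return reward + gamma * q_target_next[best_next]


if __name__ == "__main__":
    # Stand-in for the master network's output: one Q-value per joint action.
    q_values = [0.0] * (N_PER_DRONE ** 2)
    q_values[7] = 1.0
    joint = select_joint_action(q_values, epsilon=0.0)
    master_action, slave_action = decode_joint_action(joint)
    print("master:", master_action, "| slave:", slave_action)
```

With this assumed encoding, a single network with 25 outputs covers every pair of drone actions, so the slave needs no policy of its own. The cost is that the joint action space grows with the product of the per-drone action counts.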
List of papers
A Comprehensive Review of Machine Learning Applications in Multiple Sclerosis: From Diagnosis to Prognosis and Treatment Response Prediction
Mahdie Azizi hashjin - Babak Nouri-Moghaddam - Abbas Mirzaei
Evaluating Parkinson’s Disease Severity Through Attention-Based STGCN and S2AGCN Models Utilizing Kinect Skeleton Images
Fatemeh Fadaei Ardestani - Nima Asadi
Improvement in intent detection and slot filling by model enhancement and different data augmentation strategies
Mohammad Mahdi HajiRamezanAli - Hasan Deldar - Mohammad Mehdi Homayounpour
LDA-ML: A Hybrid DDoS Detection Attacks in SDN Environment using Machine Learning
Alireza Rezaei - Amineh Amini
Hybrid ANN and Ant Colony Algorithm for IoT Data Classification
Khadejeh Nemati - Safouro Ashoori - Moohamad hadi Amini
Potential of machine learning algorithms for predicting the properties of medium-density fiberboard (MDF): preliminary results
Rahim Mohebbi Gargari - Ali Shalbafan - Seyed Jalil Alavi - Maryam Amirmazlaghni - Seyed Hamzeh Sadatnejad - Heiko Thoemen
The Role of Ethics in Autonomous Decision Making: Advancements in Artificial Moral Agents
Fatemeh Ghazali - Touraj BaniRostam - MirMohsen Pedram
Reconstruction of ECoG signals in response to visual stimuli using a model based on convolutional and regression networks.
Mohammad Amin Lotfi - Kimiya Eghbal - Fateneh Zareayan Jahromy
Strategies and Future Horizons of Innovative Entrepreneurship in AI-Based Programming
Milad Ghiasspour
Split and rephrase: Simple Syntactic Sentences for NLP applications
Mahdi Asghari - Alireza Talebpour - Ghasem Darzi