Keywords:Robotics, Reinforcement Learning, Fault Tolerance
Robots have the possibility of breaking down, and when in an environment where access is limited, they still need to accomplish tasks they are required to do, even when reparation is not a possibility. The purpose of this research is to derive a policy using reinforcement learning that produces a high-performance robot, even in the case of failure. The proposed method learns the normal transition function and adds the difference between the predicted state transition and the actual state transition to the input of the policy network. The results of the experiment show that our method outperformed the baseline method that uses no state transition differen.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.