[3Rin4-02] Reinforcement Learning with Randomized Physical Parameters for Fault Tolerant Robots
Keywords:Reinforcement learning, Fault tolerant, Physical parameters, robotics
In reinforcement learning, policy is normally learned in simulation environments and is applied to the real world for cost and safety reasons. However, the learned policy cannot often adapt because real world disturbances and failures cause gaps between the two environments. In order to narrow such gap, the policy that is able to adapt to various scenarios are needed. In this paper we propose a reinforcement learning method for acquiring a robust policy against failures. In the proposed method, the failure is represented by adjusting the physical parameters of the robot. Reinforcement learning under various faults is made by randomizing the physical parameters during learning. In experiments, we show that the robot learned with the proposed method has higher average rewards than a normal robot for quadruped walking task in a simulation environment with/without robot failures.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.