2:20 PM - 2:40 PM
[3J4-GS-5-03] Using Curiosity-driven Search to adapt to changes in prey behavior for Multi-agent Pursuit of Learning Preys
Keywords:Multi-Agent, Reinforcement learning
Multi-agent pursuit problems are difficult to learn with reinforcement learning due to few chance of getting rewards. Curiosity-driven search that gives intrinsic rewards based on familiarity of states is a promising method but those depending on a fixed network like RND do not detect a slight but important change, e.g., behavior of preys, in the state expressed as a high-dimensional vector. This work propose to use SND, a novel curiosity-driven search method which first learns the target network to separate its outputs depending on such important elements in the state vectors. Once we learn the target network to separate its outputs based on the velocities of preys, the predators are given intrinsic rewards from a slight change of them and encouraged to explore such states. We compare the result of the proposal with that of an existing work using RND.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.