2018年度人工知能学会全国大会(第32回)

講演情報

口頭発表

一般セッション » [一般セッション] 2.機械学習

[1N3] 機械学習-強化学習

2018年6月5日(火) 17:20 〜 19:00 N会場 (2F 桜島)

座長:松井 藤五郎(中部大学)

18:40 〜 19:00

[1N3-05] Inverse Reinforcement Learning with BDI Agents for Pedestrian Behavior Simulation

〇Nahum Alvarez1, Itsuki Noda1 (1. The National Institute of Advanced Industrial Science and Technology (AIST))

キーワード:inverse reinforcement learning, multi-agent system, pedestrian simulator

Crowd behavior has been subject of study in fields like disaster evacuation, smart town planning and business strategic placing. It is possible to create a model for those scenarios using machine learning techniques and a relatively small training data set to identify behavioral. We implemented a BDI-based agent model that uses such techniques into a large-scale crowd simulator, and apply inverse reinforcement learning to adjust agents' behaviors by examples. The goal of the system is to provide to the agents a realistic behavior model and a method to orient themselves without knowing the scenario's layout, based in learnt patterns around environment features.