JSAI2025

Presentation information

Organized Session

Organized Session » OS-41

[1B5-OS-41c] OS-41

Tue. May 27, 2025 5:40 PM - 7:20 PM Room B (Small hall)

オーガナイザ:鈴木 雅大(東京大学),岩澤 有祐(東京大学),河野 慎(東京大学),熊谷 亘(オムロンサイニックエックス),松嶋 達也(東京大学),Paavo Parmas(東京大学),谷口 尚平(東京大学)

5:40 PM - 6:00 PM

[1B5-OS-41c-01] Web Agent with Meta-Prompt-Driven Expert Integration.

〇Satoi Yamaguchi1, Yuna Mastunaga2, Takayasu Ikeda3, Masahiro Suzuki4, Yutaka Matuso4 (1. Waseda University, 2. Makuhari Senior High School, 3. Advanced Institute of Industrial Technology, 4. Graduate School of Engineering, The University of Tokyo)

Keywords:Web agent, LLM, meta prompt

This research explores a novel approach to task completion that does not rely on extensive pre-training in the rapidly evolving field of Web Agents. A major challenge existing Agents face is the automation of tasks accompanying image recognition. However, previous methods highlight limited compatibility between image recognition performance and approaches that do not require pre-training. To address this limitation, we propose an approach that strategically integrates several expert methods employing meta-prompt within LLM, achieving advanced environmental analysis that enables performance improvement. We evaluate the proposed approach using MiniWob++. Additionally, we compare existing Agents to the proposed approach to access task success rate. This paper offers insight into the potential of integration of meta-prompt using LLM to improve task completion rate, suggesting the possibility of a decrease in the necessity of extensive data collection and training required by current agents.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password