JSAI2023

Presentation information

General Session

General Session » GS-2 Machine learning

[3D5-GS-2] Machine learning

Thu. Jun 8, 2023 3:30 PM - 5:10 PM Room D (A1)

座長:宮川 大輝(NEC) [オンライン]

3:50 PM - 4:10 PM

[3D5-GS-2-02] Comment generation using a large-scale language model for fashion item recommendation

〇Yugen Sato1, Sayaka Aoba1, Ryozo Masukawa1, Makoto Sato1, Shosuke Haji1, Taiga Matsui2, Keita Ishikawa2, Tomohiro Takagi1 (1. Meiji University, 2. airCloset, Inc.)

Keywords:Large-scale model, Fashion Item Recommendation, comment generation, Vision&Language

In personal styling, the stylist selects fashion items based on the client's characteristics, purpose of use, season, and various other factors. The stylist then carefully comments on the reasons for the selection and sends it to the customer along with the recommended fashion item. In response to this, we use MAGMA, a method that supports multimodal input of language models by means of adapter-based fine tuning, to construct a model that generates comment suggestions based on the combination of item images and prompts. We conducted quantitative and qualitative evaluations of the proposed model and confirmed that the model using MAGMA is superior to the conventional method.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password