JSAI2024

Presentation information

Poster Session

[4Xin2] Poster session 2

Fri. May 31, 2024 12:00 PM - 1:40 PM Room X (Event hall 1)

[4Xin2-65] Sentence Tagging as Metric Learning Using Data Augmentation with Large Language Model

〇Kenya Nonaka1, Koutaro Tamura1 (1.Uzabase Inc)

Keywords: deep metric learning, Q&A Service

Tagging articles is one of the most fundamental tasks in natural language processing. Uzabase, Inc., which provides business information infrastructure, frequently needs to tag economic articles. In particular, the Flash Opinion service matches user questions with experts by tagging each question with the relevant fields of expertise, and there is demand to reduce the workload of the operators who perform this tagging. If the task is formulated as a conventional multi-label classification problem, the model must be retrained every time a tag is added or removed. In this study, we present a method that recasts the task as metric learning between tags and question texts, using a large language model to augment the data from tag names. Applied to an actual dataset obtained from operations, the proposed method was confirmed to give operators better tag recommendations than multi-label classification models.
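
The contrast with multi-label classification can be illustrated with a minimal sketch. It is not the authors' implementation: the bag-of-words `embed` function is a toy stand-in for a learned encoder, and the tag names, tag descriptions (standing in for text a large language model might generate from a tag name), and questions are all hypothetical. The sketch only shows why a similarity-based formulation lets a tag be added without retraining.

```python
import math
from collections import Counter
from typing import Dict, List


def embed(text: str) -> Counter:
    """Toy stand-in for a learned encoder: lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend_tags(question: str, tag_texts: Dict[str, str], k: int = 2) -> List[str]:
    """Rank tags by similarity between the question and each tag's text."""
    q = embed(question)
    ranked = sorted(tag_texts, key=lambda t: cosine(q, embed(tag_texts[t])), reverse=True)
    return ranked[:k]


# Hypothetical tags; the descriptions stand in for LLM-augmented tag text.
tags = {
    "semiconductors": "semiconductors chip foundry wafer supply",
    "retail": "retail store consumer sales e-commerce",
}
print(recommend_tags("How will chip supply shortages affect foundry pricing?", tags, k=1))

# Adding a tag only adds an entry to the tag set; nothing is retrained.
tags["energy"] = "energy oil gas renewable power grid"
print(recommend_tags("What is the outlook for renewable power prices?", tags, k=1))
```

In a multi-label classifier the output layer has one unit per tag, so changing the tag set changes the model. Here the tag set is just data compared against the question at inference time.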
