JSAI2025

Presentation information

Organized Session

Organized Session » OS-42

[3F4-OS-42a] OS-42

Thu. May 29, 2025 1:40 PM - 3:20 PM Room F (Room 1001)

オーガナイザ:金子 正弘(MBZUAI),小島 武(東京大学),磯沼 大(The University of Edinburgh/東京大学),丹羽 彩奈(MBZUAI),大葉 大輔(ELYZA/東京科学大学),村上 明子(AIセーフティーインスティチュート),関根 聡(情報学研究所),内山 将夫(情報通信研究機構),Danushka Bollegala(The University of Liverpool/Amazon)

2:00 PM - 2:20 PM

[3F4-OS-42a-02] "Negative In-context Learning for Mitigating Copyright Infringement"

〇Satoru Utsunomiya1, Masaru Isonuma1,2,3, Junichiro Mori1,4, Ichiro Sakata1 (1. The University of Tokyo, 2. The University of Edinburgh, 3. National Institute of Informatics, 4. RIKEN Center for Advanced Intelligence Project)

Keywords:LLM, Incontext learning, contrastive decoding

This study introduces a novel unlearning technique to address the unauthorized reproduction of copyrighted materials by large language models (LLMs). Although unlearning techniques have recently been introduced as an efficient, low-cost solution for addressing copyright infringement, they require access to model parameters and are therefore not applicable to black-box LLMs.

In this study, we propose negative in-context learning, an unlearning method that can be applied for black-box LLMs based on in-context learning. In-context learning allows LLMs to learn knowledge given a few examples without access to model parameters. In contrast, negative in-context learning makes LLM unlearn knowledge by providing negative in-context examples made by using contrastive decoding. By learning these negative in-context examples, LLMs can selectively forget specific knowledge without updating model parameters.

Experimental results show that the introduction of negative in-context examples leads to a significant decrease in BLEU, Jaccard, and ROUGE-L scores, confirming that our method effectively interferes with the model’s recall of the original information.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password