2:00 PM - 2:20 PM
[3F4-OS-42a-02] "Negative In-context Learning for Mitigating Copyright Infringement"
Keywords:LLM, Incontext learning, contrastive decoding
In this study, we propose negative in-context learning, an unlearning method that can be applied for black-box LLMs based on in-context learning. In-context learning allows LLMs to learn knowledge given a few examples without access to model parameters. In contrast, negative in-context learning makes LLM unlearn knowledge by providing negative in-context examples made by using contrastive decoding. By learning these negative in-context examples, LLMs can selectively forget specific knowledge without updating model parameters.
Experimental results show that the introduction of negative in-context examples leads to a significant decrease in BLEU, Jaccard, and ROUGE-L scores, confirming that our method effectively interferes with the model’s recall of the original information.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.