JSAI2024

Presentation information

Organized Session

Organized Session » OS-4

[1L4-OS-4a] OS-4

Tue. May 28, 2024 3:00 PM - 4:40 PM Room L (Room 52)

オーガナイザ:伊藤 貴之(お茶の水女子大学)、脇田 建(東京工業大学)

3:40 PM - 4:00 PM

[1L4-OS-4a-03] Visualizing the Effects of Gender Bias and Debias in Word Embedding

〇Arisa Sugino1, Takayuki Itoh1 (1. Ochanomizu University)

Keywords:Word Embedding, Gender Bias, Visualization

In recent years, services utilizing AI such as chatbots and image analysis for translation have been increasing daily, and AI is playing an active role in various aspects of our lives. Word embedding is a technique in natural language processing that converts words into numerical representations for computational manipulation. Here, biases inherent in the data set used to pre-train the language model may affect the resulting model is one of the drawbacks of word embedding. There are a limited number of prior studies on biases and debiasing in Japanese word embedding, with most studies focusing on English word embedding. Therefore, this study aimed to detect gender bias from a pretrained Word2vec dataset trained on the Japanese version of Wikipedia, visualize the biased results, and classify the biases. The experimental results demonstrate differences between categories with high bias and those susceptible to the effects of debiasing.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password