The 37th Annual Conference of the Japanese Society for Artificial Intelligence, 2023

Presentation Information

Organized Session

[1P5-OS-16b] AI and Democracy

Tuesday, June 6, 2023, 17:00 - 18:40, Room P (Conference Room G1+G2)

Organizers: Takayuki Ito, Susumu Ohnuma, Tokuro Matsuo, Shun Shiramatsu

18:00 - 18:20

[1P5-OS-16b-04] A Large-scale Labeled English Text Datasets for Machine Learning: Case of Issue-based Information System

〇Jawad Haqbeen1, Sofia Sahab1, Takayuki Ito1 (1. Kyoto University)

Keywords: annotation, text dataset, machine learning, deep learning, natural language processing

Text has emerged as one of the fastest-growing data types on the internet. This growth has driven significant recent advances in Natural Language Processing (NLP), primarily through Deep Learning (DL) and Machine Learning (ML) techniques. These methods require large amounts of labeled text data in a specific format and structure, typically annotated according to some form of dialogue mapping. For instance, node and link extractor models in D-Agree have been trained on text data annotated in Issue-based Information System (IBIS) notation. However, training such models for English has been difficult because preparing labeled IBIS datasets in English is laborious. In this study, we present a process for annotating and releasing large quantities of IBIS-based training data for machine learning, providing researchers with a free environment to train their opinion extractor models in English.
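
To make the idea of IBIS-labeled training data concrete, the following is a minimal Python sketch of how annotated nodes and links could be represented and checked. It is an illustration under stated assumptions only: the abstract does not describe the released dataset's schema or D-Agree's internal format, so the class names, field names, and the `ALLOWED_LINKS` rule set here are hypothetical. The only thing taken from IBIS itself is the general vocabulary of issues, ideas (positions), and pro/con arguments.

```python
from dataclasses import dataclass
from enum import Enum


class NodeType(Enum):
    """IBIS element types (labels for the node extractor, in this sketch)."""
    ISSUE = "issue"
    IDEA = "idea"  # often called a "position" in IBIS
    PRO = "pro"
    CON = "con"


@dataclass(frozen=True)
class Node:
    node_id: int          # hypothetical identifier
    node_type: NodeType   # annotated label
    text: str             # the sentence or span being labeled


@dataclass(frozen=True)
class Link:
    source: int  # id of the responding node
    target: int  # id of the node it responds to


# Simplified rule set assumed for this sketch: ideas answer issues,
# pros and cons argue about ideas, and an issue may question any node.
ALLOWED_LINKS = {
    (NodeType.IDEA, NodeType.ISSUE),
    (NodeType.PRO, NodeType.IDEA),
    (NodeType.CON, NodeType.IDEA),
    (NodeType.ISSUE, NodeType.ISSUE),
    (NodeType.ISSUE, NodeType.IDEA),
    (NodeType.ISSUE, NodeType.PRO),
    (NodeType.ISSUE, NodeType.CON),
}


def invalid_links(nodes, links):
    """Return the links whose (source type, target type) pair is not allowed."""
    types = {n.node_id: n.node_type for n in nodes}
    bad = []
    for link in links:
        pair = (types.get(link.source), types.get(link.target))
        if pair not in ALLOWED_LINKS:
            bad.append(link)
    return bad


# A tiny hand-written example thread (invented text, not from the dataset).
nodes = [
    Node(1, NodeType.ISSUE, "How can we reduce traffic downtown?"),
    Node(2, NodeType.IDEA, "Add more bus lanes."),
    Node(3, NodeType.PRO, "Buses move more people per lane."),
    Node(4, NodeType.CON, "Construction would be costly."),
]
links = [Link(2, 1), Link(3, 2), Link(4, 2)]
```

In this sketch, a check such as `invalid_links(nodes, links)` could be run over annotated threads before training to catch labeling mistakes, for example a pro argument attached directly to an issue.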
