JSAI2022

Presentation information

General Session

General Session » GS-5 Language media processing

[2A6-GS-6] Language media processing: applications

Wed. Jun 15, 2022 5:20 PM - 7:00 PM Room A (Main Hall)

座長:有本 庸浩(NTT)[遠隔]

6:20 PM - 6:40 PM

[2A6-GS-6-04] The Endeavour to Advance Short Text Classification: Using Heterogeneous Graph Neural Network via Building Sememe-relationships

〇Xiaoran Li1, Toshiaki Takano1 (1. Shizuoka Institute of Science and Technology)

[[Online]]

Keywords:Short Text Classification, Graph Neural Network, Sememe

Short Text Classification (STC) is one of the fundamental tasks in natural language processing. The lack of grammatical structure and contextual information causes it challenging. One approach is to improve the STC by introducing the label information of entities via the entity knowledge base to build a hierarchical heterogeneous graph. However, the previous entity knowledge bases do not consider the complex semantic relationships of entities, and the number of entities in the articles is too large, affecting the computational resources. This paper proposes using sememes instead of entities to exploit the deeper semantic relations between words better to build heterogeneous graph networks. As the smallest semantic unit, the sememe consists of a finite number of words. We utilized Self-attention to find the sememe in short texts and the weight parameter between them. Extensive experiments results have demonstrated that our proposed method outperforms state-of-the-art methods on the Snippets dataset for STC.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password