Presentation information

International Session

International Session (Work in progress) » EW-1 Knowledge engineering

[1N4-IS-1a] Knowledge engineering (1/3)

Tue. Jun 8, 2021 5:20 PM - 7:00 PM Room N (IS room)

Chair: Akinori Abe (Chiba University)

5:40 PM - 6:00 PM

[1N4-IS-1a-02] MTabES: Entity Search with Keyword Search, Fuzzy Search, and Entity Popularities

〇Phuc Nguyen1, Ikuya Yamada2, Hideaki Takeda1 (1. National Institute of Informatics, Japan, 2. Studio Ousia, Japan)

Keywords: Knowledge graph, Entity Search, Lookup

Entity search (ES) is the problem of finding relevant entities given a query that is an entity label. The problem is challenging because many entities can share the same label, and a single entity can have many names. Moreover, queries are often noisy, containing abbreviations or misspellings. The problem becomes even harder when queries are multilingual.
(1) Objectives: We introduce MTabES, an entity search tool designed to handle noisy queries. In particular, we introduce a reranking function defined as a weighted fusion of fuzzy search with edit distance, keyword search with the BM25 algorithm, and entity popularity measured by PageRank scores. The key advantage of MTabES is that fuzzy search boosts the hit rate.
(2) Conclusions: Experimental results for entity search on the SemTab 2020 and Tough Tables datasets show that our toolkit achieves a higher hit rate than the standard lookup services of knowledge graphs, e.g., Wikidata and Wikipedia. Moreover, MTabES works efficiently, handling about five queries per second in its efficiency mode. The MTabES toolkit is available at https://github.com/phucty/mtab_tool.
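
The weighted fusion described in the objectives can be illustrated with a minimal sketch. This is not the MTabES implementation: the function names, the `Candidate` type, the weight values, the entity IDs, and the assumption that BM25 and PageRank scores arrive already normalized to [0, 1] are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


def levenshtein(a: str, b: str) -> int:
    """Edit distance via dynamic programming, keeping one row at a time."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def fuzzy_similarity(query: str, label: str) -> float:
    """Map edit distance to [0, 1]; 1.0 means identical (case-insensitive)."""
    longest = max(len(query), len(label))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(query.lower(), label.lower()) / longest


@dataclass
class Candidate:
    entity_id: str    # hypothetical identifier, not a real knowledge graph ID
    label: str
    bm25: float       # keyword score, assumed normalized to [0, 1]
    pagerank: float   # popularity score, assumed normalized to [0, 1]


def fused_score(query: str, cand: Candidate,
                w_fuzzy: float = 0.5, w_bm25: float = 0.3,
                w_pop: float = 0.2) -> float:
    """Weighted sum of the three signals; the weights are illustrative only."""
    return (w_fuzzy * fuzzy_similarity(query, cand.label)
            + w_bm25 * cand.bm25
            + w_pop * cand.pagerank)


def rerank(query: str, candidates: list[Candidate]) -> list[Candidate]:
    """Sort candidates by fused score, best first."""
    return sorted(candidates, key=lambda c: fused_score(query, c), reverse=True)


# A misspelled query: "Tokyo" shares no exact keyword with "Tokio", so its
# keyword score is 0, but the fuzzy and popularity signals still rank it first.
candidates = [
    Candidate("E2", "Tokio Hotel", bm25=0.8, pagerank=0.3),
    Candidate("E1", "Tokyo", bm25=0.0, pagerank=0.9),
]
ranked = rerank("Tokio", candidates)
print([c.label for c in ranked])
```

In this toy case the misspelled query matches "Tokio Hotel" on keywords alone, while the fuzzy and popularity terms lift "Tokyo" above it, which is the kind of hit-rate gain the abstract attributes to fuzzy search.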