Graph-based wrong IsA relation detection in a large-scale lexical taxonomy

Jiaqing Liang, Yanghua Xiao, Yi Zhang, Seung Won Hwang, Haixun Wang

Research output: Contribution to conferencePaper

4 Citations (Scopus)

Abstract

Knowledge base(KB) plays an important role in artificial intelligence. Much effort has been taken to both manually and automatically construct web-scale knowledge bases. Comparing with manually constructed KBs, automatically constructed KB is broader but with more noises. In this paper, we study the problem of improving the quality for automatically constructed web-scale knowledge bases, in particular, lexical taxonomies of isA relationships. We find that these taxonomies usually contain cycles, which are often introduced by incorrect isA relations. Inspired by this observation, we introduce two kinds of models to detect incorrect isA relations from cycles. The first one eliminates cycles by extracting directed acyclic graphs, and the other one eliminates cycles by grouping nodes into different levels. We implement our models on Probase, a state-of-the-art, automatically constructed, web-scale taxonomy. After processing tens of millions of relations, our models eliminate 74 thousand wrong relations with 91% accuracy.

Original languageEnglish
Pages1178-1184
Number of pages7
Publication statusPublished - 2017 Jan 1
Event31st AAAI Conference on Artificial Intelligence, AAAI 2017 - San Francisco, United States
Duration: 2017 Feb 42017 Feb 10

Other

Other31st AAAI Conference on Artificial Intelligence, AAAI 2017
CountryUnited States
CitySan Francisco
Period17/2/417/2/10

Fingerprint

Taxonomies
Artificial intelligence
Processing

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence

Cite this

Liang, J., Xiao, Y., Zhang, Y., Hwang, S. W., & Wang, H. (2017). Graph-based wrong IsA relation detection in a large-scale lexical taxonomy. 1178-1184. Paper presented at 31st AAAI Conference on Artificial Intelligence, AAAI 2017, San Francisco, United States.
Liang, Jiaqing ; Xiao, Yanghua ; Zhang, Yi ; Hwang, Seung Won ; Wang, Haixun. / Graph-based wrong IsA relation detection in a large-scale lexical taxonomy. Paper presented at 31st AAAI Conference on Artificial Intelligence, AAAI 2017, San Francisco, United States.7 p.
@conference{98a70fa746d14fa994164120c7c51772,
title = "Graph-based wrong IsA relation detection in a large-scale lexical taxonomy",
abstract = "Knowledge base(KB) plays an important role in artificial intelligence. Much effort has been taken to both manually and automatically construct web-scale knowledge bases. Comparing with manually constructed KBs, automatically constructed KB is broader but with more noises. In this paper, we study the problem of improving the quality for automatically constructed web-scale knowledge bases, in particular, lexical taxonomies of isA relationships. We find that these taxonomies usually contain cycles, which are often introduced by incorrect isA relations. Inspired by this observation, we introduce two kinds of models to detect incorrect isA relations from cycles. The first one eliminates cycles by extracting directed acyclic graphs, and the other one eliminates cycles by grouping nodes into different levels. We implement our models on Probase, a state-of-the-art, automatically constructed, web-scale taxonomy. After processing tens of millions of relations, our models eliminate 74 thousand wrong relations with 91{\%} accuracy.",
author = "Jiaqing Liang and Yanghua Xiao and Yi Zhang and Hwang, {Seung Won} and Haixun Wang",
year = "2017",
month = "1",
day = "1",
language = "English",
pages = "1178--1184",
note = "31st AAAI Conference on Artificial Intelligence, AAAI 2017 ; Conference date: 04-02-2017 Through 10-02-2017",

}

Liang, J, Xiao, Y, Zhang, Y, Hwang, SW & Wang, H 2017, 'Graph-based wrong IsA relation detection in a large-scale lexical taxonomy' Paper presented at 31st AAAI Conference on Artificial Intelligence, AAAI 2017, San Francisco, United States, 17/2/4 - 17/2/10, pp. 1178-1184.

Graph-based wrong IsA relation detection in a large-scale lexical taxonomy. / Liang, Jiaqing; Xiao, Yanghua; Zhang, Yi; Hwang, Seung Won; Wang, Haixun.

2017. 1178-1184 Paper presented at 31st AAAI Conference on Artificial Intelligence, AAAI 2017, San Francisco, United States.

Research output: Contribution to conferencePaper

TY - CONF

T1 - Graph-based wrong IsA relation detection in a large-scale lexical taxonomy

AU - Liang, Jiaqing

AU - Xiao, Yanghua

AU - Zhang, Yi

AU - Hwang, Seung Won

AU - Wang, Haixun

PY - 2017/1/1

Y1 - 2017/1/1

N2 - Knowledge base(KB) plays an important role in artificial intelligence. Much effort has been taken to both manually and automatically construct web-scale knowledge bases. Comparing with manually constructed KBs, automatically constructed KB is broader but with more noises. In this paper, we study the problem of improving the quality for automatically constructed web-scale knowledge bases, in particular, lexical taxonomies of isA relationships. We find that these taxonomies usually contain cycles, which are often introduced by incorrect isA relations. Inspired by this observation, we introduce two kinds of models to detect incorrect isA relations from cycles. The first one eliminates cycles by extracting directed acyclic graphs, and the other one eliminates cycles by grouping nodes into different levels. We implement our models on Probase, a state-of-the-art, automatically constructed, web-scale taxonomy. After processing tens of millions of relations, our models eliminate 74 thousand wrong relations with 91% accuracy.

AB - Knowledge base(KB) plays an important role in artificial intelligence. Much effort has been taken to both manually and automatically construct web-scale knowledge bases. Comparing with manually constructed KBs, automatically constructed KB is broader but with more noises. In this paper, we study the problem of improving the quality for automatically constructed web-scale knowledge bases, in particular, lexical taxonomies of isA relationships. We find that these taxonomies usually contain cycles, which are often introduced by incorrect isA relations. Inspired by this observation, we introduce two kinds of models to detect incorrect isA relations from cycles. The first one eliminates cycles by extracting directed acyclic graphs, and the other one eliminates cycles by grouping nodes into different levels. We implement our models on Probase, a state-of-the-art, automatically constructed, web-scale taxonomy. After processing tens of millions of relations, our models eliminate 74 thousand wrong relations with 91% accuracy.

UR - http://www.scopus.com/inward/record.url?scp=85030472586&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85030472586&partnerID=8YFLogxK

M3 - Paper

SP - 1178

EP - 1184

ER -

Liang J, Xiao Y, Zhang Y, Hwang SW, Wang H. Graph-based wrong IsA relation detection in a large-scale lexical taxonomy. 2017. Paper presented at 31st AAAI Conference on Artificial Intelligence, AAAI 2017, San Francisco, United States.