Resource – Paper 235

WebIsALOD: Providing Hypernymy Relations extracted from the Web as Linked Open Data

Sven Hertling and Heiko Paulheim

Resource

clock_eventOctober 24, 2017, 14:30.
house Lehár 1-3
download Download paper (preprint)

Abstract

Hypernymy relations are an important asset in many applications, and a central ingredient to Semantic Web ontologies. The IsA database is a large collection of such hypernymy relations extracted from the Common Crawl. In this paper, we introduce WebIsALOD, a Linked Open Data release of the IsA database, containing 400M hypernymy relations, each provided with rich provenance information. As the original dataset contained more than 80% wrong, noisy extractions, we run a machine learning algorithm to assign confidence scores to the individual statements. Furthermore, 2.5M links to DBpedia and 23.7k links to the YAGO class hierarchy were created at a precision of 97%. In total, the dataset contains 5.4B triples.

1
Leave a Reply (Click here to read the code of conduct)

avatar
1 Comment threads
0 Thread replies
0 Followers
 
Most reacted comment
Hottest comment thread
1 Comment authors
Recent comment authors
  Subscribe  
newest oldest most voted
Notify of
Guest
Svitlana

webisa.webdatacommons.org