TY - JOUR
T1 - Negative statements considered useful
AU - Arnaout, Hiba
AU - Razniewski, Simon
AU - Weikum, Gerhard
AU - Pan, Jeff Z.
N1 - Funding Information:
This work is supported by the German Science Foundation (DFG: Deutsche Forschungsgemeinschaft) by grant 4530095897 : “Negative Knowledge at Web Scale”.
Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2021/9/30
Y1 - 2021/9/30
N2 - Knowledge bases (KBs) about notable entities and their properties are an important asset in applications such as search, question answering and dialog. All popular KBs capture virtually only positive statements, and abstain from taking any stance on statements not stored in the KB. This paper makes the case for explicitly stating salient statements that do not hold. Negative statements are useful to overcome limitations of question answering systems that are mainly geared for positive questions; they can also contribute to informative summaries of entities. Due to the abundance of such invalid statements, any effort to compile them needs to address ranking by saliency. We present a statistical inference method for compiling and ranking negative statements, based on expectations from positive statements of related entities in peer groups. Experimental results, with a variety of datasets, show that the method can effectively discover notable negative statements, and extrinsic studies underline their usefulness for entity summarization. Datasets and code are released as resources for further research.
AB - Knowledge bases (KBs) about notable entities and their properties are an important asset in applications such as search, question answering and dialog. All popular KBs capture virtually only positive statements, and abstain from taking any stance on statements not stored in the KB. This paper makes the case for explicitly stating salient statements that do not hold. Negative statements are useful to overcome limitations of question answering systems that are mainly geared for positive questions; they can also contribute to informative summaries of entities. Due to the abundance of such invalid statements, any effort to compile them needs to address ranking by saliency. We present a statistical inference method for compiling and ranking negative statements, based on expectations from positive statements of related entities in peer groups. Experimental results, with a variety of datasets, show that the method can effectively discover notable negative statements, and extrinsic studies underline their usefulness for entity summarization. Datasets and code are released as resources for further research.
KW - Information extraction
KW - Knowledge bases
KW - Negative knowledge
KW - Ranking
KW - Statistical inference
UR - http://www.scopus.com/inward/record.url?scp=85116194939&partnerID=8YFLogxK
U2 - 10.1016/j.websem.2021.100661
DO - 10.1016/j.websem.2021.100661
M3 - Article
AN - SCOPUS:85116194939
SN - 1570-8268
VL - 71
JO - Journal of Web Semantics
JF - Journal of Web Semantics
M1 - 100661
ER -