您的位置 首页 > 腾讯云社区

通过将语义和统计技术结合来动态丰富网络主体(cs.CL)---用户7199428

通过将语义和统计技术结合来动态丰富网络主体(cs.CL)

翻译:伴随着语义网技术的发展,开始更多使用本体来存储和提取覆盖多个领域的信息。但是,很少有本体能够处理得当不断更新的语义信息日益增长的需求,亦或是针对专业领域用户的具体需求。因此,现今最大的问题就在于无法使用不同概念之间相关联的信息,也是所谓的丢失的背景知识。针对这种问题的一个解决方案就是通过领域专家来人力丰富主体,但这是一个消耗时间和成本的过程,因为这就产生了对动态主体丰富的需求,在这篇论文里我们将展现一种自动地结合统计语义框架用于动态地丰富来自于万维网的大范围通用主体。使用编码于网站上的文章的大量信息充当语料库,丢失的背景信息因此能够通过语义关联性测量和模式学习技术的组合来挖掘得到,并且用于之后进一步开发。我们方法的优势在于:1、提出了一种动态丰富存在缺失背景知识的大范围通用主体的方式,并且同时来实现了这类知识的重复使用。2、解决了需要领域专家人工丰富主体的成本较大的问题,实验结果经过了精确评估,展现了我们提出技术的有效性。

原文题目:Coupling semantic and statistical techniques for dynamically enriching web ontologies

原文:With the development of the Semantic Web technology, the use of ontologies to store and retrieve information covering several domains has increased. However, very few ontologies are able to cope with the ever-growing need of frequently updated semantic information or specific user requirements in specialized domains. As a result, a critical issue is related to the unavailability of relational information between concepts, also coined missing background knowledge. One solution to address this issue relies on the manual enrichment of ontologies by domain experts which is however a time consuming and costly process, hence the need for dynamic ontology enrichment. In this paper we present an automatic coupled statistical/semantic framework for dynamically enriching large-scale generic ontologies from the World Wide Web. Using the massive amount of information encoded in texts on the Web as a corpus, missing background knowledge can therefore be discovered through a combination of semantic relatedness measures and pattern acquisition techniques and subsequently exploited. The benefits of our approach are: (i) proposing the dynamic enrichment of large-scale generic ontologies with missing background knowledge, and thus, enabling the reuse of such knowledge, (ii) dealing with the issue of costly ontological manual enrichment by domain experts. Experimental results in a precision-based evaluation setting demonstrate the effectiveness of the proposed techniques.

原文作者:Mohammed Maree, Mohammed Belkhatir

原文地址:https://arxiv.org/abs/2004.11081

通过将语义和统计技术结合来动态丰富网络主体(cs.CL).pdf ---来自腾讯云社区的---用户7199428

关于作者: 瞎采新闻

这里可以显示个人介绍!这里可以显示个人介绍!

热门文章

留言与评论(共有 0 条评论)
   
验证码: