
Towards Finite-State Morphology of Kurdish (CS CL) --- 刘子蔚

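Morphological analysis is the study of how words are formed and structured. It plays a crucial role in many Natural Language Processing (NLP) and Computational Linguistics (CL) tasks, such as machine translation and text and speech generation. Kurdish is a less-resourced, multi-dialect Indo-European language with highly inflectional morphology. This paper, presented as the first attempt of its kind, describes the morphology of Kurdish (Sorani dialect) from a computational point of view. The authors extract morphological rules and transform them into finite-state transducers for generating and analyzing words.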

Original title: Towards Finite-State Morphology of Kurdish

Original abstract: Morphological analysis is the study of the formation and structure of words. It plays a crucial role in various tasks in Natural Language Processing (NLP) and Computational Linguistics (CL) such as machine translation and text and speech generation. Kurdish is a less-resourced multi-dialect Indo-European language with highly inflectional morphology. In this paper, as the first attempt of its kind, the morphology of the Kurdish language (Sorani dialect) is described from a computational point of view. We extract morphological rules which are transformed into finite-state transducers for generating and analyzing words. The result of this research assists in conducting studies on language generation for Kurdish and enhances the Information Retrieval (IR) capacity for the language while leveraging the Kurdish NLP and CL into a more advanced computational level.
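To make the idea of a finite-state transducer used for both generation and analysis more concrete, here is a minimal Python sketch. It is not the authors' implementation and contains no Kurdish data: the states, the toy English-like stems (`walk`, `talk`), the tags (`+PRES`, `+PAST`, `+INF`), and the function names `generate` and `analyze` are all assumptions made for illustration. Each arc pairs a lexical symbol with a surface string, so the same arc table can be read in either direction.

```python
from typing import Dict, List, Set, Tuple

EPS = ""  # epsilon: the arc emits (or consumes) no surface characters

# Each arc is (lexical_symbol, surface_string, next_state).
Arc = Tuple[str, str, int]

# Hypothetical toy lexicon and suffix rules (English-like, NOT Kurdish data).
ARCS: Dict[int, List[Arc]] = {
    0: [("walk", "walk", 1), ("talk", "talk", 1)],                # stems
    1: [("+PRES", "s", 2), ("+PAST", "ed", 2), ("+INF", EPS, 2)],  # suffixes
}
FINAL: Set[int] = {2}


def generate(lexical: List[str]) -> List[str]:
    """Map a lexical sequence (stem plus tags) to its surface forms."""
    results: List[str] = []

    def step(state: int, i: int, out: str) -> None:
        if i == len(lexical):
            if state in FINAL:
                results.append(out)
            return
        for lex, surf, nxt in ARCS.get(state, []):
            if lex == lexical[i]:
                step(nxt, i + 1, out + surf)

    step(0, 0, "")
    return results


def analyze(word: str) -> List[List[str]]:
    """Map a surface word back to every lexical sequence that produces it."""
    results: List[List[str]] = []

    def step(state: int, rest: str, path: List[str]) -> None:
        if not rest and state in FINAL:
            results.append(path)
        for lex, surf, nxt in ARCS.get(state, []):
            if rest.startswith(surf):
                step(nxt, rest[len(surf):], path + [lex])

    step(0, word, [])
    return results


if __name__ == "__main__":
    print(generate(["walk", "+PAST"]))
    print(analyze("talks"))
```

Reading the arcs from the lexical side to the surface side gives generation, and matching surface strings while collecting lexical symbols gives analysis. Dedicated finite-state toolkits compile rules into much larger transducers of this kind; this sketch only shows the two-way lookup on a hand-written arc table.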

Authors: Sina Ahmadi, Hossein Hassani

Original URL: https://arxiv.org/abs/2005.10652

Towards Finite-State Morphology of Kurdish.pdf (from the Tencent Cloud community, posted by 刘子蔚)
