您的位置 首页 > 腾讯云社区

具有学术论文链接的GitHub存储库:开放访问,可追溯性和演进(CS.SE)---蔡小雪7100294

在已发布的科学突破及其实现之间的可追溯性至关重要,尤其是在开源软件将前沿科学实现到其代码中的情况下。但是,对齐GitHub存储库和学术论文之间的链接可能会很困难,并且链接影响仍然未知。本文研究了这些知识库中包含的学术论文参考的作用。我们对2万个GitHub存储库进行了大规模研究,以建立对学术论文引用的普遍性。我们使用混合方法来识别链接的开放访问(OA),可追溯性和演化方面。尽管参考论文不是很典型,但我们发现绝大多数参考学术论文都是OA。在可追溯性方面,我们的分析表明,机器学习是存储库中最普遍的主题。这些存储库往往隶属于学术团体。超过一半的论文没有链接回任何存储库。引用arXiv论文的案例研究表明,这些论文大多数具有很高的影响力和影响力,并且与学术界保持一致,并由以不同编程语言编写的存储库引用。从进化的角度来看,我们发现所引用论文及其链接的变化很小。

原文标题:GitHub Repositories with Links to Academic Papers: Open Access, Traceability, and Evolution

原文:Traceability between published scientific breakthroughs and their implementation is essential, especially in the case of Open Source Software implements bleeding edge science into its code. However, aligning the link between GitHub repositories and academic papers can prove difficult, and the link impact remains unknown. This paper investigates the role of academic paper references contained in these repositories. We conducted a large-scale study of 20 thousand GitHub repositories to establish prevalence of references to academic papers. We use a mixed-methods approach to identify Open Access (OA), traceability and evolutionary aspects of the links. Although referencing a paper is not typical, we find that a vast majority of referenced academic papers are OA. In terms of traceability, our analysis revealed that machine learning is the most prevalent topic of repositories. These repositories tend to be affiliated with academic communities. More than half of the papers do not link back to any repository. A case study of referenced arXiv paper shows that most of these papers are high-impact and influential and do align with academia, referenced by repositories written in different programming languages. From the evolutionary aspect, we find very few changes of papers being referenced and links to them.

原文作者:Supatsara Wattanakriengkrai, Bodin Chinthanet, Hideaki Hata, Raula Gaikovina Kula, Christoph Treude, Jin Guo, Kenichi Matsumoto

原文地址:https://arxiv.org/abs/2004.00199

具有学术论文链接的GitHub存储库:开放访问,可追溯性和演进(CS.SE).pdf ---来自腾讯云社区的---蔡小雪7100294

关于作者: 瞎采新闻

这里可以显示个人介绍!这里可以显示个人介绍!

热门文章

留言与评论(共有 0 条评论)
   
验证码: