当前位置: X-MOL 学术IEEE Trans. Softw. Eng. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Pull Request Decisions Explained: An Empirical Overview
IEEE Transactions on Software Engineering ( IF 7.4 ) Pub Date : 2022-04-05 , DOI: 10.1109/tse.2022.3165056
Xunhui Zhang 1 , Yue Yu 1 , Georgios Gousios 2 , Ayushi Rastogi 3
Affiliation  

Context : The pull-based development model is widely used in open source projects, leading to the emergence of trends in distributed software development. One aspect that has garnered significant attention concerning pull request decisions is the identification of explanatory factors. Objective : This study builds on a decade of research on pull request decisions and provides further insights. We empirically investigate how factors influence pull request decisions and the scenarios that change the influence of such factors. Method : We identify factors influencing pull request decisions on GitHub through a systematic literature review and infer them by mining archival data. We collect a total of 3,347,937 pull requests with 95 features from 11,230 diverse projects on GitHub. Using these data, we explore the relations among the factors and build mixed effects logistic regression models to empirically explain pull request decisions. Results : Our study shows that a small number of factors explain pull request decisions, with that concerning whether the integrator is the same as or different from the submitter being the most important factor. We also note that the influence of factors on pull request decisions change with a change in context; e.g., the area hotness of pull request is important only in the early stage of project development, however it becomes unimportant for pull request decisions as projects become mature.

中文翻译:

拉取请求决策解释:实证概述

背景:基于拉动的开发模式在开源项目中被广泛使用,导致分布式软件开发趋势的出现。在拉取请求决策方面引起广泛关注的一个方面是解释因素的识别。目标:本研究建立在对拉取请求决策的十年研究的基础上,并提供了进一步的见解。我们根据经验调查因素如何影响拉取请求决策以及改变这些因素影响的场景。方法:我们通过系统的文献回顾确定影响 GitHub 上的拉取请求决策的因素,并通过挖掘档案数据来推断它们。我们从 GitHub 上的 11,230 个不同项目中收集了总共 3,347,937 个具有 95 个特性的拉取请求。使用这些数据,我们探索了因素之间的关系,并建立了混合效应逻辑回归模型,以凭经验解释拉取请求决策。结果:我们的研究表明,少数因素可以解释拉取请求的决定,其中关于集成者是否与提交者相同或不同的因素是最重要的因素。我们还注意到,因素对拉取请求决策的影响随着上下文的变化而变化;例如,pull request 的区域热度仅在项目开发的早期阶段很重要,但是随着项目的成熟,pull request 的决策变得不重要。
更新日期:2022-04-05
down
wechat
bug