Sequential coding patterns: How to use them effectively in code recommendation
Information and Software Technology (IF 3.8), Pub Date: 2021-07-19, DOI: 10.1016/j.infsof.2021.106690
Luiz Laerte Nunes da Silva, Troy Costa Kohwalter, Alexandre Plastino, Leonardo Gresta Paulino Murta

Context:

Some programming constructs frequently appear together in different parts of the code, forming sequential coding patterns that recur throughout the project. These patterns can be mined from the project repository and, whenever the code a developer is writing coincides with the beginning of a sequential pattern, the remainder of that pattern can be suggested to the developer. This is analogous to conventional code completion, which suggests syntactic structures based on the line being written. However, instead of providing syntactic suggestions for completing the current line, this feature suggests code snippets spanning multiple lines.
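
The matching step described above can be illustrated with a minimal sketch. It assumes that code is abstracted into a sequence of tokens (for example, called method names) and that the mined patterns are already available as tuples; the function name `suggest_remainders` and the token names are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

# Assumption: a mined pattern is an ordered tuple of abstract code tokens.
Pattern = Tuple[str, ...]


def suggest_remainders(recent: Sequence[str], patterns: List[Pattern]) -> List[Pattern]:
    """Return the remainder of every pattern whose beginning matches
    the most recent tokens the developer has written."""
    suggestions: List[Pattern] = []
    for pattern in patterns:
        # Try the longest possible prefix first, and always leave at least
        # one element of the pattern to suggest.
        longest = min(len(pattern) - 1, len(recent))
        for k in range(longest, 0, -1):
            if tuple(recent[-k:]) == pattern[:k]:
                remainder = pattern[k:]
                if remainder not in suggestions:  # avoid duplicate suggestions
                    suggestions.append(remainder)
                break
    return suggestions


if __name__ == "__main__":
    mined = [
        ("open", "read", "close"),
        ("lock", "update", "unlock"),
    ]
    # The developer has just written calls abstracted as "open" and "read".
    print(suggest_remainders(["init", "open", "read"], mined))
```

In this sketch the suggestion is the multi-token tail of the pattern, which mirrors the idea of recommending several lines rather than completing the current one.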

Objective:

This paper contributes an in-depth study of how code pattern recommendation can be used effectively.

Method:

We answer three research questions through a quantitative study using a robust experimental infrastructure with a corpus of five open-source projects: (1) “In a code recommendation, how many frequent coding patterns should be presented?”, (2) “What is the impact of filtering sequential patterns by their confidence?”, and (3) “Does the effectiveness of the sequential coding patterns degrade over time?”.

Results:

Our study shows that correctness above 80% can be achieved when using the suggestions with the highest confidence values, and that a confidence threshold of 30% generally provides better outcomes. Furthermore, it shows that the effectiveness of frequent code pattern completion tends to degrade 50 commits after the patterns have been mined.

Conclusion:

We could observe that: (1) the top five ranked suggestions are the ones that deliver the best results; (2) the code recommendations that deliver the best results are the ones with the highest confidence values; and (3) the code recommendation performance degrades as the source code evolves because patterns become outdated.
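
As a rough illustration of how the reported settings could be combined, the sketch below filters candidate suggestions by a 30% confidence threshold and keeps the five highest-ranked ones. The `Suggestion` type and `select_suggestions` function are hypothetical, and the confidence value is assumed to be supplied by the mining step as a number between 0 and 1; the abstract does not specify how it is computed.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Suggestion:
    remainder: Tuple[str, ...]  # tokens that would complete the pattern
    confidence: float           # assumed to lie in [0, 1]


def select_suggestions(
    candidates: List[Suggestion],
    min_confidence: float = 0.30,  # threshold reported in the results
    top_k: int = 5,                # top five ranked suggestions
) -> List[Suggestion]:
    """Drop low-confidence candidates, then keep the top_k by confidence."""
    kept = [c for c in candidates if c.confidence >= min_confidence]
    kept.sort(key=lambda c: c.confidence, reverse=True)
    return kept[:top_k]


if __name__ == "__main__":
    confidences = [0.90, 0.20, 0.50, 0.30, 0.85, 0.60, 0.70]
    candidates = [Suggestion(("step%d" % i,), c) for i, c in enumerate(confidences)]
    for s in select_suggestions(candidates):
        print(s.remainder, s.confidence)
```

Both parameters are left configurable because the abstract describes them as values that generally work well, not as fixed constants.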
