当前位置: X-MOL 学术bioRxiv. Plant Biol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Principles of resilient coding for plant ecophysiologists
bioRxiv - Plant Biology Pub Date : 2021-09-10 , DOI: 10.1101/2020.09.11.293530
Joseph R Stinziano , Cassaundra Roback , Demi Gamble , Bridget K Murphy , Patrick J Hudson , Christopher D Muir

Plant ecophysiology is founded on a rich body of physical and chemical theory, but it is challenging to connect theory with data in unambiguous, analytically rigorous, and reproducible ways. Custom scripts written in computer programming languages (coding) enable plant ecophysiologists to model plant processes and fit models to data reproducibly using advanced statistical techniques. Since many ecophysiologists lack formal programming education, we have yet to adopt a unified set of coding principles and standards that could make coding easier to learn, use, and modify. We identify eight principles to help in plant ecophysiologists without much programming experience to write resilient code: 1) standardized nomenclature, 2) consistency in style, 3) increased modularity/extensibility for easier editing and understanding, 4) code scalability for application to large datasets, 5) documented contingencies for code maintenance, 6) documentation to facilitate user understanding; 7) extensive tutorials, and 8) unit testing. We illustrate these principles using a new R package, {photosynthesis}, which provides a set of analytical and simulation tools for plant ecophysiology. Our goal with these principles is to advance scientific discovery in plant ecophysiology by making it easier to use code for simulation and data analysis, reproduce results, and rapidly incorporate new biological understanding and analytical tools.

中文翻译:

植物生态生理学家的弹性编码原则

植物生态生理学建立在丰富的物理和化学理论基础之上,但以明确、分析严谨和可重复的方式将理论与数据联系起来具有挑战性。以计算机编程语言(编码)编写的自定义脚本使植物生态生理学家能够使用先进的统计技术对植物过程进行建模并将模型与数据可重复地拟合。由于许多生态生理学家缺乏正规的编程教育,我们还没有采用一套统一的编码原则和标准,使编码更容易学习、使用和修改。我们确定了八项原则,以帮助没有太多编程经验的植物生态生理学家编写弹性代码:1) 标准化命名法,2) 风格的一致性,3) 增加模块化/可扩展性,以便于编辑和理解,4) 应用于大型数据集的代码可扩展性, 5) 记录代码维护的意外事件, 6) 便于用户理解的文档;7) 广泛的教程,和 8) 单元测试。我们使用新的 R 包 {photosynthesis} 来说明这些原理,它为植物生态生理学提供了一组分析和模拟工具。我们遵循这些原则的目标是通过更轻松地使用代码进行模拟和数据分析、重现结果并快速整合新的生物学理解和分析工具来推进植物生态生理学的科学发现。它为植物生态生理学提供了一套分析和模拟工具。我们遵循这些原则的目标是通过更轻松地使用代码进行模拟和数据分析、重现结果并快速整合新的生物学理解和分析工具来推进植物生态生理学的科学发现。它为植物生态生理学提供了一套分析和模拟工具。我们遵循这些原则的目标是通过更轻松地使用代码进行模拟和数据分析、重现结果并快速整合新的生物学理解和分析工具来推进植物生态生理学的科学发现。
更新日期:2021-09-13
down
wechat
bug