论文标题
实验作为代码:可重现,可审计,可辩论,可重复使用和可扩展实验的概念
Experiments as Code: A Concept for Reproducible, Auditable, Debuggable, Reusable, & Scalable Experiments
论文作者
论文摘要
实验研究中的一个普遍关注点是实验的可唤起性和可重复性。实验通常是由各种专家团队(例如研究人员,技术人员和工程师)设计,提供,管理,管理和分析的,并且可能需要许多资源(例如云基础架构,专业设备)。即使研究人员努力准确记录实验,但通常缺乏此过程,因此很难重现它们。此外,当有必要创建类似的实验时,通常我们最终会“重新发明轮子”,因为从头开始比尝试重复现有工作要容易启动,从而失去了有价值的嵌入式嵌入式最佳实践和以前的经验。在行为研究中,这导致了可重复性危机。为了应对这一挑战,我们提出了“作为代码”范式的“实验”,其中整个实验不仅记录下来,还记录了自动化代码以提供,提供,部署,管理和分析。为此,我们将实验定义为代码概念,为实用实现的组成部分提供了分类法,并通过简单的桌面VR实验提供了概念证明,该实验显示了其“代码”表示的好处,即可重复性,可审核性,可审核性,可调节性,可重复性,可重复性和可伸缩性。
A common concern in experimental research is the auditability and reproducibility of experiments. Experiments are usually designed, provisioned, managed, and analyzed by diverse teams of specialists (e.g., researchers, technicians and engineers) and may require many resources (e.g. cloud infrastructure, specialized equipment). Even though researchers strive to document experiments accurately, this process is often lacking, making it hard to reproduce them. Moreover, when it is necessary to create a similar experiment, very often we end up "reinventing the wheel" as it is easier to start from scratch than trying to reuse existing work, thus losing valuable embedded best practices and previous experiences. In behavioral studies this has contributed to the reproducibility crisis. To tackle this challenge, we propose the "Experiments as Code" paradigm, where the whole experiment is not only documented but additionally the automation code to provision, deploy, manage, and analyze it is provided. To this end we define the Experiments as Code concept, provide a taxonomy for the components of a practical implementation, and provide a proof of concept with a simple desktop VR experiment that showcases the benefits of its "as code" representation, i.e., reproducibility, auditability, debuggability, reusability, and scalability.