通过从开放世界数据转移学习来增强自动软件可追溯性

论文标题

通过从开放世界数据转移学习来增强自动软件可追溯性

Enhancing Automated Software Traceability by Transfer Learning from Open-World Data

论文作者

Lin, Jinfeng, Poudel, Amrit, Yu, Wenhao, Zeng, Qingkai, Jiang, Meng, Cleland-Huang, Jane

论文摘要

软件需求可追溯性是软件工程过程的关键组成部分，启用了要求验证，合规性验证和安全保证等活动。但是，手动在自然语言伪像（例如需求，设计和测试案例）上手动建立一组痕量链接的成本和精力可能是非常昂贵的。因此，研究人员提出了主要基于信息回程（IR）技术的自动连接生成解决方案。但是，这些解决方案未能实现在工业项目中充分采用所需的准确性。使用深度学习的可追溯性模型可以改善；但是，它们的功效受到项目级文物的规模和可用性的有限和可用性链接的障碍，以作为培训数据。在本文中，我们通过提出和评估文本到文本可追溯性的几种深度学习方法来解决这个问题。我们的方法名为NLTRACE，探讨了三种使用从开放世界平台开采的数据集的三种转移学习策略。通过预训练的语言模型（LMS）并利用相邻的跟踪任务，我们证明，当培训链接可用时，NLTRACE可以显着提高基于LM的痕量模型的性能。在这种情况下，NLTRACE的表现优于最佳性能经典IR方法，其F2得分提高了188％，平均平均精度为94.01％（MAP）。对于F2和地图，它还优于基于LM的一般痕量模型，分别胜过7％和23％。此外，Nltrace可以适应其他LM型号无法的低资源跟踪场景。从相邻任务中学到的知识使NLTRACE在提供少量培训示例时，在发电挑战方面可以优于VSM模型28％F2。

Software requirements traceability is a critical component of the software engineering process, enabling activities such as requirements validation, compliance verification, and safety assurance. However, the cost and effort of manually creating a complete set of trace links across natural language artifacts such as requirements, design, and test-cases can be prohibitively expensive. Researchers have therefore proposed automated link-generation solutions primarily based on information-retrieval (IR) techniques; however, these solutions have failed to deliver the accuracy needed for full adoption in industrial projects. Improvements can be achieved using deep-learning traceability models; however, their efficacy is impeded by the limited size and availability of project-level artifacts and links to serve as training data. In this paper, we address this problem by proposing and evaluating several deep-learning approaches for text-to-text traceability. Our method, named NLTrace, explores three transfer learning strategies that use datasets mined from open world platforms. Through pretraining Language Models (LMs) and leveraging adjacent tracing tasks, we demonstrate that NLTrace can significantly improve the performance of LM based trace models when training links are available. In such scenarios NLTrace outperforms the best performing classical IR method with an 188% improvement in F2 score and 94.01% in Mean Average Precision (MAP). It also outperforms the general LM based trace model by 7% and 23% for F2 and MAP respectively. In addition, NLTrace can adapt to low-resource tracing scenarios where other LM models can not. The knowledge learned from adjacent tasks enables NLTrace to outperform VSM models by 28% F2 on generation challenges when presented with a small number of training examples.

下载PDF全文

下载文献需遵守相关版权规定

论文标题