论文标题
超越重复:旨在理解和预测问题跟踪系统中的链接类型
Beyond Duplicates: Towards Understanding and Predicting Link Types in Issue Tracking Systems
论文作者
论文摘要
软件项目使用问题跟踪系统(ITS)之类的JIRA来跟踪问题并组织周围的工作流程。问题通常是通过不同的链接相互联系的,例如默认的JIRA链接类型重复,相关,块或子任务。尽管先前的研究主要集中于分析和预测重复链接,但这项工作旨在了解其他各种链接类型,其流行率和特征,以实现更可靠的链接类型预测。为此,我们研究了607,208个链接,在15个公共JIRA存储库中连接了698,790期。除默认类型外,自定义类型依赖,合并,分裂和原因也很常见。我们将存储库中使用的所有75种链接类型分为五个一般类别:一般关系,重复,组成,时间 /因果和工作流程。比较相应图的结构,我们观察到了几种趋势。例如,重复链接通常代表更简单的问题图,其中有两个组件和组成链接呈现最高量的分层树结构(97.7%)。出乎意料的是,一般关系链接的传输评分明显高于重复和时间 /因果链接。由链接类型之间的差异和其受欢迎程度之间的动机,我们评估了JIRA数据集文献中两种最先进的副本检测方法的鲁棒性。我们发现,当前的深度学习方法几乎在所有存储库中都混淆了重复和其他链接之间的困惑。平均而言,一种方法的分类精度下降了6%,另一种方法的分类精度为12%。使用其他链接类型扩展培训集似乎部分解决了这个问题。我们讨论我们的发现及其对研究和实践的影响。
Software projects use Issue Tracking Systems (ITS) like JIRA to track issues and organize the workflows around them. Issues are often inter-connected via different links such as the default JIRA link types Duplicate, Relate, Block, or Subtask. While previous research has mostly focused on analyzing and predicting duplication links, this work aims at understanding the various other link types, their prevalence, and characteristics towards a more reliable link type prediction. For this, we studied 607,208 links connecting 698,790 issues in 15 public JIRA repositories. Besides the default types, the custom types Depend, Incorporate, Split, and Cause were also common. We manually grouped all 75 link types used in the repositories into five general categories: General Relation, Duplication, Composition, Temporal / Causal, and Workflow. Comparing the structures of the corresponding graphs, we observed several trends. For instance, Duplication links tend to represent simpler issue graphs often with two components and Composition links present the highest amount of hierarchical tree structures (97.7%). Surprisingly, General Relation links have a significantly higher transitivity score than Duplication and Temporal / Causal links. Motivated by the differences between the link types and by their popularity, we evaluated the robustness of two state-of-the-art duplicate detection approaches from the literature on the JIRA dataset. We found that current deep-learning approaches confuse between Duplication and other links in almost all repositories. On average, the classification accuracy dropped by 6% for one approach and 12% for the other. Extending the training sets with other link types seems to partly solve this issue. We discuss our findings and their implications for research and practice.