基于肯德尔的tau

论文标题

基于肯德尔的tau

On Variants of Root Normalised Order-aware Divergence and a Divergence based on Kendall's Tau

论文作者

Sakai, Tetsuya

论文摘要

本文报告了对Sakai报告的工作的后续研究，该研究探讨了序数定量任务的适当评估措施。更具体地说，本研究还定义和评估了前面考虑的量化度量之外，还定义了一种称为根归一级的订单差异差异（RNOD）的序数定量措施的几种变体，以及我们称之为基于Kendall $τ$（DNKT）的差异的措施。 RNOD变体代表了基于Sakai的距离加权正方形总和（DW）的思想的替代设计选择，而DNKT旨在确保系统对类的估计分布忠实于类别的目标优先级。由于DNKT的优先保留属性（PPP）在某些应用中可能很有用，因此我们还考虑将某些现有的量化指标与DNKT合并。我们使用八个序数量化数据集的实验表明，在系统排名一致性方面，RNOD的变体至少与原始RNOD相比没有任何好处，即系统排名对测试数据的选择的鲁棒性。在本研究中考虑的所有序数定量措施（包括归一化匹配距离，又称地球移动器的距离）中，RNOD是总体上最健壮的度量。因此，从这个角度来看，RNOD的设计选择是一个不错的选择。同样，就系统排名一致性而言，DNKT是表现最差的人。因此，如果DNKT似乎适合执行任务，则样本量设计应考虑其统计不稳定。

This paper reports on a follow-up study of the work reported in Sakai, which explored suitable evaluation measures for ordinal quantification tasks. More specifically, the present study defines and evaluates, in addition to the quantification measures considered earlier, a few variants of an ordinal quantification measure called Root Normalised Order-aware Divergence (RNOD), as well as a measure which we call Divergence based on Kendall's $τ$ (DNKT). The RNOD variants represent alternative design choices based on the idea of Sakai's Distance-Weighted sum of squares (DW), while DNKT is designed to ensure that the system's estimated distribution over classes is faithful to the target priorities over classes. As this Priority Preserving Property (PPP) of DNKT may be useful in some applications, we also consider combining some of the existing quantification measures with DNKT. Our experiments with eight ordinal quantification data sets suggest that the variants of RNOD do not offer any benefit over the original RNOD at least in terms of system ranking consistency, i.e., robustness of the system ranking to the choice of test data. Of all ordinal quantification measures considered in this study (including Normalised Match Distance, a.k.a. Earth Mover's Distance), RNOD is the most robust measure overall. Hence the design choice of RNOD is a good one from this viewpoint. Also, DNKT is the worst performer in terms of system ranking consistency. Hence, if DNKT seems appropriate for a task, sample size design should take its statistical instability into account.

下载PDF全文

下载文献需遵守相关版权规定

论文标题