Title

The Harmonic Indel Distance

Author

Pepin, Bob

Abstract

This short note introduces the harmonic indel distance (HID), a new distance between strings where the cost of an insertion or deletion is inversely proportional to the string length. We present a closed-form formula and show that the HID is a proper distance metric. Then we perform an experimental comparison of HID to normalized and unnormalized versions of the indel distance on benchmark tasks for biomedical sequence data. We finally show planar embeddings of the benchmark datasets to provide some insights into the geometry of the metric spaces associated with the different distance metrics.
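
To make the distances named in the abstract concrete, here is a minimal Python sketch. The plain indel distance (insertions and deletions only, each at unit cost) and its length-normalized variant follow the standard definitions based on the longest common subsequence (LCS). The function `harmonic_indel_sketch` is an assumption, not the paper's confirmed formula: it reads "cost inversely proportional to the string length" as charging 1/k for an edit that removes a character from, or grows a string to, length k, and it routes all edits through an LCS. Under that reading the cost collapses to a difference of harmonic numbers. All function names are hypothetical.

```python
from fractions import Fraction


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence, via row-by-row dynamic programming."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def indel_distance(a: str, b: str) -> int:
    """Unnormalized indel distance: every insertion or deletion costs 1."""
    return len(a) + len(b) - 2 * lcs_length(a, b)


def normalized_indel_distance(a: str, b: str) -> float:
    """Indel distance divided by the combined length (0.0 for two empty strings)."""
    total = len(a) + len(b)
    return indel_distance(a, b) / total if total else 0.0


def harmonic_number(n: int) -> Fraction:
    """H_n = 1 + 1/2 + ... + 1/n, with H_0 = 0."""
    return sum((Fraction(1, k) for k in range(1, n + 1)), Fraction(0))


def harmonic_indel_sketch(a: str, b: str) -> float:
    """ASSUMED form, not taken from the paper: H_|a| + H_|b| - 2 * H_lcs.

    Deleting from length k down to k-1 (or inserting from k-1 up to k) is
    charged 1/k, and the edit path is assumed to pass through an LCS.
    """
    l = lcs_length(a, b)
    total = harmonic_number(len(a)) + harmonic_number(len(b)) - 2 * harmonic_number(l)
    return float(total)


if __name__ == "__main__":
    a, b = "ACGT", "AGT"
    print(indel_distance(a, b))             # one deletion separates the strings
    print(normalized_indel_distance(a, b))  # 1 / (4 + 3)
    print(harmonic_indel_sketch(a, b))      # H_4 - H_3 under the assumed cost model
```

Under this assumed cost model, the same single edit is cheaper between long strings than between short ones, which is the qualitative behavior the abstract describes; the exact formula and the metric proof should be taken from the paper itself.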
