T-Visne：T-SNE预测的互动评估和解释

论文标题

T-Visne：T-SNE预测的互动评估和解释

t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

论文作者

Chatzimparmpas, Angelos, Martins, Rafael M., Kerren, Andreas

论文摘要

事实证明，用于可视化多维数据的T-分布的随机邻居嵌入（T-SNE）已被证明是一种流行的方法，并在广泛的域中成功应用。尽管它们有用，但T-SNE预测可能很难解释甚至误导，这损害了结果的可信度。了解T-SNE本身的细节以及其输出中特定模式背后的原因可能是一项艰巨的任务，尤其是对于降低维度的非专家而言。在这项工作中，我们提出了T-Visne，这是一种可视探索T-SNE投影的交互式工具，使分析师能够检查其准确性和含义的不同方面，例如超参数，距离和邻居保存，特定社区的密度和成本的影响，特定邻里的密度和成本以及维度和视觉模式之间的相关性。我们提出了一个连贯，易于访问且完善的不同视图集合，以可视化T-SNE投影。 T-Visne的适用性和可用性通过假设的使用方案（带有实际数据集）来证明。最后，我们介绍了评估工具有效性的用户研究结果。通过带来通常在运行T-SNE后通常会丢失的光信息，我们希望支持分析师使用T-SNE并使其结果更好地理解。

t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of multidimensional data has proven to be a popular approach, with successful applications in a wide range of domains. Despite their usefulness, t-SNE projections can be hard to interpret or even misleading, which hurts the trustworthiness of the results. Understanding the details of t-SNE itself and the reasons behind specific patterns in its output may be a daunting task, especially for non-experts in dimensionality reduction. In this work, we present t-viSNE, an interactive tool for the visual exploration of t-SNE projections that enables analysts to inspect different aspects of their accuracy and meaning, such as the effects of hyper-parameters, distance and neighborhood preservation, densities and costs of specific neighborhoods, and the correlations between dimensions and visual patterns. We propose a coherent, accessible, and well-integrated collection of different views for the visualization of t-SNE projections. The applicability and usability of t-viSNE are demonstrated through hypothetical usage scenarios with real data sets. Finally, we present the results of a user study where the tool's effectiveness was evaluated. By bringing to light information that would normally be lost after running t-SNE, we hope to support analysts in using t-SNE and making its results better understandable.

下载PDF全文

下载文献需遵守相关版权规定

论文标题