论文标题

Calliope:电子表格的自动视觉数据故事生成

Calliope: Automatic Visual Data Story Generation from a Spreadsheet

论文作者

Shi, Danqing, Xu, Xinyue, Sun, Fuling, Shi, Yang, Cao, Nan

论文摘要

以叙事可视化(例如海报或数据视频)形式显示的视觉数据故事经常用于以数据为导向的讲故事,以促进对故事内容的理解和记忆。尽管有用,但技术障碍(例如数据分析,可视化和脚本)使视觉数据故事的产生变得困难。现有的创作工具依靠用户的技能和经验,这些技能和经验通常效率低下且仍然困难。在本文中,我们介绍了一种新颖的视觉数据故事生成系统Calliope,该系统通过自动过程和设施从输入电子表格中创建视觉数据故事,并根据在线故事编辑器对生成的故事的简化修订。尤其是,Calliope结合了一种新的面向逻辑的蒙特卡洛树搜索算法,该算法探讨了输入电子表格给出的数据空间,以逐步生成故事作品(即数据事实)并以逻辑顺序组织它们。数据事实的重要性是根据信息理论来衡量的,每个数据事实在图表中可视化并由自动生成的描述标题。我们通过三个示例故事,两个受控的实验以及对10个领域专家的一系列访谈来评估提出的技术。我们的评估表明,Calliope对有效的视觉数据故事生成有益。

Visual data stories shown in the form of narrative visualizations such as a poster or a data video, are frequently used in data-oriented storytelling to facilitate the understanding and memorization of the story content. Although useful, technique barriers, such as data analysis, visualization, and scripting, make the generation of a visual data story difficult. Existing authoring tools rely on users' skills and experiences, which are usually inefficient and still difficult. In this paper, we introduce a novel visual data story generating system, Calliope, which creates visual data stories from an input spreadsheet through an automatic process and facilities the easy revision of the generated story based on an online story editor. Particularly, Calliope incorporates a new logic-oriented Monte Carlo tree search algorithm that explores the data space given by the input spreadsheet to progressively generate story pieces (i.e., data facts) and organize them in a logical order. The importance of data facts is measured based on information theory, and each data fact is visualized in a chart and captioned by an automatically generated description. We evaluate the proposed technique through three example stories, two controlled experiments, and a series of interviews with 10 domain experts. Our evaluation shows that Calliope is beneficial to efficient visual data story generation.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源