Paper title

Abstracting spreadsheet data flow through hypergraph redrawing

Authors

Birch, David; Stawinoga, Nicolai; Binks, Jack; Nicoletti, Bruno; Kelly, Paul

Abstract

We believe the error-prone nature of traditional spreadsheets is due to their low level of abstraction. End-user programmers are forced to construct their data models from low-level cells, which we define as "a data container or manipulator linked by user-intent to model their world and positioned to reflect its structure". Spreadsheet cells are limited in what they may contain (scalar values), and the links between them are inherently hidden. This paper proposes a method of raising the level of abstraction of spreadsheets by "redrawing the boundary" of the cell. To expose the hidden linkage structure, we transform spreadsheets into fine-grained graphs with operators and values as nodes. "Cells" are then represented as hypergraph edges by drawing a boundary "wall" around a set of operator/data nodes. To extend what cells may contain and to create a higher-level model of the spreadsheet, we propose that researchers should seek techniques to redraw these boundaries to create higher-level "cells" which will more faithfully represent the end-user's real-world/mental model. We illustrate this approach via common sub-expression identification and the application of sub-tree isomorphisms for the detection of vector (array) operations.
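
The representation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy sheet, the function names (`build_hypergraph`, `subtree_key`, `common_subexpressions`), and the use of Python's `ast` module to parse formulas written in Python-compatible arithmetic are all assumptions made for illustration. Each formula becomes a tree of operator and value nodes, each cell becomes a hyperedge (the set of node ids inside its boundary), and structurally identical sub-trees are grouped as candidate common sub-expressions.

```python
import ast
from collections import defaultdict

# Hypothetical toy sheet: cell name -> formula in Python-compatible arithmetic.
SHEET = {
    "A1": "10",
    "A2": "20",
    "B1": "A1 * 2 + 1",
    "B2": "A2 * 2 + 1",
    "C1": "(A1 * 2 + 1) + (A2 * 2 + 1)",
}


def build_hypergraph(sheet):
    """Turn every formula into fine-grained operator/value nodes.

    Returns (nodes, hyperedges, roots):
      nodes      -- node id -> (label, tuple of child node ids)
      hyperedges -- cell name -> set of node ids inside that cell's boundary
      roots      -- cell name -> id of the formula's root node
    """
    nodes = {}
    hyperedges = defaultdict(set)
    roots = {}

    def visit(expr, cell):
        if isinstance(expr, ast.BinOp):
            children = (visit(expr.left, cell), visit(expr.right, cell))
            label = type(expr.op).__name__        # e.g. "Add", "Mult"
        elif isinstance(expr, ast.Constant):
            label, children = repr(expr.value), ()
        elif isinstance(expr, ast.Name):
            label, children = "ref:" + expr.id, ()  # link to another cell
        else:
            raise ValueError("unsupported syntax in cell " + cell)
        node_id = len(nodes)
        nodes[node_id] = (label, children)
        hyperedges[cell].add(node_id)
        return node_id

    for cell, formula in sheet.items():
        roots[cell] = visit(ast.parse(formula, mode="eval").body, cell)
    return nodes, dict(hyperedges), roots


def subtree_key(nodes, node_id, abstract_refs=False):
    """Canonical key of the sub-tree rooted at node_id.

    With abstract_refs=True, cell references are treated as interchangeable,
    so sub-trees that differ only in which cells they read get the same key.
    """
    label, children = nodes[node_id]
    if abstract_refs and label.startswith("ref:"):
        label = "ref"
    return (label, tuple(subtree_key(nodes, c, abstract_refs) for c in children))


def common_subexpressions(nodes):
    """Group non-leaf nodes whose sub-trees are structurally identical."""
    groups = defaultdict(list)
    for node_id, (_, children) in nodes.items():
        if children:  # skip leaves: bare constants and references
            groups[subtree_key(nodes, node_id)].append(node_id)
    return {key: ids for key, ids in groups.items() if len(ids) > 1}


nodes, hyperedges, roots = build_hypergraph(SHEET)
shared = common_subexpressions(nodes)
same_shape = (subtree_key(nodes, roots["B1"], abstract_refs=True)
              == subtree_key(nodes, roots["B2"], abstract_refs=True))
```

On this toy sheet, `shared` groups the sub-expressions of `C1` that repeat the formulas in `B1` and `B2`, and `same_shape` is true because `B1` and `B2` differ only in the cell they reference. Matching sub-tree shapes in this way is a simplified stand-in for the sub-tree isomorphism the abstract mentions as a route to detecting vector (array) operations; "redrawing the boundary" would then correspond to merging or regrouping the node sets in `hyperedges`.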
