论文标题
短号:学习表格格式规则
CORNET: Learning Table Formatting Rules By Example
论文作者
论文摘要
电子表格广泛用于桌面操作和演示。这些表的风格格式是演示和分析的重要属性。结果,流行的电子表格软件(例如Excel)支持基于规则的自动格式表。不幸的是,编写此类格式规则对于用户来说可能是具有挑战性的,因为它需要了解基本规则语言和数据逻辑。我们提出Cornet,该系统可以解决新的问题,即以格式化的单元格的形式从用户示例中自动学习此类格式化规则。 Cornet从归纳编程的进步中汲取灵感,并将象征性规则枚举与神经等级者相结合,以学习有条件的格式规则。为了激励和评估我们的方法,我们从超过180万个实际工作表的语料库中提取了超过450k的唯一格式规则的表格。由于我们是第一个引入条件格式化的人,因此我们将短号与从相关域改编的各种符号和神经基准进行了比较。我们的结果表明,Cornet可以准确地学习各种评估设置的规则。此外,我们表明Cornet找到的规则比用户所编写的规则要短,并在用户手动格式化的电子表格中发现规则。
Spreadsheets are widely used for table manipulation and presentation. Stylistic formatting of these tables is an important property for both presentation and analysis. As a result, popular spreadsheet software, such as Excel, supports automatically formatting tables based on rules. Unfortunately, writing such formatting rules can be challenging for users as it requires knowledge of the underlying rule language and data logic. We present CORNET, a system that tackles the novel problem of automatically learning such formatting rules from user examples in the form of formatted cells. CORNET takes inspiration from advances in inductive programming and combines symbolic rule enumeration with a neural ranker to learn conditional formatting rules. To motivate and evaluate our approach, we extracted tables with over 450K unique formatting rules from a corpus of over 1.8M real worksheets. Since we are the first to introduce conditional formatting, we compare CORNET to a wide range of symbolic and neural baselines adapted from related domains. Our results show that CORNET accurately learns rules across varying evaluation setups. Additionally, we show that CORNET finds shorter rules than those that a user has written and discovers rules in spreadsheets that users have manually formatted.