连续时间事件数据的用户依赖性神经序列模型

论文标题

连续时间事件数据的用户依赖性神经序列模型

User-Dependent Neural Sequence Models for Continuous-Time Event Data

论文作者

Boyd, Alex, Bamler, Robert, Mandt, Stephan, Smyth, Padhraic

论文摘要

连续时间事件数据在个人行为数据，金融交易和医疗健康记录等应用中很常见。对此类数据进行建模可能非常具有挑战性，特别是对于许多不同类型事件的应用程序，因为它需要模型来预测事件类型以及发生的时间。参数化时间变化强度函数的复发性神经网络是使用此类数据进行预测建模的当前最新技术。这些模型通常假定所有事件序列均来自相同的数据分布。但是，在许多应用程序中，事件序列是由不同来源或用户生成的，它们的特征可能会大不相同。在本文中，我们将广泛的神经标记点过程模型扩展到潜在嵌入的混合物，在该混合物中，每个混合物组件都会建造给定用户的特征性状。我们的方法依赖于使用编码用户特征的潜在变量来增强这些模型，该变量由混合模型代表，而不是用户行为，该模型是通过摊销的变异推理训练的。我们在四个大型现实世界数据集上评估了我们的方法，并从现有工作的方法中证明了系统的进步，以获取各种预测指标，例如日志样式，下一个事件排名和序列验证源标识。

Continuous-time event data are common in applications such as individual behavior data, financial transactions, and medical health records. Modeling such data can be very challenging, in particular for applications with many different types of events, since it requires a model to predict the event types as well as the time of occurrence. Recurrent neural networks that parameterize time-varying intensity functions are the current state-of-the-art for predictive modeling with such data. These models typically assume that all event sequences come from the same data distribution. However, in many applications event sequences are generated by different sources, or users, and their characteristics can be very different. In this paper, we extend the broad class of neural marked point process models to mixtures of latent embeddings, where each mixture component models the characteristic traits of a given user. Our approach relies on augmenting these models with a latent variable that encodes user characteristics, represented by a mixture model over user behavior that is trained via amortized variational inference. We evaluate our methods on four large real-world datasets and demonstrate systematic improvements from our approach over existing work for a variety of predictive metrics such as log-likelihood, next event ranking, and source-of-sequence identification.

下载PDF全文

下载文献需遵守相关版权规定

论文标题