增量双线性网络用于增量多股时间序列分类

论文标题

增量双线性网络用于增量多股时间序列分类

Augmented Bilinear Network for Incremental Multi-Stock Time-Series Classification

论文作者

Shabani, Mostafa, Tran, Dat Thanh, Kanniainen, Juho, Iosifidis, Alexandros

论文摘要

深度学习模型已在解决财务时间序列分析问题，推翻常规机器学习和统计方法方面占主导地位。大多数情况下，由于市场条件固有的差异，无法直接将针对一个市场或安全培训的模型直接应用于另一个市场或安全性。此外，随着市场随着时间的推移的发展，有必要在提供新数据时更新现有模型或培训新模型。大多数财务预测应用程序中固有的这种情况自然会提出以下研究问题：如何有效地将预训练的模型适应一组新的数据，同时保留旧数据的性能，尤其是当旧数据无法访问时？在本文中，我们提出了一种方法，以有效保留在一组证券上预先培训的神经网络中可用的知识，并使其适应以实现新的知识。在我们的方法中，通过保持现有连接的固定来维护预先训练的神经网络中编码的先验知识，并且通过一组增强连接对新证券进行调整，并使用新数据对新证券进行调整。辅助连接被限制为低级。这不仅使我们能够快速针对新任务进行优化，而且还可以降低部署阶段的存储和运行时间复杂性。我们的方法的效率在使用大规模限制订单数据集的股票中价移动预测问题中得到了经验验证。实验结果表明，我们的方法增强了预测性能，并减少了网络参数的总数。

Deep Learning models have become dominant in tackling financial time-series analysis problems, overturning conventional machine learning and statistical methods. Most often, a model trained for one market or security cannot be directly applied to another market or security due to differences inherent in the market conditions. In addition, as the market evolves through time, it is necessary to update the existing models or train new ones when new data is made available. This scenario, which is inherent in most financial forecasting applications, naturally raises the following research question: How to efficiently adapt a pre-trained model to a new set of data while retaining performance on the old data, especially when the old data is not accessible? In this paper, we propose a method to efficiently retain the knowledge available in a neural network pre-trained on a set of securities and adapt it to achieve high performance in new ones. In our method, the prior knowledge encoded in a pre-trained neural network is maintained by keeping existing connections fixed, and this knowledge is adjusted for the new securities by a set of augmented connections, which are optimized using the new data. The auxiliary connections are constrained to be of low rank. This not only allows us to rapidly optimize for the new task but also reduces the storage and run-time complexity during the deployment phase. The efficiency of our approach is empirically validated in the stock mid-price movement prediction problem using a large-scale limit order book dataset. Experimental results show that our approach enhances prediction performance as well as reduces the overall number of network parameters.

下载PDF全文

下载文献需遵守相关版权规定

论文标题