Dreamnet：基于SPD流形学习的深层Riemannian网络用于视觉分类

论文标题

Dreamnet：基于SPD流形学习的深层Riemannian网络用于视觉分类

DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual Classification

论文作者

Wang, Rui, Wu, Xiao-Jun, Chen, Ziheng, Xu, Tianyang, Kittler, Josef

论文摘要

基于图像集的视觉分类方法通过以对称阳性（SPD）歧管上的非单个协方差矩阵来表征图像来实现出色的性能。为了更好地适应复杂的视觉场景，最近研究了一些用于SPD矩阵非线性处理的Riemannian网络（Riemnets）。但是，有必要问，是否可以通过简单地增加riemnets的深度来实现更大的准确性提高。答案似乎是负面的，因为更深层次的riemnets倾向于失去概括能力。为了探索这个问题的可能解决方案，我们为SPD矩阵学习提供了新的架构。具体来说，为了丰富深层表示，我们采用SPDNET [1]作为骨干，并用堆积的Riemannian AutoCoder（SRAE）建造在尾巴上。相关的重建误差项可以使SRAE和每个RAE的嵌入功能成为近似身份映射，这有助于防止统计信息的退化。然后，我们插入具有快捷方式连接的几个剩余块，以增强SRAE的代表能力，并简化更深层的网络的训练。实验证据表明，随着网络深度的增加，我们的DreamNet可以提高准确性。

Image set-based visual classification methods have achieved remarkable performance, via characterising the image set in terms of a non-singular covariance matrix on a symmetric positive definite (SPD) manifold. To adapt to complicated visual scenarios better, several Riemannian networks (RiemNets) for SPD matrix nonlinear processing have recently been studied. However, it is pertinent to ask, whether greater accuracy gains can be achieved by simply increasing the depth of RiemNets. The answer appears to be negative, as deeper RiemNets tend to lose generalization ability. To explore a possible solution to this issue, we propose a new architecture for SPD matrix learning. Specifically, to enrich the deep representations, we adopt SPDNet [1] as the backbone, with a stacked Riemannian autoencoder (SRAE) built on the tail. The associated reconstruction error term can make the embedding functions of both SRAE and of each RAE an approximate identity mapping, which helps to prevent the degradation of statistical information. We then insert several residual-like blocks with shortcut connections to augment the representational capacity of SRAE, and to simplify the training of a deeper network. The experimental evidence demonstrates that our DreamNet can achieve improved accuracy with increased depth of the network.

下载PDF全文

下载文献需遵守相关版权规定

论文标题