论文标题
分类的变异自动编码器内核解释和选择
Variational Autoencoder Kernel Interpretation and Selection for Classification
论文作者
论文摘要
这项工作提出了基于差异自动编码器的卷积编码器产生的概率分类器的内核选择方法。特别是,开发的方法允许选择最相关的潜在变量子集。在拟议的实施中,每个潜在变量都是从与最后一个编码器卷积层的单个内核相关的分布中取样的,因为为每个内核创建了个体分布。因此,在采样的潜在变量上选择相关功能使得可以执行内核选择,从而过滤非信息性特征和内核。这种导致模型参数数量减少。评估包装器和过滤器方法以进行特征选择。第二个特别相关,因为它仅基于内核的分布。通过测量所有分布之间的kullback-leibler差异来评估它,假设可以丢弃其分布更相似的内核。该假设得到了证实,因为观察到最相似的内核不会传达相关信息,并且可以删除。结果,所提出的方法适用于为资源受限设备开发应用程序。
This work proposed kernel selection approaches for probabilistic classifiers based on features produced by the convolutional encoder of a variational autoencoder. Particularly, the developed methodologies allow the selection of the most relevant subset of latent variables. In the proposed implementation, each latent variable was sampled from the distribution associated with a single kernel of the last encoder's convolution layer, as an individual distribution was created for each kernel. Therefore, choosing relevant features on the sampled latent variables makes it possible to perform kernel selection, filtering the uninformative features and kernels. Such leads to a reduction in the number of the model's parameters. Both wrapper and filter methods were evaluated for feature selection. The second was of particular relevance as it is based only on the distributions of the kernels. It was assessed by measuring the Kullback-Leibler divergence between all distributions, hypothesizing that the kernels whose distributions are more similar can be discarded. This hypothesis was confirmed since it was observed that the most similar kernels do not convey relevant information and can be removed. As a result, the proposed methodology is suitable for developing applications for resource-constrained devices.