论文标题
用于大规模图像搜索的非抗形图哈希算法
A non-alternating graph hashing algorithm for large scale image search
论文作者
论文摘要
在大数据时代,提高记忆和计算效率的方法对于成功部署技术已经至关重要。哈希是处理大数据所带来的计算限制的最有效方法之一。制定此问题的一种自然方法是光谱哈希,它直接纳入了学习二进制代码的亲和力。但是,由于二进制约束,优化变得棘手。为了减轻这一挑战,已经提出了不同的放松方法,以减少获得二进制代码的计算负载并仍然达到一个好的解决方案。所有现有放松方法的问题是诉诸于一个或多个额外的辅助变量,以在放松问题的同时获得高质量的二进制代码。辅助变量的存在导致坐标下降方法,从而增加了计算复杂性。我们认为引入这些变量是不必要的。为此,我们提出了一种新颖的光谱散列宽松公式,该配方却没有增加问题的其他变量。此外,我们没有在变量数量等于数据点的原始空间中解决问题,而是在较小的空间中解决该问题并从该解决方案中检索二进制代码。此技巧同时降低了内存和计算复杂性。我们应用两种优化技术,即投影梯度和在歧管上优化,以获取解决方案。使用四个公共数据集上的综合实验,我们表明,与低复杂性的最新技术相比,提出的有效光谱哈希(ESH)算法可以实现高度竞争性的检索性能。
In the era of big data, methods for improving memory and computational efficiency have become crucial for successful deployment of technologies. Hashing is one of the most effective approaches to deal with computational limitations that come with big data. One natural way for formulating this problem is spectral hashing that directly incorporates affinity to learn binary codes. However, due to binary constraints, the optimization becomes intractable. To mitigate this challenge, different relaxation approaches have been proposed to reduce the computational load of obtaining binary codes and still attain a good solution. The problem with all existing relaxation methods is resorting to one or more additional auxiliary variables to attain high quality binary codes while relaxing the problem. The existence of auxiliary variables leads to coordinate descent approach which increases the computational complexity. We argue that introducing these variables is unnecessary. To this end, we propose a novel relaxed formulation for spectral hashing that adds no additional variables to the problem. Furthermore, instead of solving the problem in original space where number of variables is equal to the data points, we solve the problem in a much smaller space and retrieve the binary codes from this solution. This trick reduces both the memory and computational complexity at the same time. We apply two optimization techniques, namely projected gradient and optimization on manifold, to obtain the solution. Using comprehensive experiments on four public datasets, we show that the proposed efficient spectral hashing (ESH) algorithm achieves highly competitive retrieval performance compared with state of the art at low complexity.