论文标题
深刻激活的显着区域例如搜索
Deeply Activated Salient Region for Instance Search
论文作者
论文摘要
实例搜索的性能在很大程度上取决于在视频/图像集合中找到和描述各种对象实例的能力。由于缺乏找到实例和得出特征表示的适当机制,实例搜索通常仅对检索已知对象类别的实例有效。在本文中,提出了简单但有效的实例级特征表示。与其他方法不同,班级无关实例本地化和独特的特征表示中的问题被考虑。前者是通过通过图层的后传源过程从图像中检测出显着实例区域来实现的。后传播从最初用于分类的预训练的CNN的最后一个卷积层开始。后传播逐层进行直到到达输入层。这允许从已知类别和未知类别的输入图像中的显着实例区域激活。每个激活的显着区域涵盖了整个或更大的实例范围。通过检测到的实例区域在某些层的特征图上的平均值来产生独特的特征表示。实验表明,这种特征表示在大多数现有方法中表现出更好的性能。此外,我们表明所提出的功能描述符也适用于基于内容的图像搜索。
The performance of instance search depends heavily on the ability to locate and describe a wide variety of object instances in a video/image collection. Due to the lack of proper mechanism in locating instances and deriving feature representation, instance search is generally only effective for retrieving instances of known object categories. In this paper, a simple but effective instance-level feature representation is presented. Different from other approaches, the issues in class-agnostic instance localization and distinctive feature representation are considered. The former is achieved by detecting salient instance regions from an image by a layer-wise back-propagation process. The back-propagation starts from the last convolution layer of a pre-trained CNN that is originally used for classification. The back-propagation proceeds layer-by-layer until it reaches the input layer. This allows the salient instance regions in the input image from both known and unknown categories to be activated. Each activated salient region covers the full or more usually a major range of an instance. The distinctive feature representation is produced by average-pooling on the feature map of certain layer with the detected instance region. Experiments show that such kind of feature representation demonstrates considerably better performance over most of the existing approaches. In addition, we show that the proposed feature descriptor is also suitable for content-based image search.