论文标题
样本:通过自适应抽样在视频存储库上有效搜索
ExSample: Efficient Searches on Video Repositories through Adaptive Sampling
论文作者
论文摘要
随着相机的部署更便宜,捕获和处理视频越来越普遍。同时,在过去的十年中,丰富的视频理解方法取得了长足的进步。结果,许多组织现在拥有大量的视频数据存储库,并提供了映射,导航,自动驾驶和其他领域的应用程序。由于最新的对象检测方法缓慢而昂贵,因此我们在此累积数据上处理简单的临时对象搜索查询(“在仪表板视频中找到100个交通信号灯”)的能力远远落后于我们收集它的能力。以降低采样率处理视频是这些类型的查询的合理默认策略,但是,理想的采样率既是数据又取决于查询。我们介绍了example,这是一个低成本框架,用于对对象搜索的范围搜索无索引的视频,该视频通过调整采样帧的数量和位置来快速处理搜索查询,以便对特定数据和正在处理的查询进行处理。将视频存储库中的框架处理优先级,以便将处理集中在最有可能包含感兴趣对象的视频的部分。它根据先前处理的框架的反馈不断重新确定处理。在大型现实世界数据集上,除了有效的随机采样基线,将处理时间最多减少了6倍,并且在训练专业的每电量替代模型的最先进方法上,通过几个数量级。因此,样本是构建具有成本效益的视频数据管理系统的关键组成部分。
Capturing and processing video is increasingly common as cameras become cheaper to deploy. At the same time, rich video understanding methods have progressed greatly in the last decade. As a result, many organizations now have massive repositories of video data, with applications in mapping, navigation, autonomous driving, and other areas. Because state-of-the-art object detection methods are slow and expensive, our ability to process even simple ad-hoc object search queries ('find 100 traffic lights in dashcam video') over this accumulated data lags far behind our ability to collect it. Processing video at reduced sampling rates is a reasonable default strategy for these types of queries, however, the ideal sampling rate is both data and query dependent. We introduce ExSample, a low cost framework for object search over unindexed video that quickly processes search queries by adapting the amount and location of sampled frames to the particular data and query being processed. ExSample prioritizes the processing of frames in a video repository so that processing is focused in portions of video that most likely contain objects of interest. It continually re-prioritizes processing based on feedback from previously processed frames. On large, real-world datasets, ExSample reduces processing time by up to 6x over an efficient random sampling baseline and by several orders of magnitude over state-of-the-art methods that train specialized per-query surrogate models. ExSample is thus a key component in building cost-efficient video data management systems.