论文标题
使用语义神经网络合奏的枪支检测和分割
Firearm Detection and Segmentation Using an Ensemble of Semantic Neural Networks
论文作者
论文摘要
近年来,我们看到了世界各地的恐怖袭击激增。这种攻击通常发生在人群庞大的公共场所,以造成最大的损害并获得最大的关注。即使假定监视摄像机被认为是一种强大的工具,但由于人类警惕监视视频监视的能力或出于简单原因,它们在预防犯罪方面的作用远远不算明显。在本文中,我们基于语义卷积神经网络的合奏提出了一个武器检测系统,该系统分解了检测和将武器定位为与武器各个组件部分有关的较小问题的问题。这种方法具有计算和实用的优势:一组专门针对特定任务的简单神经网络需要更少的计算资源,并且可以并行培训;用户可以调整由单个网络输出的汇总给出的系统的总体输出,以权衡误报和虚假负面因素;最后,根据整体理论,即使在存在薄弱的个体模型的情况下,整个系统的输出也将是强大而可靠的。我们评估了旨在评估单个网络和整个系统的准确性的系统运行模拟。关于综合数据和现实世界数据的结果是有希望的,他们认为与基于单个深层卷积神经网络的单片方法相比,我们的方法可能具有优势。
In recent years we have seen an upsurge in terror attacks around the world. Such attacks usually happen in public places with large crowds to cause the most damage possible and get the most attention. Even though surveillance cameras are assumed to be a powerful tool, their effect in preventing crime is far from clear due to either limitation in the ability of humans to vigilantly monitor video surveillance or for the simple reason that they are operating passively. In this paper, we present a weapon detection system based on an ensemble of semantic Convolutional Neural Networks that decomposes the problem of detecting and locating a weapon into a set of smaller problems concerned with the individual component parts of a weapon. This approach has computational and practical advantages: a set of simpler neural networks dedicated to specific tasks requires less computational resources and can be trained in parallel; the overall output of the system given by the aggregation of the outputs of individual networks can be tuned by a user to trade-off false positives and false negatives; finally, according to ensemble theory, the output of the overall system will be robust and reliable even in the presence of weak individual models. We evaluated our system running simulations aimed at assessing the accuracy of individual networks and the whole system. The results on synthetic data and real-world data are promising, and they suggest that our approach may have advantages compared to the monolithic approach based on a single deep convolutional neural network.