谁监督主管？使用深层功能嵌入与工件检查应用的模型监测

论文标题

谁监督主管？使用深层功能嵌入与工件检查应用的模型监测

Who supervises the supervisor? Model monitoring in production using deep feature embeddings with applications to workpiece inspection

论文作者

Banf, Michael, Steinhagen, Gregor

论文摘要

条件监测和工件检查的自动化在维持高质量和高吞吐量方面起着至关重要的作用。为此，机器学习的最新发展幅度的崛起已导致自主过程监督领域的巨大改善。但是，这些模型变得越复杂和强大，它们通常也越透明和解释。主要挑战之一是监视这些机器学习系统的实时部署，并在遇到可能影响模型性能的事件时提高警报。特别是，监督分类器通常是在基础数据分布中的平稳性的假设下建立的。例如，在一组材料表面缺陷上训练的视觉检查系统通常不会适应甚至识别数据分布的逐渐变化 - 一种称为“数据漂移”的问题 - 例如新型表面缺陷的出现。反过来，这可能导致有害的错误预测，例如来自新缺陷类的样本被归类为无缺陷。为此，希望提供对分类器性能的实时跟踪，以告知其他错误类的推定发作以及相对于分类器重新训练的手动干预的必要性。在这里，我们提出了一个无监督的框架，该框架在监督分类系统之上，从而利用其内部深度特征表示形式，以跟踪部署过程中数据分布的变化，从而预测分类器性能降级。

The automation of condition monitoring and workpiece inspection plays an essential role in maintaining high quality as well as high throughput of the manufacturing process. To this end, the recent rise of developments in machine learning has lead to vast improvements in the area of autonomous process supervision. However, the more complex and powerful these models become, the less transparent and explainable they generally are as well. One of the main challenges is the monitoring of live deployments of these machine learning systems and raising alerts when encountering events that might impact model performance. In particular, supervised classifiers are typically build under the assumption of stationarity in the underlying data distribution. For example, a visual inspection system trained on a set of material surface defects generally does not adapt or even recognize gradual changes in the data distribution - an issue known as "data drift" - such as the emergence of new types of surface defects. This, in turn, may lead to detrimental mispredictions, e.g. samples from new defect classes being classified as non-defective. To this end, it is desirable to provide real-time tracking of a classifier's performance to inform about the putative onset of additional error classes and the necessity for manual intervention with respect to classifier re-training. Here, we propose an unsupervised framework that acts on top of a supervised classification system, thereby harnessing its internal deep feature representations as a proxy to track changes in the data distribution during deployment and, hence, to anticipate classifier performance degradation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题