论文标题
学习异常检测的图像表示:在药物开发中发现组织学改变的应用
Learning image representations for anomaly detection: application to discovery of histological alterations in drug development
论文作者
论文摘要
我们提出了一个用于组织病理学图像中异常检测的系统。在组织学中,正常样品通常很丰富,而异常的病例(病理)病例很少或不可用。在这种情况下,接受健康数据培训的单级分类器可以检测到分布异常的样本。此类方法与先前使用预训练的卷积神经网络(CNN)表示形式相结合进行异常检测(AD)。但是,预先训练的现成的CNN表示可能对组织中的异常条件不敏感,而健康组织的自然变化可能会导致较远的表示。为了适应健康组织中的相关细节,我们建议对CNN进行辅助任务培训,该辅助任务区分了不同物种,器官和染色试剂的健康组织。几乎不需要额外的标签工作量,因为健康样品会自动带有上述标签。在训练期间,我们使用中央损失项执行紧凑的图像表示,这进一步改善了AD的表示。所提出的系统优于在公开的肝脏异常数据集上建立的AD方法。此外,它提供了与专门针对肝异常定量的常规方法相当的结果。我们表明,我们的方法可用于在早期开发阶段对候选药物的毒性评估,从而可以减少昂贵的后期药物流失。
We present a system for anomaly detection in histopathological images. In histology, normal samples are usually abundant, whereas anomalous (pathological) cases are scarce or not available. Under such settings, one-class classifiers trained on healthy data can detect out-of-distribution anomalous samples. Such approaches combined with pre-trained Convolutional Neural Network (CNN) representations of images were previously employed for anomaly detection (AD). However, pre-trained off-the-shelf CNN representations may not be sensitive to abnormal conditions in tissues, while natural variations of healthy tissue may result in distant representations. To adapt representations to relevant details in healthy tissue we propose training a CNN on an auxiliary task that discriminates healthy tissue of different species, organs, and staining reagents. Almost no additional labeling workload is required, since healthy samples come automatically with aforementioned labels. During training we enforce compact image representations with a center-loss term, which further improves representations for AD. The proposed system outperforms established AD methods on a published dataset of liver anomalies. Moreover, it provided comparable results to conventional methods specifically tailored for quantification of liver anomalies. We show that our approach can be used for toxicity assessment of candidate drugs at early development stages and thereby may reduce expensive late-stage drug attrition.