论文标题
与虚拟老师包装的分类器,用于培训基于物理的分类器,该分类器未标记的雷达数据
Wrapped Classifier with Dummy Teacher for training physics-based classifier at unlabeled radar data
论文作者
论文摘要
在本文中描述了EKB和MAGW ISTP SB RAS相干散点雷达(8-20MHz操作频率)在2021年中自动分类的一种方法。该方法适合于实时对实验数据进行的自动物理解释。我们称这种算法与虚拟老师包装了分类器。该方法在未标记的数据集上进行了培训,并基于使用聚类结果培训基于最佳物理的分类。该方法接近最佳嵌入搜索,其中嵌入被解释为软分类概率的向量。该方法允许基于接收到的数据的物理解释参数找到最佳分类算法,均在基于物理的数值仿真和实验测量过程中获得。用于标记未标记数据集的虚拟教师簇是高斯混合物聚类算法。对于算法的功能,我们使用雷达获得了其他参数,在使用射线追踪和IRI-2012和IRI-2012和IGRF模型中,以相应地将雷达获得的参数扩展了其他参数。对于虚拟老师的聚类,我们使用了可用参数的整个数据集(测量和模拟参数)。为了通过包装分类器进行分类,我们仅使用很好的物理解释参数。结果,我们训练了分类网络,并从可用数据的物理角度找到了11个良好解释的类。从物理的角度来看,其他五个类别的类是无法解释的,这证明了考虑到正确分类的辐射式传播的重要性。
In the paper a method for automatic classification of signals received by EKB and MAGW ISTP SB RAS coherent scatter radars (8-20MHz operating frequency) during 2021 is described. The method is suitable for automatic physical interpretation of the resulting classification of the experimental data in realtime. We called this algorithm Wrapped Classifier with Dummy Teacher. The method is trained on unlabeled dataset and is based on training optimal physics-based classification using clusterization results. The approach is close to optimal embedding search, where the embedding is interpreted as a vector of probabilities for soft classification. The approach allows to find optimal classification algorithm, based on physically interpretable parameters of the received data, both obtained during physics-based numerical simulation and measured experimentally. Dummy Teacher clusterer used for labeling unlabeled dataset is gaussian mixture clustering algorithm. For algorithm functioning we extended the parameters obtained by the radar with additional parameters, calculated during simulation of radiowave propagation using ray-tracing and IRI-2012 and IGRF models for ionosphere and Earth's magnetic field correspondingly. For clustering by Dummy Teacher we use the whole dataset of available parameters (measured and simulated ones). For classification by Wrapped Classifier we use only well physically interpreted parameters. As a result we trained the classification network and found 11 well-interpretable classes from physical point of view in the available data. Five other found classes are not interpretable from physical point of view, demonstrating the importance of taking into account radiowave propagation for correct classification.