论文标题
甲骨文字符识别的无监督结构文本分离网络
Unsupervised Structure-Texture Separation Network for Oracle Character Recognition
论文作者
论文摘要
Oracle Bone Script是最早的著名中国写作系统,对考古和语言学是宝贵的。但是,实际扫描的Oracle数据很少见,很少有专家可供注释,这使得扫描的Oracle角色自动识别成为一项艰巨的任务。因此,我们旨在探索无监督的域适应以从手印的甲骨文数据转移知识,这些数据易于获取,以扫描域。我们提出了一个结构文本分离网络(STSN),该网络是一个端到端的学习框架,用于关节解开,转换,适应和识别。首先,通过生成模型将STSN DISENTANGLES特征成结构(字形)和纹理(噪声)组件,然后在结构特征空间中对齐的手工打印和扫描的数据,以便在适应时可以避免严重的噪声造成的负面影响。其次,通过交换跨域的学习纹理来实现转换,并且对最终分类的分类器进行了训练,以预测转换后的扫描字符的标签。这不仅保证了绝对分离,还可以增强学习特征的判别能力。 Oracle-241数据集的广泛实验表明,STSN的表现优于其他适应方法,即使在被扫描的数据受到长期埋葬和粗心的挖掘污染时,也可以成功地提高扫描数据的识别性能。
Oracle bone script is the earliest-known Chinese writing system of the Shang dynasty and is precious to archeology and philology. However, real-world scanned oracle data are rare and few experts are available for annotation which make the automatic recognition of scanned oracle characters become a challenging task. Therefore, we aim to explore unsupervised domain adaptation to transfer knowledge from handprinted oracle data, which are easy to acquire, to scanned domain. We propose a structure-texture separation network (STSN), which is an end-to-end learning framework for joint disentanglement, transformation, adaptation and recognition. First, STSN disentangles features into structure (glyph) and texture (noise) components by generative models, and then aligns handprinted and scanned data in structure feature space such that the negative influence caused by serious noises can be avoided when adapting. Second, transformation is achieved via swapping the learned textures across domains and a classifier for final classification is trained to predict the labels of the transformed scanned characters. This not only guarantees the absolute separation, but also enhances the discriminative ability of the learned features. Extensive experiments on Oracle-241 dataset show that STSN outperforms other adaptation methods and successfully improves recognition performance on scanned data even when they are contaminated by long burial and careless excavation.