论文标题
带文本的房间:用于覆盖文本检测的数据集
Rooms with Text: A Dataset for Overlaying Text Detection
论文作者
论文摘要
在本文中,我们介绍了带有覆盖和场景文本的新型房间内部图片数据集,总计为25个产品类别的4836个注释图像。我们提供有关数据集收集和注释过程的详细信息,并分析其统计数据。此外,我们提出了一种覆盖文本检测的基线方法,该方法利用字符吸引的文本检测框架来指导分类模型。我们验证我们的方法并在二元分类指标方面显示其效率,达到0.95 F1得分的最终性能,相应地,假阳性和假负率为0.02和0.06。
In this paper, we introduce a new dataset of room interior pictures with overlaying and scene text, totalling to 4836 annotated images in 25 product categories. We provide details on the collection and annotation process of our dataset, and analyze its statistics. Furthermore, we propose a baseline method for overlaying text detection, that leverages the character region-aware text detection framework to guide the classification model. We validate our approach and show its efficiency in terms of binary classification metrics, reaching the final performance of 0.95 F1 score, with false positive and false negative rates of 0.02 and 0.06 correspondingly.