Paper title
Automatic Photo to Ideophone Manga Matching
Paper authors
Abstract
Photo applications offer tools for annotation via text and stickers. Ideophones (mimetic and onomatopoeic words), which are common in graphic novels, have yet to be explored for photo annotation. We present a method for automatically recommending ideophones and positioning the text on photos. The annotations are produced by obtaining a list of ideophones with English definitions and applying a suite of visual object detectors to the image. A semantic embedding then maps the detected visual objects to potentially relevant ideophones. Our system stands in contrast to traditional computer vision-based annotation systems, which stop at recommending object- and scene-level annotations, by providing annotations that are communicative, fun, and engaging. We test these annotations in Japanese and find that they are strongly preferred and that they increase enjoyment and sharing likelihood compared to unannotated photos and photos with object-based annotations.
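The matching step described in the abstract (detected object labels mapped through a semantic embedding to ideophones with English definitions) can be illustrated with a minimal sketch. This is not the authors' implementation: the three-dimensional word vectors are hand-made stand-ins for a real pretrained embedding, the ideophone list and its definition keywords are illustrative examples rather than the paper's data, and the names `TOY_VECTORS`, `IDEOPHONES`, `mean_vector`, and `recommend` are hypothetical. Object detection and text positioning are omitted; the detected labels are passed in directly.

```python
import math

# Hypothetical toy word vectors standing in for a pretrained semantic
# embedding. The values are made up for illustration only.
TOY_VECTORS = {
    "dog": [0.9, 0.1, 0.0],
    "bark": [0.8, 0.2, 0.1],
    "rain": [0.0, 0.9, 0.1],
    "umbrella": [0.1, 0.9, 0.0],
    "sparkle": [0.0, 0.1, 0.9],
    "jewel": [0.1, 0.0, 0.9],
}

# Illustrative ideophones mapped to keywords from an English definition
# (assumed examples, not the list used in the paper).
IDEOPHONES = {
    "ワンワン": ["dog", "bark"],  # a dog's bark
    "ザーザー": ["rain"],         # sound of heavy rain
    "キラキラ": ["sparkle"],      # sparkling, glittering
}


def mean_vector(words):
    """Average the vectors of all known words; None if none are known."""
    vectors = [TOY_VECTORS[w] for w in words if w in TOY_VECTORS]
    if not vectors:
        return None
    return [sum(component) / len(vectors) for component in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def recommend(detected_labels, top_k=1):
    """Rank ideophones by similarity to the detected object labels."""
    query = mean_vector(detected_labels)
    if query is None:
        return []
    scored = []
    for ideophone, keywords in IDEOPHONES.items():
        target = mean_vector(keywords)
        if target is not None:
            scored.append((ideophone, cosine(query, target)))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]
```

With these toy vectors, `recommend(["dog"])` ranks ワンワン first and `recommend(["jewel"])` ranks キラキラ first. Labels missing from the vocabulary are ignored, and an image with no known labels yields an empty list.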