论文标题
AEGIS:一种实时的多模式增强现实现实视觉系统,以帮助自闭症患者的面部表达识别
AEGIS: A real-time multimodal augmented reality computer vision based system to assist facial expression recognition for individuals with autism spectrum disorder
论文作者
论文摘要
对于大多数人来说,解释社会提示的能力自然而然,但是对于那些患有自闭症谱系障碍(ASD)的人来说,有些人在这一领域遇到了不足。本文介绍了多模式增强现实(AR)系统的开发,该系统结合了计算机视觉和深度卷积神经网络(CNN)的使用,以帮助个人在社会环境中对面部表情的检测和解释。我们称为AEGIS(增强现实表达式解释系统)的拟议系统是可以在包括平板电脑,智能手机,视频会议系统或智能类型的各种用户设备上部署的辅助技术,展示了其极端的灵活性和广泛的用例,以使日常生活融合到日常生活中。给定流媒体摄像机源,每个现实世界的框架都被传递到宙斯盾,用于面部边界框,然后送入我们新颖的深卷卷积窗口窗口的神经网络(timeconvnet)。我们利用空间和时间信息来提供准确的表达预测,然后将其转换为相应的可视化,并在原始视频框架的顶部绘制。该系统实时运行,需要最少的设置,并且易于使用。通过使用宙斯盾,我们可以协助拥有ASD的个人学习更好地识别表达方式,从而改善他们的社会经验。
The ability to interpret social cues comes naturally for most people, but for those living with Autism Spectrum Disorder (ASD), some experience a deficiency in this area. This paper presents the development of a multimodal augmented reality (AR) system which combines the use of computer vision and deep convolutional neural networks (CNN) in order to assist individuals with the detection and interpretation of facial expressions in social settings. The proposed system, which we call AEGIS (Augmented-reality Expression Guided Interpretation System), is an assistive technology deployable on a variety of user devices including tablets, smartphones, video conference systems, or smartglasses, showcasing its extreme flexibility and wide range of use cases, to allow integration into daily life with ease. Given a streaming video camera source, each real-world frame is passed into AEGIS, processed for facial bounding boxes, and then fed into our novel deep convolutional time windowed neural network (TimeConvNet). We leverage both spatial and temporal information in order to provide an accurate expression prediction, which is then converted into its corresponding visualization and drawn on top of the original video frame. The system runs in real-time, requires minimal set up and is simple to use. With the use of AEGIS, we can assist individuals living with ASD to learn to better identify expressions and thus improve their social experiences.