Paper Title

SwarMan: Anthropomorphic Swarm of Drones Avatar with Body Tracking and Deep Learning-Based Gesture Recognition

Authors

Ahmed Baza, Ayush Gupta, Ekaterina Dorzhieva, Aleksey Fedoseev, Dzmitry Tsetserukou

Abstract

Anthropomorphic robot avatars present a conceptually novel approach to remote affective communication, allowing people across the world a wider spectrum of emotional and social exchanges than traditional 2D and 3D image data. However, current telepresence robots have several limitations, such as high weight, system complexity that prevents fast deployment, and the limited workspace of avatars mounted on either static or wheeled mobile platforms. In this paper, we present SwarMan, a novel concept of telecommunication through a robot avatar based on an anthropomorphic swarm of drones. The developed system consists of nine nanocopters controlled remotely by the operator through a gesture recognition interface. SwarMan allows operators to communicate in two ways: the swarm directly follows their motions, and the system recognizes one of the prerecorded emotional patterns and renders the captured emotion as illumination on the drones. The LSTM MediaPipe network was trained on a collected dataset of 600 short videos with five emotional gestures. The emotion recognition accuracy achieved on the test dataset was 97%. As communication through the swarm avatar significantly changes the visual appearance of the operator, we investigated the ability of users to recognize and respond to emotions performed by the swarm of drones. The experimental results revealed high consistency between users in rating emotions. Additionally, users indicated low physical demand (2.25 on the Likert scale) and were satisfied with their performance (1.38 on the Likert scale) when communicating through the SwarMan interface.
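
The two communication channels described in the abstract (the swarm following body motion, and a recognized emotion rendered as drone illumination) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The choice of nine body points, the emotion labels, the LED colours, and the names `BODY_POINTS`, `EMOTION_COLORS`, `DroneCommand`, and `plan_swarm` are all assumptions made for illustration; the abstract states only that there are nine nanocopters and five emotional gestures. The sketch also assumes the tracker supplies landmarks as normalized image coordinates, and that the emotion label comes from a separate classifier.

```python
from dataclasses import dataclass

# Hypothetical assignment of one body point per drone (nine in total).
BODY_POINTS = [
    "head", "left_shoulder", "right_shoulder",
    "left_elbow", "right_elbow", "left_hand", "right_hand",
    "left_hip", "right_hip",
]

# Hypothetical emotion labels and LED colours; the abstract does not name them.
EMOTION_COLORS = {
    "joy": (255, 200, 0),
    "anger": (255, 0, 0),
    "sadness": (0, 0, 255),
    "surprise": (255, 0, 255),
    "calm": (0, 255, 0),
}
NEUTRAL_COLOR = (255, 255, 255)


@dataclass
class DroneCommand:
    drone_id: int
    position: tuple  # (x, y, z) target in metres
    led_rgb: tuple   # (r, g, b), each 0..255


def plan_swarm(landmarks, emotion=None, scale=1.0, hover_height=1.0):
    """Map tracked body landmarks and an optional emotion label to drone commands.

    landmarks: dict of body-point name -> (x, y), normalized to [0, 1],
               with y growing downward as in image coordinates (assumed).
    emotion:   label from a gesture classifier, or None for plain motion following.
    """
    color = EMOTION_COLORS.get(emotion, NEUTRAL_COLOR)
    commands = []
    for drone_id, name in enumerate(BODY_POINTS):
        x, y = landmarks[name]
        # Centre the figure horizontally and flip y so "up" in the image is "up" in the air.
        position = ((x - 0.5) * scale, 0.0, hover_height + (0.5 - y) * scale)
        commands.append(DroneCommand(drone_id, position, color))
    return commands
```

Calling `plan_swarm(landmarks, emotion="joy")` returns nine commands that share one colour, while `plan_swarm(landmarks)` keeps the neutral colour so the swarm only mirrors the operator's pose. Keeping the pose mapping and the colour lookup in one function is a simplification; a real system would also need flight control, collision avoidance, and smoothing of the tracked points.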
