论文标题

现有基于惯性数据集的均质化以支持人类活动识别

Homogenization of Existing Inertial-Based Datasets to Support Human Activity Recognition

论文作者

Amrani, Hamza, Micucci, Daniela, Mobilio, Marco, Napoletano, Paolo

论文摘要

已经提出了几种技术来解决从信号中识别日常生活活动的问题。事实证明,应用于惯性信号的深度学习技术是有效的,具有显着的分类精度。最近,人类活动识别(HAR)模型的研究几乎完全以模型为中心。已经证明,训练样本及其质量的数量对于获得独立于其体系结构的深度学习模型至关重要,并且对类内变异性和类间相似性更为强大。不幸的是,公开可用的数据集并不总是包含Hight质量数据和足够大的样本数量(例如,主题的数量,进行的活动类型以及试验持续时间)。此外,数据集之间存在异质性,因此不能琐碎地组合以获得较大的集合。我们工作的最终目的是一个平台的定义和实施,该平台集成了惯性信号的数据集,以便在科学社区提供大型同质信号的大数据集,并在可能的情况下富含上下文信息(例如,主题和设备位置的特征)。我们平台的主要重点是强调数据质量,这对于培训有效的模型至关重要。

Several techniques have been proposed to address the problem of recognizing activities of daily living from signals. Deep learning techniques applied to inertial signals have proven to be effective, achieving significant classification accuracy. Recently, research in human activity recognition (HAR) models has been almost totally model-centric. It has been proven that the number of training samples and their quality are critical for obtaining deep learning models that both perform well independently of their architecture, and that are more robust to intraclass variability and interclass similarity. Unfortunately, publicly available datasets do not always contain hight quality data and a sufficiently large and diverse number of samples (e.g., number of subjects, type of activity performed, and duration of trials). Furthermore, datasets are heterogeneous among them and therefore cannot be trivially combined to obtain a larger set. The final aim of our work is the definition and implementation of a platform that integrates datasets of inertial signals in order to make available to the scientific community large datasets of homogeneous signals, enriched, when possible, with context information (e.g., characteristics of the subjects and device position). The main focus of our platform is to emphasise data quality, which is essential for training efficient models.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源