视频Swin Transformers用于以Egentric视频理解 @ ego4d挑战2022

论文标题

视频Swin Transformers用于以Egentric视频理解 @ ego4d挑战2022

Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022

论文作者

Escobar, Maria, Daza, Laura, González, Cristina, Pont-Tuset, Jordi, Arbeláez, Pablo

论文摘要

我们将视频Swin Transformer作为基础体系结构实现，用于无返回时间定位和对象状态变更分类的任务。我们的方法在这两个挑战上都取得了竞争性能。

We implemented Video Swin Transformer as a base architecture for the tasks of Point-of-No-Return temporal localization and Object State Change Classification. Our method achieved competitive performance on both challenges.

下载PDF全文

下载文献需遵守相关版权规定

论文标题

视频Swin Transformers用于以Egentric视频理解 @ ego4d挑战2022

Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022

论文作者

论文摘要

加入微信交流群