UFO：统一功能优化

论文标题

UFO：统一功能优化

UFO: Unified Feature Optimization

论文作者

Xi, Teng, Sun, Yifan, Yu, Deli, Li, Bi, Peng, Nan, Zhang, Gang, Zhang, Xinyu, Wang, Zhigang, Chen, Jinwen, Wang, Jian, Liu, Lufei, Feng, Haocheng, Han, Junyu, Liu, Jingtuo, Ding, Errui, Wang, Jingdong

论文摘要

本文提出了一种新颖的统一特征优化（UFO）范式，用于训练和在现实世界和大规模场景下进行深层模型，这需要集合多个AI功能。不明飞行物的目的是通过对所有任务进行大规模预修，从而使每个任务受益。与众所周知的基础模型相比，UFO具有两个不同的重点，即相对较小的模型尺寸，没有适应性成本：1）UFO将各种任务挤压为中等大小的统一模型，以多任务学习方式，并在传输到下游任务时将模型尺寸进一步修剪。 2）不明飞行物不强调转移到新任务。相反，它旨在使修剪模型专门用于一个或多个已经看到的任务。有了这两个特征，UFO为灵活的部署提供了极大的便利，同时保持了大规模预处理的好处。不明飞行物的一个关键优点是修剪过程不仅可以减少模型的大小和推理消耗，而且还提高了某些任务的准确性。具体而言，UFO考虑了多任务培训，并对统一模型产生了两倍的影响：某些密切相关的任务具有相互利益，而某些任务相互冲突。不明飞行物设法通过新颖的网络体系结构搜索（NAS）方法来减少冲突并保留互惠率。对广泛的深度表示学习任务（即面部识别，人员重新识别，车辆重新识别和产品检索）的实验表明，从UFO中修剪的模型的精度比单件任务训练的对应物的精度更高，但具有较小的模型尺寸，从而确保了UFO的概念。此外，UFO还支持发布170亿个参数计算机视觉（CV）基础模型，该模型是该行业中最大的CV模型。

This paper proposes a novel Unified Feature Optimization (UFO) paradigm for training and deploying deep models under real-world and large-scale scenarios, which requires a collection of multiple AI functions. UFO aims to benefit each single task with a large-scale pretraining on all tasks. Compared with the well known foundation model, UFO has two different points of emphasis, i.e., relatively smaller model size and NO adaptation cost: 1) UFO squeezes a wide range of tasks into a moderate-sized unified model in a multi-task learning manner and further trims the model size when transferred to down-stream tasks. 2) UFO does not emphasize transfer to novel tasks. Instead, it aims to make the trimmed model dedicated for one or more already-seen task. With these two characteristics, UFO provides great convenience for flexible deployment, while maintaining the benefits of large-scale pretraining. A key merit of UFO is that the trimming process not only reduces the model size and inference consumption, but also even improves the accuracy on certain tasks. Specifically, UFO considers the multi-task training and brings two-fold impact on the unified model: some closely related tasks have mutual benefits, while some tasks have conflicts against each other. UFO manages to reduce the conflicts and to preserve the mutual benefits through a novel Network Architecture Search (NAS) method. Experiments on a wide range of deep representation learning tasks (i.e., face recognition, person re-identification, vehicle re-identification and product retrieval) show that the model trimmed from UFO achieves higher accuracy than its single-task-trained counterpart and yet has smaller model size, validating the concept of UFO. Besides, UFO also supported the release of 17 billion parameters computer vision (CV) foundation model which is the largest CV model in the industry.

下载PDF全文

下载文献需遵守相关版权规定

论文标题