论文标题
大规模多任务动态ML系统的持续开发方法
A Continual Development Methodology for Large-scale Multitask Dynamic ML Systems
论文作者
论文摘要
传统的机器学习(ML)方法需要将开发和实验过程分解为断开的迭代,其反馈用于指导设计或调整选择。 This methodology has multiple efficiency and scalability disadvantages, such as leading to spend significant resources into the creation of multiple trial models that do not contribute to the final solution.The presented work is based on the intuition that defining ML models as modular and extensible artefacts allows to introduce a novel ML development methodology enabling the integration of multiple design and evaluation iterations into the continuous enrichment of a single unbounded intelligent system.我们将一种新的方法定义为生成动态多任务ML模型作为一系列扩展和概括。我们首先使用标准的ML经验评估方法来分析提出方法的功能。最后,我们提出了一种新型的连续发展方法,该方法允许动态扩展预先存在的多任务大规模ML系统,同时分析提出的方法扩展的性能。这导致了能够共同求解124个图像分类任务的ML模型的生成,以提高大小和计算成本,以实现最新的质量状态。
The traditional Machine Learning (ML) methodology requires to fragment the development and experimental process into disconnected iterations whose feedback is used to guide design or tuning choices. This methodology has multiple efficiency and scalability disadvantages, such as leading to spend significant resources into the creation of multiple trial models that do not contribute to the final solution.The presented work is based on the intuition that defining ML models as modular and extensible artefacts allows to introduce a novel ML development methodology enabling the integration of multiple design and evaluation iterations into the continuous enrichment of a single unbounded intelligent system. We define a novel method for the generation of dynamic multitask ML models as a sequence of extensions and generalizations. We first analyze the capabilities of the proposed method by using the standard ML empirical evaluation methodology. Finally, we propose a novel continuous development methodology that allows to dynamically extend a pre-existing multitask large-scale ML system while analyzing the properties of the proposed method extensions. This results in the generation of an ML model capable of jointly solving 124 image classification tasks achieving state of the art quality with improved size and compute cost.