论文标题
域科学工作流的数据传输和网络服务管理
Data Transfer and Network Services management for Domain Science Workflows
论文作者
论文摘要
本文介绍了在服务访问,调度,日程安排,生命周期管理和编排的情况下将网络资源和数据传输管理提升到相同水平的愿景和正在进行的工作。域科学工作流通常包括主动计算资源分配和管理,但数据传输和关联的网络资源协调并未以类似的方式处理。结果,数据传输可以引入工作流操作的一定程度的不确定性,并且缺乏网络信息不允许进行工作流操作或网络使用来优化。最终结果是,域科学工作流程过程被迫将网络视为不透明的基础架构,他们注入数据并希望它以可接受的经验质量出现在目的地。应用程序与网络交互的能力几乎没有能力交换信息,协商性能参数,发现预期的性能指标或实时接收状态/故障排除信息。开发机制允许应用程序工作流以获取有关网络服务,功能和选项的信息,以便在一定程度上与计算资源的可能性相似,这是这项工作的主要动机。最初的重点是开放科学网格(OSG)/紧凑型MUON电磁阀(CMS)大型强子对撞机(LHC)的工作流程,具有基于RUCIO/FTS/XROOTD的数据传输,以及与ESNET Sense(用于Exascale端到端网络科学的软件定义网络)系统的互操作。
This paper describes a vision and work in progress to elevate network resources and data transfer management to the same level as compute and storage in the context of services access, scheduling, life cycle management, and orchestration. While domain science workflows often include active compute resource allocation and management, the data transfers and associated network resource coordination is not handled in a similar manner. As a result data transfers can introduce a degree of uncertainty in workflow operations, and the associated lack of network information does not allow for either the workflow operations or the network use to be optimized. The net result is that domain science workflow processes are forced to view the network as an opaque infrastructure into which they inject data and hope that it emerges at the destination with an acceptable Quality of Experience. There is little ability for applications to interact with the network to exchange information, negotiate performance parameters, discover expected performance metrics, or receive status/troubleshooting information in real time. Developing mechanisms to allow an application workflow to obtain information regarding the network services, capabilities, and options, to a degree similar to what is possible for compute resources is the primary motivation for this work. The initial focus is on the Open Science Grid (OSG)/Compact Muon Solenoid (CMS) Large Hadron Collider (LHC) workflows with Rucio/FTS/XRootD based data transfers and the interoperation with the ESnet SENSE (Software-Defined Network for End-to-end Networked Science at the Exascale) system.