论文标题

管理云可靠性的服务依赖性:工业实践

Managing Service Dependency for Cloud Reliability: The Industrial Practice

论文作者

Yang, Tianyi, Li, Baitong, Shen, Jiacheng, Su, Yuxin, Yang, Yongqiang, Lyu, Michael R.

论文摘要

云服务之间的交互导致服务依赖性。评估和管理由服务依赖性造成的级联影响对云系统的可靠性至关重要。本文总结了云系统中的依赖性类型,并演示了依赖关系管理系统(DMS)的设计,这是一个管理生产云系统中服务依赖关系的平台。 DMS具有针对服务可靠性(即初始服务部署,服务升级,主动架构优化和缓解反应性故障的功能)的全面支持以及依赖强度强度的精致表征。

Interactions between cloud services result in service dependencies. Evaluating and managing the cascading impacts caused by service dependencies is critical to the reliability of cloud systems. This paper summarizes the dependency types in cloud systems and demonstrates the design of the Dependency Management System (DMS), a platform for managing the service dependencies in the production cloud system. DMS features full-lifecycle support for service reliability (i.e., initial service deployment, service upgrade, proactive architectural optimization, and reactive failure mitigation) and refined characterization of the intensity of dependencies.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源