论文标题
通过日志数据分析诊断分布式系统
Diagnosing Distributed Systems through Log Data Analysis
论文作者
论文摘要
基于日志的分析和故障射击仍然是集中式和时间繁殖系统的普遍且常用的方法。但是,对于事件之间关系不直接可直接可用的并行和分布式系统,在这种情况下,完全取决于基于日志的分析成为一个挑战。本文试图使用集中式系统的基于日志的性能分析来提供解决方案,并证明结果及其有效性,同时提出了挑战,并提出了在分布式和并行系统中进行性能分析的解决方案。
The log-based analysis and trouble-shooting has remained prevalent and commonly used approach for centralized and time-haring systems. However, for parallel and distributed systems where happen-before relations are not directly available between the events, it become a challenge to fully depend on log-based analysis in such instances. This article attempts to provide solutions using log-based performance analysis of centralized system, and demonstrates the results and their effectiveness, as well presents the challenges and proposes solutions for performance analysis in distributed and parallel systems.