论文标题

METFI:模型驱动的故障模拟框架

MetFI: Model-driven Fault Simulation Framework

论文作者

Kaja, Endri, Gerlin, Nicolas, Rivas, Luis, Bora, Monideep, Devarajegowda, Keerthikumara, Ecker, Wolfgang

论文摘要

安全至关重要的设计需要以一定程度的信心在敌对条件下确保可靠的操作。这些设计的不断更高的复杂性使它们更容易受到失败的风险。 ISO26262建议将故障注入作为验证和衡量安全 - 关键设计的可靠性的适当技术。为了应对复杂性,需要大量精力和严格的验证流。此外,许多故障注入工具仅提供有限程度的可控性。 我们提出了Metafi,这是一种模型驱动的模拟器独立的故障模拟框架,该框架提供了多功能故障注入策略,例如统计断层注入,直接断层注入,详尽的断层注入,同时减少了手动工作。该框架可以注射卡在断层,单事件瞬态故障,单事件障碍故障以及正时故障。故障仿真是在设计的寄存器传输级别(RTL)进行的,其中针对故障模拟的一部分用栅极级别(GL)粒度表示。 Metafi具有完整的芯片(SOC)设计的可扩展性,并且为了证明框架的适用性,将故障模拟应用于两个不同SOC的各个组件。一个SOC正在运行Dhrystone应用程序,另一个SOC正在运行指纹计算应用程序。在这两个设计上进行38个各种故障注射运动需要2个persondays的最小努力。该框架提供了有关组件故障率的重要数据。结果得出的结论是,SOC处理器的一个组成部分的预摘要比这两个SOC上的其他有针对性的组件都更容易受到故障的影响,而不论其运行应用如何。

Safety-critical designs need to ensure reliable operations under hostile conditions with a certain degree of confidence. The continuously higher complexity of these designs makes them more susceptible to the risk of failure. ISO26262 recommends fault injection as the proper technique to verify and measure the dependability of safety-critical designs. To cope with the complexity, a lot of effort and stringent verification flow is needed. Moreover, many fault injection tools offer only a limited degree of controllability. We propose MetaFI, a model-driven simulator-independent fault simulation framework that provides multi-purpose fault injection strategies such as Statistical Fault Injection, Direct Fault Injection, Exhaustive Fault Injection, and at the same time reduces manual efforts. The framework enables injection of Stuck-at faults, Single-Event Transient faults, Single-Event Upset faults as well as Timing faults. The fault simulation is performed at the Register Transfer Level (RTL) of a design, in which parts of the design targeted for fault simulation are represented with Gate-level (GL) granularity. MetaFI is scalable with a full System-on-Chip (SoC) design and to demonstrate the applicability of the framework, fault simulation was applied to various components of two different SoCs. One SoC is running the Dhrystone application and the other one is running a Fingerprint calculation application. A minimal effort of 2 persondays was required to run 38 various fault injection campaigns on both the designs. The framework provided significant data regarding failure rates of the components. Results concluded that Prefetcher, a component of the SoC processor, is more susceptible to failures than the other targeted components on both the SoCs, regardless of the running application.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源