论文标题
DeWolf:通过利用用户调查来改善解码
dewolf: Improving Decompilation by leveraging User Surveys
论文作者
论文摘要
分析恶意软件或固件等第三方软件对于安全分析师来说是至关重要的任务。尽管存在各种自动分析的方法,并且是正在进行的研究的主题,但分析师通常必须诉诸手动静态分析,以深入了解给定的二元样本。由于很少有遇到的样本的源代码,因此分析师会定期采用分解器,以便比分析二进制拆卸更容易,更快。 在本文中,我们介绍了我们的脱编译方法DeWolf。我们对以前的学术最新分解剂和一些新颖的算法进行了各种改进,以增强可读性和理解力,重点是手动分析。为了评估我们的方法并为分析师的需求提供更好的了解,我们进行了三项用户调查。结果表明,DeWolf适合恶意软件理解,其输出质量在某些方面明显超过了Ghidra和Hex射线。此外,我们的结果表明,针对手动分析的分解器应高度配置以尊重单个用户的偏好。此外,未来的反合符不一定遵循不成文规则,以遵守程序集规定的代码结构,以产生可读的输出。实际上,DeWolf已经破解此规则的少数情况导致其结果大大超过了其他二足球。我们在Github上发布了DeWolf的原型实施和所有调查结果。
Analyzing third-party software such as malware or firmware is a crucial task for security analysts. Although various approaches for automatic analysis exist and are the subject of ongoing research, analysts often have to resort to manual static analysis to get a deep understanding of a given binary sample. Since the source code of encountered samples is rarely available, analysts regularly employ decompilers for easier and faster comprehension than analyzing a binary's disassembly. In this paper, we introduce our decompilation approach dewolf. We developed a variety of improvements over the previous academic state-of-the-art decompiler and some novel algorithms to enhance readability and comprehension, focusing on manual analysis. To evaluate our approach and to obtain a better insight into the analysts' needs, we conducted three user surveys. The results indicate that dewolf is suitable for malware comprehension and that its output quality noticeably exceeds Ghidra and Hex-Rays in certain aspects. Furthermore, our results imply that decompilers aiming at manual analysis should be highly configurable to respect individual user preferences. Additionally, future decompilers should not necessarily follow the unwritten rule to stick to the code-structure dictated by the assembly in order to produce readable output. In fact, the few cases where dewolf already cracks this rule lead to its results considerably exceeding other decompilers. We publish a prototype implementation of dewolf and all survey results on GitHub.