Commit 7c8b32e

Update.
1 parent 2443b7d commit 7c8b32e

1 file changed: +1 -0 lines changed

_publications/lacomis2019neural.markdown

Lines changed: 1 addition & 0 deletions
@@ -7,6 +7,7 @@ year: 2019
 bibkey: lacomis2019neural
 additional_links:
 - {name: "ArXiV", url: "https://arxiv.org/abs/1909.09029"}
+- {name: "Code and Data", url: "https://github.com/pcyin/dire"}
 tags: ["deobfuscation", "naming", "compilation"]
 ---
 The decompiler is one of the most common tools for examining binaries without corresponding source code. It transforms binaries into high-level code, reversing the compilation process. However, compilation loses information contained within the original source code (e.g. structure, type information, and variable names). Semantically meaningful variable names are known to increase code understandability, but they generally cannot be recovered by decompilers. We propose the Decompiled Identifier Renaming Engine (DIRE), a novel probabilistic technique for variable name recovery that uses both lexical and structural information. We also present a technique for generating corpora suitable for training and evaluating models of decompiled code renaming, which we use to create a corpus of 164,632 unique x86-64 binaries generated from C projects mined from GitHub. Our results show that on this corpus DIRE can predict variable names identical to the names in the original source code up to 74.3% of the time.
