LLM4Decompile
LLM4Decompile is an open-source large language model dedicated to decompiling binaries into human-readable C source code. The tool supports the decompilation of Linux x86_64 binaries across different optimization levels (O0 to O3).
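To make the optimization levels concrete, the sketch below builds the commands one might run to produce and disassemble an object file at each level from O0 to O3. It assumes GCC and objdump as the toolchain, and the file names are hypothetical; the project's own preprocessing scripts may work differently.

```python
import shlex

OPT_LEVELS = ["O0", "O1", "O2", "O3"]  # levels named in the description above


def build_commands(source="sample.c", stem="sample"):
    """Return {level: (compile_cmd, disassemble_cmd)} as argument lists.

    Assumption: GCC and objdump are the toolchain; the file names are
    hypothetical placeholders.
    """
    commands = {}
    for level in OPT_LEVELS:
        binary = f"{stem}_{level}.o"
        compile_cmd = ["gcc", f"-{level}", "-c", source, "-o", binary]
        disassemble_cmd = ["objdump", "-d", binary]
        commands[level] = (compile_cmd, disassemble_cmd)
    return commands


if __name__ == "__main__":
    # Print the commands instead of running them, so nothing is executed here.
    for level, (cc, dis) in build_commands().items():
        print(level, "|", shlex.join(cc), "|", shlex.join(dis))
```

Each argument list can be passed to `subprocess.run` once a compiler is available; comparing the four disassemblies of the same function shows how much the input to a decompiler changes with the optimization level.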
Key Features:
- Open Source: Allows users to access and modify the codebase.
- Multiple Model Sizes: Provides models ranging from 1.3 billion to 33 billion parameters.
- High Accuracy: Achieves markedly higher re-executability rates (the share of decompiled functions that recompile and pass their tests) than earlier models.
- User-Friendly Setup: Easy installation with clear instructions for getting started.
Benefits:
- Decompilation: Converts binary code back into C source, making binary files easier to understand and analyze.
- Model Availability: Models are available on Hugging Face for easy access and integration.
- Comprehensive Documentation: Includes a quick start guide, detailed examples, and script usage to help users effectively navigate the decompilation process.
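Because the models are distributed through Hugging Face, a checkpoint can in principle be loaded with the `transformers` library. The following is a minimal sketch, not the project's official script: the prompt wording in `build_prompt` is an assumption, `model_id` is supplied by the caller rather than fixed here, and `decompile` needs `transformers` and PyTorch installed.

```python
def build_prompt(asm_text):
    """Wrap disassembled code in a question-style prompt.

    Assumption: this wording is illustrative; check the model card for the
    exact template the chosen checkpoint expects.
    """
    return (
        "# This is the assembly code:\n"
        + asm_text.strip()
        + "\n# What is the source code?\n"
    )


def decompile(asm_text, model_id, max_new_tokens=512):
    """Generate C source for one function with a Hugging Face causal LM.

    Assumption: model_id names an LLM4Decompile checkpoint on Hugging Face;
    the caller provides it.
    """
    # Imported lazily so build_prompt works without these packages installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(build_prompt(asm_text), return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens and decode only the newly generated part.
    new_tokens = outputs[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Keeping prompt construction separate from generation makes it easy to swap in the template documented for a particular checkpoint without touching the loading code.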
Highlights:
- Handles binaries compiled at different optimization levels (O0 to O3).
- Uses a dedicated model to refine the pseudo-code produced by Ghidra.
- Applies recent machine learning techniques to reverse engineering.
The repository is a collaborative project that aims to broaden the model's decompilation capabilities and to integrate with popular tools.