4 months ago

Shellcode_IA32: A Dataset for Automatic Shellcode Generation

View Paper Details

Pietro Liguori Erfan Al-Hossami Domenico Cotroneo Roberto Natella Bojan Cukic Samira Shaikh

Shellcode_IA32: A Dataset for Automatic Shellcode Generation

Abstract

We take the first step to address the task of automatically generating shellcodes, i.e., small pieces of code used as a payload in the exploitation of a software vulnerability, starting from natural language comments. We assemble and release a novel dataset (Shellcode_IA32), consisting of challenging but common assembly instructions with their natural language descriptions. We experiment with standard methods in neural machine translation (NMT) to establish baseline performance levels on this task.

Code Repositories

dessertlab/Shellcode_IA32

Official

Mentioned in GitHub

https://arxiv.org/abs/2202.03755

Benchmarks

Benchmark	Methodology	Metrics
code-generation-on-shellcode-ia32	LSTM-based Sequence to Sequence	BLEU-4: 62.97 Exact Match Accuracy: 51.55

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

4 months ago

Shellcode_IA32: A Dataset for Automatic Shellcode Generation

View Paper Details

Pietro Liguori Erfan Al-Hossami Domenico Cotroneo Roberto Natella Bojan Cukic Samira Shaikh

Shellcode_IA32: A Dataset for Automatic Shellcode Generation

Abstract

We take the first step to address the task of automatically generating shellcodes, i.e., small pieces of code used as a payload in the exploitation of a software vulnerability, starting from natural language comments. We assemble and release a novel dataset (Shellcode_IA32), consisting of challenging but common assembly instructions with their natural language descriptions. We experiment with standard methods in neural machine translation (NMT) to establish baseline performance levels on this task.

Code Repositories

dessertlab/Shellcode_IA32

Official

Mentioned in GitHub

https://arxiv.org/abs/2202.03755

Benchmarks

Benchmark	Methodology	Metrics
code-generation-on-shellcode-ia32	LSTM-based Sequence to Sequence	BLEU-4: 62.97 Exact Match Accuracy: 51.55

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

Shellcode_IA32: A Dataset for Automatic Shellcode Generation | Papers | HyperAI