2 of 70

Andrey Konovalov

Work on Linux kernel bug detectors, fuzzers, and exploit mitigations

KASAN, syzkaller, Memory Tagging

xairy.io

Who am I?

3 of 70

Network fuzzing via syscalls

3 LPE exploits

External network fuzzing

External USB fuzzing

300+ bugs

My experience with Linux kernel fuzzing

4 of 70

Fuzzing
Fuzzing the Linux kernel

Legacy
Foundation
Charged

Approaches
Tips
Final note

Agenda

Concepts, from simplest to most involved

7 of 70

Fuzzing — feeding in random inputs until the program crashes

Fuzzing

Generate

input

Execute

program

Crash?

Yes

Great!

8 of 70

Fuzzing — feeding in random XML files until the parser crashes

Fuzzing an XML parser

Generate

random

XML file

Feed into

parser

Crash?

Yes

Great!

9 of 70

Fuzzing — feeding in random inputs until the program crashes

Programs:

Application
Library
Kernel
Firmware
...

Programs

10 of 70

Fuzzing — feeding in random inputs until the program crashes

— How do we execute the program?
— What are inputs?
— How do we inject inputs?
— How do we generate inputs?
— How do we detect bugs?
— How do we automate the process?

Fuzzing

11 of 70

Fuzzing — feeding in random inputs until the kernel crashes

— How do we run the kernel?
— What are inputs?
— How do we inject inputs?
— How do we generate inputs?
— How do we detect bugs?
— How do we automate the process?

Kernel fuzzing

12 of 70

Fuzzing the Linux kernel:

Legacy

13 of 70

Fuzzing — feeding in random inputs until the kernel crashes

— How do we run the kernel?
— What are inputs?
— How do we inject inputs?
— How do we generate inputs?
— How do we detect bugs?
— How do we automate the process?

Running the kernel

14 of 70

Running the kernel

	Physical device	VM (e.g. QEMU)

Fuzzing surface	Native (includes device drivers)	Only what the VM supports

Management (restarting, debugging, getting kernel logs)	Hard; hardware gets bricked	Easy

Scalability	Buy more devices	Spawn more VMs

15 of 70

Fuzzing — feeding in random inputs until the kernel crashes

— How do we run the kernel?
— What are inputs?
— How do we inject inputs?
— How do we generate inputs?
— How do we detect bugs?
— How do we automate?

QEMU or physical device

Kernel inputs

16 of 70

Kernel inputs

vmlinux

module.ko

Userspace

Kernel

Syscalls (open, write, ioctl, …)

17 of 70

Fuzzing — feeding in random inputs until the kernel crashes

— How do we run the kernel?
— What are inputs?
— How do we inject inputs?
— How do we generate inputs?
— How do we detect bugs?
— How do we automate?

QEMU or physical device

Syscalls

Execute a binary

Legacy approach

Works everywhere!

18 of 70

Fuzzing — feeding in random inputs until the kernel crashes

— How do we run the kernel?
— What are inputs?
— How do we inject inputs?
— How do we generate inputs?
— How do we detect bugs?
— How do we automate?

QEMU or physical device

Syscalls

Execute a binary

Generating inputs

19 of 70

In case of an XML file parser
How do we generate inputs for it when fuzzing?

Idea #1: just generate random data

Generating inputs for userspace apps

20 of 70

if (input[0] == '<')

if (input[1] == 'x')

if (input[2] == 'm')

if (input[3] == 'l')

// Need to reach at least here.

Parser expects the file to start with "<xml" header

Fuzzer needs ~2^32 guesses to get past the header check

Random inputs

21 of 70

Random binary data works poorly as inputs
So what should we do?
Generate better inputs, duh

How?
Structured inputs (a.k.a. structure-aware fuzzing)
[Discussed later]
[Discussed later]

Better inputs

22 of 70

XML_GRAMMAR = {

"<start>": ["<xml-tree>"],

"<xml-tree>": ["<text>", "<xml-open-tag><xml-tree><xml-close-tag>",

"<xml-openclose-tag>", "<xml-tree><xml-tree>"],

"<xml-open-tag>": ["<<id>>", "<<id> <xml-attribute>>"],

"<xml-openclose-tag>": ["<<id>/>", "<<id> <xml-attribute>/>"],

"<xml-close-tag>": ["</<id>>"],

"<xml-attribute>" : ["<id>=<id>", "<xml-attribute> <xml-attribute>"],

"<id>": ["<letter>", "<id><letter>"],

"<text>" : ["<text><letter_space>","<letter_space>"],

"<letter>": srange(string.ascii_letters + string.digits +"\""+"'"+"."),

"<letter_space>": srange(string.ascii_letters + string.digits +"\""+"'"+" "+"\t"),

}

Structured inputs

23 of 70

Can generate structured blobs

But the kernel does not accept blobs as inputs

(Except when limiting fuzzing surface to e.g. a single syscall)

Generating kernel inputs

24 of 70