Introduction

In a previous post, we studied how to fuzz a simple homemade 64-bit program using AFL. We found that we could cause a segmentation fault in the target using some specific inputs. In this post (and in this video), we will cover the next step: confirming if the crash can lead to a vulnerability. To do so, we’ll use GDB, the GNU debugger, and PEDA to analyze the execution of the target while processing the inputs previously generated by AFL. By doing so, we will find a way to hijack the execution flow from the Vuln1 program in order to execute our own code.

Analyzing and Exploiting Vuln1

The hard part is now about to start, as we need to delve into to assembly code of the target, analyze the values of the registers and understand how they are related to the input. Fortunately, this step is greatly eased with the introduction of tools such as PEDA and Pwntools. Let’s install these right now. To install gdb-peda, simply run this shell script provided by the developers:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-install-peda-sh

Then install Pwntools using pip with sudo -H pip install pwntools . Once done, we are set to start analyzing the crash. PEDA will automatically load whenever you start GDB. Before we start, it’s essential to understand some details about GDB, namely that it uses environment variables and hooks within the code of the debugged program. Why is this important? Exploits can heavily depend on knowing some specific memory addresses within the program itself. These locations within the code will vary depending on the defined environment variables and any additional instruction added by GDB. You must be aware that many Linux distributions are enabling Address space layout randomization (ASLR) by default. ASLR will defeat the exploit developed in this tutorial and must be disabled by editing the /proc/sys/kernel/randomize_va_space file. This file can contain one of the following values:

  • 0 – No randomization;
  • 1 – Conservative randomization. Shared libraries, stack, mmap(), VDSO and heap are randomized; and
  • 2 – Full randomization. In addition to elements listed in the previous point, memory managed through brk() is also randomized.

For the purposes of this exercise, set the value contained in this file to 0. You can see the impact of disabling ASLR has on our experiment in this companion video of this post.

That being said, let’s start gdb with the vuln1 program by typing gdb ./vuln1 at the shell. Doing so will load the vuln1 program within the debugging process, but will not execute it until you execute the run command or r  for short. Once you do so, you will see that you are prompted to enter the username and password just as when you are executing the program from the shell. If you type in regular strings, the program will execute as expected and terminate. Quite uneventful. Let’s try something more interesting: execute the program with the inputs generated by AFL. Assuming you have the inputs in a file called crash1.txt, type r < crash1.txt to start the program using the AFL inputs. This time, you’ll notice that the execution ends with a Segmentation Fault (SIGSEGV) and quite a few things are displayed in your terminal.

The execution ends with a Segmentation Fault (SIGSEGV)

Figure 1: Running the vuln1 program in GDB with the fuzzed input generated by AFL.

Let’s take a closer look at the information on the screen. When the fault was caught by the debugger, it stopped the execution of vuln1. At that point, PEDA captured the current state of the application, including the values of the registers, the program stack and the instruction which resulted in the fault. In this case, after reviewing the data, you should notice the value pointed by the RSP register.

After reviewing the data, you should notice the value pointed by the RSP register.

Figure 2: Value of the RSP register when vuln1 is ran with AFL-generated inputs.

If we take a closer look into our input using the hexdump –C crash1.txt command. We’ll notice a familiar pattern:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-crash-hex

Notice the various a7 61b4 6161 6161 a761 values. Well, it seems that these bytes somehow ended up being pointed by the RSP register. There’s an easy way to confirm this using PEDA using its pattern generator, but first, we need to know how long our input is:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-input-len-sh

The wc command tells us that there are 114 bytes in the crash1.txt file, so let’s round this up to 120 bytes for simplicity. Going back to GDB will use PEDA to generate a cyclic pattern of 120 characters as input using the following command:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-peda-pattern-sh

This will create a file named pat120 with the generated cyclic pattern. If you take a quick look at it, you’ll have an idea of how it looks. For those who used Metasploit before, this functionality is similar to the pattern_create.rb script.

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-pattern-show-sh

Now let’s run vuln1 using this newly generated input: gdb-peda> run < pat120

Once again, a segmentation fault is thrown by vuln1 and caught by GDB, however this time the values on the stack are different. The value being pointed to by RSP is also different. Looking back at our pattern, we clearly see that our input is overwriting whatever value is pointed by the RSP register.

The RSP register is pointing to the cyclic pattern generated at address 0x7fffffffdec8
The RSP register is pointing to the cyclic pattern generated at address 0x7fffffffdec8.

Why is this important? Well because the value pointed by this register will eventually be used as a return address once we hit the RET instruction at main+149. Therefore we can put a specific address in RSP and redirect execution flow to a location we control in memory and execute our own code. First thing we need to figure out is: which bytes of our input are being pointed by RSP? Again, PEDA provides an easy way to do so using the pattern_search  command. This function will provide you with information about where the pattern is detected in memory and in the registers:

gdb-peda$ pattern_search 
Registers contain pattern buffer: 
RBP+0 found at offset: 32 
Registers point to pattern buffer: 
[RSP] --> offset 40 - size ~80 
Pattern buffer found at: 0x00005555557543e7 : offset 27003 - size 4 (/home/infectedpackets/targets/vuln1/vuln1) 
0x0000555555755420 : offset 0 - size 120 ([heap]) 
0x00007fffffffdea0 : offset 0 - size 120 ($sp + -0x28 [-10 dwords]) 
References to pattern buffer found at: 
0x00007ffff7dd18c8 : 0x0000555555755420 
0x00007ffff7dd18d0 : 0x0000555555755420 
0x00007ffff7dd18d8 : 0x0000555555755420 
0x00007ffff7dd18e0 : 0x0000555555755420 
0x00007ffff7dd18e8 : 0x0000555555755420 
0x00007ffff7d18f0 : 0x0000555555755420 
0x00007ffff7dd18f8 : 0x0000555555755420

In the output above, the value pointed by RSP is overwritten after reading 40 bytes of the given input. Which means that any 8-byte combination (32 bits) located at position 40 will end up being pointed to by the RSP register and eventually, loaded into RIP and used as the address of the next instruction to execute once we reach the RET instruction. Let’s test this by creating a really simple payload that will fill the RSP with “B”s. We do so by generating 40 “A”, then 8 “B” and 112 “C”s. We can use the printf command from the Bash shell, or you can also do the same in Python or Perl. We will use Bash here:

printf "%0.sA" {1..40} > test 
printf "%0.sB" {1..8} >> test printf "%0.sC" {1..112} >> test

Restart vuln1 by providing input2 as input and take a look at the value of the RIP register:

The RIP register didn’t take the address stored in RSP.

Something didn’t go the way we expected. We should have RSP pointing to 0x4242424242424242 ('BBBBBBBB'), but that’s not the case and the reason for this is that we’re trying to fill RIP with an invalid 64-bit address, which causes a segmentation fault. Despite being able to address 64 bits, current processors actually only support 48 bits, so the upper two bytes should always be null, i.e. 00 00. Knowing this, we’ll modify slightly our payload to include 6x “B” rather than 8.

printf "%0.sA" {1..40} > test2 
printf "%0.sB" {1..6} >> test2 
printf "%0.s\0" {1..2} >> test2 
printf "%0.sC" {1..112} >> test2

Note that we are in little endian and therefore, we will write the 2 null bytes AFTER the 6 “B”s. And if we try again, we should be more successful:

gdb-peda$ nexti
[-----------------------------registers--------------------------------]
RAX: 0x1
RBX: 0x0
RCX: 0x7ffff7b088d0 (<__write_nocancel+7>: cmp rax,0xfffffffffffff001)
RDX: 0x7ffff7dd3760 --> 0x0
RSI: 0x555555755010 ("Access Denied.\n")
RDI: 0x1 RBP: 0x4141414141414141 ('AAAAAAAA')
RSP: 0x7fffffffded0 ('C' )
RIP: 0x424242424242 ('BBBBBB')
R8 : 0x7ffff7fd4700 (0x00007ffff7fd4700)
R9 : 0x7fffffffdeb0 ('A' , "BBBBBB")
R10: 0x7ffff7dd1b58 --> 0x555555756420 --> 0x0
R11: 0x246
R12: 0x5555555545b0 (<_start>: xor ebp,ebp)
R13: 0x7fffffffdfa0 --> 0x1
R14: 0x0
R15: 0x0 EFLAGS: 0x202 (carry parity adjust zero sign trap INTERRUPT direction overflow) 
[----------------------------------code---------------------------------]
Invalid $PC address: 0x424242424242

Crafting the Shellcode

Now that we have demonstrated the vulnerability, we need to decide what kind of payload we want. We’ll also determine where our shellcode is stored and then test our payload. This is the trickiest part of software exploitation as many variables come into play. The addresses obtained in this post will differ from system to system and yours will likely be different as well. 

To generate our payload, we will use Pwntools, a Python module extremely efficient at prototyping workable exploits. You will often find it used by participants of CTF. First, install the Pwntools module using pip install pwn. Afterwards, open a Python interpreter such as ipython and import the Pwn-tools module with from pwn import *.

We’ll use the shellcraft sub-module, which offers a wide selection of payloads for multiple architectures. To generate the shellcode, simply select the appropriate function from the shellcraft module. for the target architecture and operating system, i.e. ‘amd64’ and ‘linux’. You can obtain more information by using help(shellcraft) in a Python interpreter. You can also combine multiple payloads. Prior to using any of the shellcode, make sure to specify the architect by setting the context object adequately:

[ReSyst]Py>>> context.arch='amd64' 
[ReSyst]Py>>> context.os='linux' 
[ReSyst]Py>>> print(shellcraft.echo("Hello!\n") + shellcraft.exit()) 
/* push 'Hello!' / 
mov rax, 0x101010101010101 
push rax 
mov rax, 0x101010101010101 ^ 0x216f6c6c6548 
xor [rsp], rax 
/ call write('1', 'rsp', 6) / 
push (SYS_write) 
/ 1 / 
pop rax 
push (1) 
/ 1 / 
pop rdi 
push 6 
pop rdx 
mov rsi, rsp 
syscall 
/ call exit() / 
push (SYS_exit) 
/ 0x3c */ 
pop rax

For the purpose of this post, we’ll use a lame shellcode that will simply output “Hello!” and exit cleanly. In future posts, we’ll have more useful payloads. We’ll structure our payload as follow:

Basic payload structure for the Vuln1 exploit.

The first 40 bytes are simply used to fill the initial buffers and variables to reach the value pointed by RSP. While we used ‘NOP‘ instructions, any value could be used. Bytes 40 to 48 will store the address to our shellcode, which follows it immediately. Our shellcode will occupy the last 112 bytes if it fits that space:

>>> context.arch='amd64' 
>>> shellcode = asm(shellcraft.amd64.linux.echo("Hello!\n")+shellcraft.amd64.linux.exit()) 
>>> print len(shellcode) 44

Since our payload is 44 bytes – less than 112 bytes – it will easily fit in the space we have after the address, i.e in the “C” segment of the payload. Had it been greater than 112 bytes, we would have to split it. But let’s stick to the basics for now.

Jump Around

Next, we have to determine the address of our shellcode, i.e. where is our shellcode is located on the stack? Looking back at figure 4,  our place holder of the address (i.e. “BBBBBB”) is at 0x7fffffffdec8. The length of the address is 8 bytes and thus our shellcode will start at 0x7fffffffdec8+8:

Detailed structure of the Vuln1 payload.

Notice that we added a NOP sled after the address. Although not strictly needed, doing so will give us some leeway for error, especially when testing our payload outside GDB as you will see later. Now let’s craft a quick python function to create our payload and store it into a file. This will automate testing and make things much faster.

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-gen-payload-draft-py

Let’s explain some of the code above:

  • start_addr :  contains the address to jump to;
  • nop = asm(‘nop’, arch=”amd64″) returns the opcode for the “NOP” instruction for amd64 architectures.
  • nop1 = nop*40 : Creates 40 NOPs instructions at the beginning of our payload. 
  • This line, struct.pack(“<Q”, addr)  will store the address of our shellcode on our payload in little-endian format. The Q is used to specify to the pack function that we are storing the address as a unsigned long long . See the related Python docs for more information about using struct

Start GDB again, make sure you have a breakpoint at the RET instruction, so we can follow the execution of our shell code. If you look back at previous runs, you’ll notice that our RET instruction is located at main+149.  To put a breakpoint to this address, simply type b* main+149  in GDB. This will cause the execution to pause at this address. Once paused, type nexti and you’ll notice that we are jumping into our short NOP sled.

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-pdisass-out-dump

You can move forward faster thru each of the instructions by using nexti <count> or just type continue or c to continue execution.  If everything goes well, the program will output “Hello!” and exit without any error:

Our exploit correctly prints “Hello!” and exits without error.

We now have a working exploit in GDB…admittedly not a terribly useful one. However, if you try to launch this shell code directly from the command-line, you’ll notice that you will get a Segmentation Fault warning:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-replay-payload-sh

As mentioned at the beginning, This is due to GDB adding environment variables on the stack when loading the Vuln1 program. You compare the environment variables between your shell and GDB by typing env at the shell and using  show env at the GDB prompt. You’ll notice that GDB adds the “LINES” and “COLUMNS” variables. Also notice that different systems will have different variables defined, adding to the instability of our exploit. To prevent environment variables from interfering with our exploit, we will unset all of them in GDB by using the unset env command. When running our target from the shell, we will prefix it with env – to unset all variables in the shell. For example:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-replay-sh

To make our exploit work in the shell, we will need to adjust our jumping address. If we rerun our target with a new cyclic pattern without the environment variables, we obtain a different address:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-registers-hex

In our first attempt with the environment variables, RSP contained the address 0x7fffffffdec8, this time RSP has 0x7fffffffed18 as value. Adjust this value in the script provided above and try again.

Despite our best efforts, we still can’t exploit from the shell, but we are getting closer. At this point, there is no trick to getting it right but only trial and error. GDB may be adding extra instructions in the code for debugging purposes and still making our jump invalid. There are 2 variables you can adjust to guess to fix the issue:

  1. Gradually increase the jump address; and
  2. Increase the size of the NOP sled preceding the shellcode.

In my case, the exploit finally worked by increasing the jump address by 140 bytes and increasing the NOP sled to 80 bytes. Below is the update Python script used. Note that these values will be different based on your system:

https://gist.github.com/DeepCodeSec/bb459e61ad74a4903791f2888dbed836#file-payload-py

Conclusion

Crafting exploit is a mixture of technology and art. While there is a process and a methodology to go about it, there’s always something different that will require additional research, experimenting and tweaking. Even then, your exploit is not guaranteed 100%, given the multiple variables that can affect offsets or memory locations. Practice and experience will take care of honing these skills. This post is a very timid first step and has many failings: we removed common memory protections, we have include quite a useless exploit and we need to remove environment variables for it to work outside of GDB. We’ll gradually improve the realism of this exercise. However we now understand the overall exploit development process better, along with the various difficulties involved.

https://youtu.be/6MnC3CiT_tc

References

  • Zalewski, M. American Fuzzy Lop. Retrieved June 24, 2017, from http://lcamtuf.coredump.cx/afl/
  • Salzman, P. %. (n.d.). Using GNU’s GDB Debugger. Retrieved June 24, 2017, from http://www.dirac.org/linux/gdb/
  • PEDA – Python Exploit Development Assistance for GDB. Retrieved June 24, 2017, from https://github.com/longld/peda
  • 64-bit Linux stack smashing tutorial: Part 1. Retrieved June 24, 2017, from https://blog.techorganic.com/2015/04/10/64-bit-linux-stack-smashing-tutorial-part-1/
  • Mr.Un1k0d3r. 64 Bits Linux Stack Based Buffer Overflow. Retrieved June 24, 2017, from https://www.exploit-db.com/docs/33698.pdf
  • Bravon, A. “How Effective is ASLR on Linux Systems?” Security et alii. February 03, 2013. Accessed July 12, 2017. https://securityetalii.es/2013/02/03/how-effective-is-aslr-on-linux-systems/.
  • Retrieved June 24, 2017, from http://www.mathyvanhoef.com/2012/11/common-pitfalls-when-writing-exploits.html

Additional Reading

  • Klein, Tobias. A bug hunter’s diary: a guided tour through the wilds of software security. No Starch Press, 2011
  • Koziol, Jack, David Litchfield, Dave Aitel, Chris Anley, Sinan Eren, Neel Mehta, and Riley Hassell. “The Shellcoder’s Handbook.” Edycja polska. Helion, Gliwice (2004). 
  • Dowd, Mark, John McDonald, and Justin Schuh. The art of software security assessment: Identifying and preventing software vulnerabilities. Pearson Education, 2006.