Final Exam Review - FAQ


General Concepts

What is the difference between 1 << 32 and 1L << 32?

Practice Final 1

Problem 2

Could we have the condition be count != 0 instead of count > 0?

Why does “0x400577 <+17>: shr %rsi” result in count/2? Why is there no second argument to shr?

Where in the assembly code are we setting x to arr? How do we know that %r12 is x?

Here is the set of instructions leading up to the jmp associated with the for loop:

0x40053d <+14>: mov %rdi,%r13
0x400540 <+17>: mov %rsi,%r14
0x400543 <+20>: mov %edx,0xc(%rsp)
0x400547 <+24>: mov %rcx,%r15
0x40054a <+27>: mov %rdi,%r12
0x40054d <+30>: mov $0x1,%ebp
0x400552 <+35>: jmp 0x400574

The declaration and initialization of x must correspond to one of the lines preceding that jmp. Only the first argument passed to mystery is a void * (the others are a size_t, an int, and a function pointer), so either mov %rdi,%r13 or mov %rdi,%r12 stands in for the void *x = ________ line. Since both realize void *x = arr;, that would be my first guess as to what that line of C code should be.
As for why r12 is being used instead of r13: you'll notice that r13 holds a copy of the first argument, but r13 itself is never changed again. It's used in a read-only capacity, whereas r12 is updated by the instruction at offset +62. Something that's read-only could be a variable, but something that's updated almost certainly needs to be a variable. And because the updated values of r12 are repeatedly passed as the first argument to the comparison function, that's even more evidence that r12 is being used on behalf of x.

Problem 3

How does the print statement in line 19 ultimately go on to print the password?

Note that get_realpw() doesn't return a value; instead it stores the result in the buffer that was passed in, i.e. the 16-byte realpw. The real issue is that strncpy() copies at most 16 bytes from the user's input password (userpw) into the userpwcopy buffer, with no guarantee that a null-terminator \0 is copied over. To be a proper, null-terminated string, userpwcopy must have a null-terminator somewhere in its 16 bytes; otherwise, %s in the printf will keep going in search of a null-terminator \0, printing out more characters than intended, even beyond the bounds of the array userpwcopy. Furthermore, realpw and userpwcopy are both local arrays in the function authenticate(), so both reside in that function's stack frame. Drawn out as a stack / memory diagram:

-- bottom of authenticate's stack memory

[16 bytes of realpw, characters of real password written going up]
...
[16 bytes of userpwcopy, characters of user attempt going up]

  %rsp -- top of authenticate's stack memory (growing downwards)

Hence, when userpwcopy is printed without a null-terminator, %s keeps printing bytes going upward in memory, past the end of the userpwcopy buffer and into the buffer space of realpw, so the real password is printed as well.
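The missing terminator can be checked without ever reading out of bounds. The sketch below is not the exam's code: PW_LEN, is_terminated, copy_unsafe, and copy_safe are hypothetical names chosen for illustration, and the 16-byte size is taken from the description above. It shows that strncpy leaves the destination unterminated when the source has 16 or more characters, along with one possible way to guarantee termination.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

#define PW_LEN 16  /* assumed buffer size, matching the 16-byte buffers described above */

/* Returns true if the first n bytes of buf contain a '\0',
 * i.e. buf holds a proper C string within its bounds. */
bool is_terminated(const char *buf, size_t n) {
    return memchr(buf, '\0', n) != NULL;
}

/* Mirrors the risky pattern: copies at most PW_LEN bytes and
 * adds no terminator when src has PW_LEN or more characters. */
void copy_unsafe(char dst[PW_LEN], const char *src) {
    strncpy(dst, src, PW_LEN);
}

/* One possible fix (an assumption, not the exam's solution):
 * copy at most PW_LEN - 1 bytes and always terminate. */
void copy_safe(char dst[PW_LEN], const char *src) {
    assert(dst != NULL && src != NULL);
    strncpy(dst, src, PW_LEN - 1);
    dst[PW_LEN - 1] = '\0';
}
```

With a 19-character input, is_terminated reports false after copy_unsafe and true after copy_safe, which is exactly the difference between %s running off the end of the buffer and stopping inside it.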

Problem 4

In Part (A), for get_size, instead of the casting to an int *, could we do (headerT *)curr->payloadsz & mask;?

In the myfree function, why do we subtract one from the header pointer? does this free function get passed in a pointer to the payload?

Part C: If a block was indeed allocated (second-least significant bit turned on), is_reallocated would return 0x2 (the & mask operation would turn off all bits except the second-least significant bit), right? If this is the case, since is_reallocated is meant to return a bool, are we assuming that 0 is false and any other integer is true?

In myrealloc, why does it & with 0x2, not 0x3?

Practice Final 2

Problem 1

Why does this print "Tessi"?

Why couldn't we have just done substring(&name, 3, 2)?

Problem 2

How do we calculate bytes_to_move? What does (*p_nelems * width) mean?

In shortest(), why do we pass &min to extract_min?

Problem 3

Why don’t we have local += 3 in the code? Wouldn't lea 0x3(%rdx) effectively add 3 to that variable?

Would having written return (local >= 0 ? local : local + 3) >> 2; be acceptable as well?

Problem 5

For part B, I'm having difficulty understanding the solution. Is it possible to explain this with a diagram?

For C, I understand how the assembly instructions with the printf stack frame are garbling the contents. But, I don't understand why the code prints that garbage?

Problem 6

How does prev = *(void **)to_remove manage to remove to_remove from the free list?

Practice Final 3

What is endian?

Practice Final 4

Problem 1

When linking two nodes together, why can we do assignment with ‘=’ and not use strcpy/memcpy? How do I know when I can use memcpy?

What is the intuition for building the linked list "backwards" from the n-1th item in the array to the front?

On the line *(char **)(node + strlen(strings[i]) + 1) = head, how can we cast node + strlen(strings[i]) + 1 to (char **) when (node + strlen(strings[i]) + 1) is a char *?

Problem 2

Could you re-explain the solution to part (b)? Why doesn’t the optimized version need to push %r12?

Could you re-explain the solution to part (d)? Why doesn’t the optimized version need to push %r12? What does callq do that jmpq doesn’t, and why can the optimized version use jmpq instead of callq?

Practice Final 5

Problem 2

What’s happening with the i and cinnamon variables? Could you provide an alternative explanation?

Problem 3

In (b), why would *start == info.start guarantee that it's a function pointer?

From the solution
* The stack frames of all functions are, well, stacked at higher addresses, and each stack frame is separated from the one below by a return address.

The function is basically going through the stack as if it were an array of void *s, checking whether each value is a valid return address (in other words, does it fall within a known function's memory range?). We use the strict comparison *start > info.start to exclude function pointers because a return address can never be the start of a function. To see this, recall that a return address is where execution continues after the current function finishes. In other words, it takes us back to where we were in the caller after the callee returns. I'm going to borrow some assembly from binary bomb below:

000000000040229a :
 40229a: 48 83 ec 08      sub $0x8,%rsp
 40229e: bf d8 3d 40 00   mov $0x403dd8,%edi
 4022a3: b8 00 00 00 00   mov $0x0,%eax
 4022a8: e8 43 ee ff ff   callq 4010f0 
 4022ad: 48 83 c4 08      add $0x8,%rsp
 4022b1: c3               retq

It's important to note that it wouldn't make sense for a callee to return to address 000000000040229a: a return address is the address of the instruction that follows a callq, so it always lies strictly past the first instruction of the calling function. This is why we say that if *start is 40229a, it is probably a stored function pointer, not a return address. For backtrace, we only care about return addresses, since we want to know which functions were called up to this point.
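The distinction can be written as two small predicates. This is only a sketch under assumptions: func_range and both function names are hypothetical, and the range is treated as half-open, [start, end), which may differ from how the exam's info struct is defined.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical description of one function's code range: [start, end). */
typedef struct {
    uintptr_t start;  /* address of the function's first instruction */
    uintptr_t end;    /* one past the function's last byte */
} func_range;

/* A return address points just after some callq inside the function,
 * so it must lie strictly past the first instruction. */
bool looks_like_return_address(uintptr_t value, func_range f) {
    return value > f.start && value < f.end;
}

/* A value equal to the function's first address is far more likely
 * to be a stored function pointer than a return address. */
bool looks_like_function_pointer(uintptr_t value, func_range f) {
    return value == f.start;
}
```

For the listing above, the function occupies 0x40229a through 0x4022b1, so the range is {0x40229a, 0x4022b2}. The value 0x4022ad (the instruction after the callq) passes the return-address test, while 0x40229a is classified as a function pointer.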

Problem 4

Why do we have to use 1L here?

Why do HEAP_SIZE and the size_t size variable need to be divided by sizeof(size_t)? Similarly, why do we have size/sizeof(size_t)?

Can you clarify to me how the solution for question 4d achieves its purpose? I don't understand how it is rerouting the pointers in the inactive heap to point to the right thing, especially if the free spaces are put somewhere else in the inactive heap, so wouldn't there need to be an offset that counts how much free space was traversed in the active heap, and then apply that as the offset in the inactive heap?

Ok, here it goes.

void rewrite_addresses() {
    size_t *inactive_curr = inactive_start;
    size_t *inactive_end = inactive_start + HEAP_SIZE/sizeof(size_t);
    while (inactive_curr < inactive_end && node_is_allocated(inactive_curr)) {
        size_t *payload_curr = inactive_curr + 1;
        size_t num_words = node_get_size(inactive_curr)/sizeof(size_t);
        size_t *payload_end = inactive_curr + num_words;

        // omitted for the moment

        inactive_curr = payload_end;
    }
}

The first two lines compute the boundaries of the inactive heap, and the while loop test ensures that we only pay attention to the allocated nodes in the inactive heap, which are the ones that need to be replicated in the active one. My guess is that you more or less figured this part out and that you're interested in the meat of the while loop body.
Part of what's omitted above is presented below; it looks at each word's worth of payload within an allocated node:

        while (payload_curr < payload_end) {
            if (within_active((size_t *)(*payload_curr))) {
                // still omitted for the moment
            }
            payload_curr++;
        }

Because payload_curr is a size_t *, applying ++ to it advances the address it stores by 8 bytes. *payload_curr evaluates to the size_t that resides at the address stored in payload_curr, and by casting it to a size_t *, we're entertaining the possibility that it's a valid heap address. (We rely on within_active to return true if and only if that's the case.) It's possible that the eight bytes there are coincidentally a value that just looks like a heap address, but the problem says we should assume that anything that looks like an address within the heap actually is one.
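To make the word-by-word scan concrete, here is a small standalone sketch. It is not the exam's solution: within_region is a hypothetical stand-in for within_active that takes the region bounds as parameters, and count_address_like is an illustrative helper that only counts matches instead of rewriting them.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for within_active: is addr inside the
 * nwords-long region that begins at base? */
bool within_region(uintptr_t addr, const size_t *base, size_t nwords) {
    uintptr_t lo = (uintptr_t)base;
    uintptr_t hi = (uintptr_t)(base + nwords);
    return addr >= lo && addr < hi;
}

/* Walks a payload one word at a time and counts the words whose
 * value looks like an address inside the region. */
size_t count_address_like(const size_t *payload, size_t payload_words,
                          const size_t *base, size_t nwords) {
    size_t count = 0;
    const size_t *curr = payload;
    const size_t *end = payload + payload_words;
    while (curr < end) {
        if (within_region((uintptr_t)*curr, base, nwords)) {
            count++;
        }
        curr++;  /* advances by sizeof(size_t) bytes, not by 1 byte */
    }
    return count;
}
```

A payload holding an ordinary integer, the address of a word inside the region, and the one-past-the-end address of the region yields a count of 1: only the middle value is treated as a heap address.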

The full while loop is this:

        while (payload_curr < payload_end) {
            if (within_active((size_t *)(*payload_curr))) {
                size_t offset = *(size_t *)(*payload_curr);
                *payload_curr = (size_t)(inactive_start + offset/sizeof(size_t));
            }
            payload_curr++;
        }

The first line inside the if is the crucial one. If the address is actually within the heap, we assume the eight bytes at that address comprise the offset placed there in part c of the problem. That offset is in bytes, so we need to divide it by sizeof(size_t) in the next line, knowing that pointer arithmetic will automatically multiply it back by sizeof(size_t). The second line:

       *payload_curr = (size_t)(inactive_start + offset/sizeof(size_t));

could have instead been written like this:

       *payload_curr = (size_t)((char *)inactive_start + offset);
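The equivalence of the two forms can be checked directly. In this sketch, relocate_words and relocate_bytes are hypothetical helper names (not part of the exam solution), and the byte offset is assumed to be a multiple of sizeof(size_t), as it would be for word-aligned heap nodes.

```c
#include <assert.h>
#include <stddef.h>

/* Word-based form: divide the byte offset by the word size so that
 * pointer arithmetic on a size_t * can scale it back up. */
size_t *relocate_words(size_t *base, size_t offset_bytes) {
    assert(offset_bytes % sizeof(size_t) == 0);  /* assumed word-aligned */
    return base + offset_bytes / sizeof(size_t);
}

/* Byte-based form: cast to char * so the offset is applied in bytes
 * directly, with no division needed. */
size_t *relocate_bytes(size_t *base, size_t offset_bytes) {
    return (size_t *)((char *)base + offset_bytes);
}
```

For any word-aligned offset within the same array, both helpers return the same address; for example, an offset of 3 * sizeof(size_t) from the start of an array lands on its element at index 3 either way.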

Extra problems

Pointers and generics #1

Why do we have to get the memory address of the pointers for (b), but not for (a)?

Assembly #1

In the function, I'm not sure what imul $0x31, %esi is doing?

That's the compiler being smart about the part (c) line:

burr[__________] = eliza[0] * eliza[1] * eliza[2] * eliza[3];

The product eliza[0] * eliza[1] can just be substituted with the constant 49, i.e. 0x31.

Now, you might wonder how we know that eliza[3] is not equal to 0x31*6*burr[0], since esi seems to be the register storing eliza[3]. It's a little tricky, but basically, esi is immediately reused to store the result of the 4-element multiplication. If it were the case that eliza[3] == 0x31 * 6 * burr[0], then you'd need a second multiplication by 0x31 to multiply by eliza[0] * eliza[1] for line (c), but there is none.