L9

Today: Debugging 1-2-3, debug printing, string upper/lower, string case-sensitive, reversed, movie example, grid, grid testing, demo HW3

Will CS106A Get Harder and Harder Each Week?

Is this course going to get harder and harder each week until nobody is left? Mercifully, no! We're actually going to settle down a little from here on out.

How To Test If Index 5 Valid in s?

How do you know if an index number is valid in a string? In other words, is this index number too big? The loosely idiomatic way to write this is: 5 < len(s). This shows up in HW3 and later.

if 5 < len(s):
    # 5 is valid in s
    ...

# equivalent but avoided:
if 5 <= len(s) - 1:

This is congruent with the UBNI convention in the range(n) function, where we use the number 5 to get the numbers 0..4.

Consider - `best10(a, b)`

Here is a small code example showing a common pick-off strategy.

> best10()

Suppose we want a function that takes two numbers, a and b, and returns the highest possible score, like this:

both a and b are 10   -> 100

one of a and b is 10  -> 50

a + b is 10           -> 20

none of above -> 0

What does the code look like for this?

1. best10() With Else

Could do it with "else", but I don't prefer this way. The later cases are indented more and more.

def best10(a, b):
    if a == 10 and b == 10:
        return 100
    else:
        if a == 10 or b == 10:
            return 50
        else:
            ...

2. If/Return Pick-off Series

Could solve with a series of if/return to pick off the various cases, working from highest score to lowest.

def best10(a, b):
    # (1)
    if a == 10 and b == 10:
        return 100
    
    # (2)
    if a == 10 or b == 10:
        return 50
        
    # (3)
    if a + b == 10:
        return 20
    
    return 0

# What about this case?
a = 0, b = 10?

Question: What about `a = 0, b = 10`?

Question: Suppose a = 0, b = 10 .. that will satisfy the (3) if, a + b == 10, which returns 20. But the correct answer for that case should be 50, since one of the numbers is 10. Is this a bug?

Answer: Not a bug, the code above works fine. If one of the numbers is 10, the (2) code detects that and does a return with the correct answer, exiting the function. Since the function exits at (2), it doesn't run (3) as hypothesized. This shows how we leverage the exit-the-function feature of return to make the cases independent.

The problem statement uses "otherwise" - do one thing if both numbers are 10, and otherwise do this other thing. Because the return exits the function for each case, we get the otherwise behavior with the cases proceeding down the function without extra indentation. This is a nice way to pick-off a series of cases in a function.

The code would not work with the cases in a different order. We pick-off the highest score case first (both are 10), and if that fails, the next highest (one is 10) and so on. Try switching the order of those two in the code see what happens.

Pick-Off Pattern

The pick-off code is like a vertical to-do list of tests. The code goes down through the tests until one is True.

def pick_offs(x):
    if *case1*:
        return *answer1*

    if *case2*:
        return *answer2*

    if *case3*:
        return *answer3*

    return *nocaseworked*

We're going to weave together two things below - show some string algorithms, but also show some debugging techniques.

Debugging 1-2-3

To write code is to see a lot of bugs. We'll mention the 3 debug techniques here, and do some concrete examples of two of these below.

For more details, see the Python Guide Debugging chapter

Debug Technique-1
Read the error message

An "exception" in Python represents an error during the run that halts the program. Read the exception message and line number. Read the exception text (which can be quite cryptic looking), from the bottom up. Start the bottom of the error report, looking for the first line of code that is your code and read the error message which can be quite helpful. Go look at the line of your code. Many bugs can be fixed right there, just knowing the error message and which line caused it.

Debug Technique-2
Look at: got + expected + code (Doctest)

Don't ask "why is this not working?". Ask "why is the code producing this output?".

The code and the output are not shifting around - they are crisp and repeatable, just sitting there. Look at the first part of the output which is wrong. What line of the code produced that?

This can work well with Doctests which can show you what you need with one click. Run the Doctest and you have the code, the input, and the output all to work with. It can also be handy to write a small Doctest.

We talked about writing a simple, obvious test case as a first Doctest. e.g. for the alpha_only() function that returns the alphabetic chars from a string, an input like '@Ax4**y', looking for output 'Axy'. That's fine. For debugging, sometimes it's nice to add a tiny test that still shows the bug, maybe '@A' - the loops and everything run so few times, there's less chaos to see through.

Sometimes looking at the code to see how it produced the output is too hard! In that case try print() below.

Debug Technique-3
Add print() in the code

This is a more rarely used technique. Instead of tracking the code in your head, add print() to see the state of the variables. It's nice to just have the computer show the state of the variables in the loop or whatever. This works on the experimental server and in PyCharm - demo below.

Note that return is the formal way for a function to produce a result. The print() function does not change that. Print() is a sort of side-channel of text, alongside the formal result. We'll study this in more detail when we do files.

The experimental server shows the function result first, then the print output below. This can also work in a Doctest. Be sure to remove the print() lines when done - they are temporary scaffolding while building.

Characters - Upper/Lower Case

We're going to think about the little details of characters. Some characters have uppercase and lowercase versions, e.g. 'A' vs. 'a', and some chars just have the one form like '@'.

'a'   # Lowercase
'b'   # Lowercase
'A'   # Uppercase relative of 'a'
'@'   # Doesn't have case

String Upper/Lower Functions
`s.upper() s.lower() s.isupper() s.islower()`

In the Roman A-Z alphabet, each alpha char has lower and upper forms:
'a' is the lowercase form of 'A'
'A' is the uppercase form of 'a'
s.upper() - returns uppercase form of s
s.lower() - returns lowercase form of s
Immutable: s.upper() returns a new, converted string
The original string s is unchanged
s.isupper() - True if made of uppercase chars
s.islower() - True if made of lowercase chars
A char with no upper/lower difference, e.g. '@' or '2'
Returned unchanged by upper()/lower()
isupper()/islower() return False
Subtle: an '@' among alpha chars is ignored by .isupper()/.islower()

>>> # Return with all chars converted to upper form
>>> 'Kitten123'.upper()
'KITTEN123'
>>> 'Kitten123'.lower()
'kitten123'
>>>
>>> 'a'.upper()
'A'
>>> 'A'.upper()
'A'
>>> '@'.upper()
'@'
>>> 
>>> 'A'.isupper()
True
>>> 'a'.isupper()
False
>>> 'a'.islower()
True
>>> '@'.islower()
False
>>> 'ab@'.islower()  # '@' ignored by .islower()
True
>>>

Problem: Immutable String Not Changed By Function Call

Recall - strings are "immutable", which means that once created they are not changed — no changing individual characters in the string and no adding or removing characters.

Suppose we have a variable storing a string: s = 'Hello'

Calling a function like s.upper() returns a new answer string, but the original string is always left unchanged. This is a very common point of confusion. The .upper() function is being called, so it's easy to get the impression that the string is changed.

alt: s points to 'Hello', s.upper() does not change s

Immutable String Examples

We can write code in the interpreter to see this immutable vs. function call in action.

Experimental server interpreter >>>

>>> s = 'Hello'
>>> s.upper()    # Returns uppercase form of s
'HELLO'
>>>
>>> s            # Original s unchanged
'Hello'
>>>
>>> s + '!!!'    # Returns + form
'Hello!!!'
>>>
>>> s            # Original s unchanged
'Hello'
>>>

Working With Immutable String: `x = change(x)`

So how do you change a string variable? Each time we call a function to compute a changed string, use = to change the variable, say s, to point to the new string.

Say we have a string s, and want to change it to be uppercase and have '!!!' at its end. Here is code that works to change s, using the x = change(x) pattern.

>>> s = 'Kitten'
>>> s = s.upper()  # Compute upper, assign back to s
>>> s
'KITTEN'
>>> s = s + '!!!'
>>> s
'KITTEN!!!'
>>>

Mnemonic: x = change(x)

Case Sensitive, `'A'` vs. `'a'`

The chars 'A' and 'a' are two different characters. This is called "case sensitive" and is the default behavior in the computer.

>>> 'A' == 'a'
False
>>> s = 'red'
>>> s == 'red'
True
>>> s == 'Red'   # Must match exactly
False

If we ask you to write some string code, and don't say anything about upper/lower case, assume it should be case-sensitive.

What Does "Not Case-Sensitive" Mean?

Computing something not case-sensitive means the logic treats uppercase and lowercase versions of a char as being equal. For example, if you are looking at a web page and search for the word 'dog', you would expect 'Dog' and 'DOG' to count as a matches. That's "not case-sensitive" logic, and it's what regular people expect.

How To Write Not Case-Sensitive Code

Not case-sensitive code: convert the strings to lowercase form, then do comparisons, logic etc. on the lowercase forms.
1. Case sensitive is the default within the computer, and on CS106A problems
2. If CS106A needs not case-sensitive code, the problem statement will call for it specifically

Example: only_ab()

only_ab(): Given string s. Return a string made of the 'a' and 'b' chars in s. The char comparisons should not be case sensitive, so 'a' and 'A' and 'b' and 'B' all count. Use the string .lower() function.

'aABBccc' -> 'aABB'

> only_ab()

Strategy: write the code using s[i].lower() to look at the lowercase form of each char in s.

only_ab(s) - v1 Case Sensitive

Here is the case-sensitive approach using boolean or, which detects only the chars 'a' and 'b'. Use this as a starting point.

def only_ab(s):
    result = ''
    for i in range(len(s)):
        if s[i] == 'a' or s[i] == 'b':
            result += s[i]
    return result

Debug Technique-2 - Look at: got + expected + code

Here's an opportunity to demonstrate Debug technique-2 — look at the "got" output vs. the expected, and then the code that produced the got output.

In this case the output is:

only_ab('aaABBbccc') -> 'aab'

Expected output: 'aaABBb'

Compare the "got" output to the expected. We can see the code that produced it at the same time.

Key question: Where does the output first go wrong vs. the expected? In this case it fails to grab the first 'A'. Look at the code. Why is the 'A' missing?

In reality, when not working, your thoughts are like "why is this stupid thing not working?" But as a practical matter, Looking at the got output and the code that produced it is the path to fixing the code.

(later practice) alpha_up()

Here's another exercise involving upper/lower logic.

> alpha_up()

'12abc$z' -> 'ABCZ'

Given string s. Return a string made of all the alphabetic chars in s, converted to uppercase form.

Use string functions .isalpha() and .upper()

`reversed()` Function

The Python built-in reversed() function: return reversed form of a sequence such as from range().

Here is how reversed() alters the output of range():

          range(5) -> 0, 1, 2, 3, 4

reversed(range(5)) -> 4, 3, 2, 1, 0

This fits into the regular for/i/range idiom to go through the same index numbers but in reverse order:

for i in reversed(range(5)):
    # i in here: 4, 3, 2, 1, 0

For more detail, see the guide Python range()

The reversed() function appears in part of homework-3.

Reverse String Example

> reverse2()

Say we want to compute the reversed form of a string:

'Hello' -> 'olleH'

There are many ways to do this, and we might make a study of it later. Here is a plan for today:

1. Start with the regular double_char() code, but change it to add a single s[i] per iteration, so it makes a plain copy of the input string.

def reverse2(s):
    result = ''
    for i in range(len(s)):
        result += s[i]
    return result

2. Add reversed() to the loop: reversed(range(len(s)))

i goes through: 4, 3, 2, 1, 0

3. We have result += s[i] in the loop, and i is going through the indexes last to first. This adds the last char 'o', then the next to last char 'l', and so on until it gets to 'H'. So in effect, it builds a reversed version of the string.

Write the code with that plan, then see next section.

There's actually a whole section of reverse string problems we may play with later - trying out various techniques.

Reverse Debug with print()

At this spot, we can look at Debug-3 technique — add print() inside the code temporarily to get visibility into what the code is doing. This works on the experimental server and in Doctests. The printing will make the Doctests fail, so it should only be in there temporarily.

reverse2() Solution With print()

Here's the reverse2() code with print() added

def reverse2(s):
    result = ''
    for i in reversed(range(len(s))):
        result += s[i]
        print(i, s[i], result)
    return result

Heres's what the output looks like in the experimental server - it shows the formal result first, and the print() output below that. This is kind of beautiful, revealing what's going on inside the loop:

'olleH'

4 o o
3 l ol
2 l oll
1 e olle
0 H olleH

Debug Technique-3 Add print() To Understand the Code

This is a more rarely used technique, but it can be very powerful.

Suppose you have a bug and the code is not computing what it's supposed to. First you just look at the output and try to just see what the bug is. Sometimes that is enough. In your mind, you are thinking about what s[i] is going to be for each loop - a thought experiment.

However if you are staring at the code and cannot figure out the bug, you could put some print() calls in there and it will show you exactly what s[i] is for each run of the loop. This can be a very clarifying technique if you are not spotting the bug at first. Instead of using your brain to think what's going on with s[i], just let the computer show you.

This works with Doctests too - the printed output appears in the Doctest window. Unfortunately, the printed output interferes with the Doctest success/fail logic, causing it to always fail, even if the code is correct. So you can print() temporarily to see what's going on, but you need to remove it when you are done.

Today we will use this "movie" example and exercise: movie.zip

The movie-starter.py file is the code with bugs, and movie.py is a copy of that to work on, and movie-solution.py has the correct code.

Movie Project + Testing Themes

alt: movie output grid of letters

Goal: we want an animation where letters fly leftwards on a black background
The world is 90% blank
10% random letters from the word 'doofus'
Kind of like The Matrix
Divide and Conquer - our mantra
Test each function separately - our other mantra
Don't debug the animation as it runs
Debug a tiny, frozen Doctest case
Huge time saver
Today: Doctest for a 2d algorithm function

Recall: Grid

Reference: Grid Reference
grid = Grid(4, 3) - create, all None initially
Zero based x,y coordinates for every square in the grid:
origin at upper left
x: 0..grid.width - 1
y: 0..grid.height - 1
grid.width - access width or height
grid.get(0, 0) - returns contents at x,y (error if out of bounds)
grid.set(0, 0, 'a') - set at x,y
grid.in_bounds(2, 2) - returns True if x,y is in bounds

How To Compute Random Numbers?

"Anyone who attempts to generate random numbers by deterministic means is, of course, living in a state of sin."
-John von Neumann (early CS giant)

Computing random numbers with a computer turns out to be a real problem.

Computer - Deterministic and Repeatable

A computer program is "deterministic" - each time you run the lines with the same input, they do exactly the same thing.

>>> x = 6
>>>
>>> x = x + 1
>>> x
7
>>>

Every time the code runs, the answer is the same.

Repeatable - on a related note, our black-box functions are "repeatable" - calling a function with the same input returns the same output every time. e.g. 'hello'.upper() -> 'HELLO'

Pseudorandom Numbers

Creating random numbers with deterministic code and inputs is impossible, so we settle for pseudorandom numbers. These are numbers which are statistically random looking, but in fact are generated by a deterministic algorithm producing each "random" number in turn. Running the algorithm with the same inputs will yield the same "random" series of numbers again.

Aside: it is possible to create true random numbers by measuring a random physical process - getting the randomness from outside the determinism of the computer. Someday I would like to run a seminar where we build such a device as a project.

Aside: How To Get an Interpreter `>>>`

For more details, see the Python Guide Interpreter chapter

Ways to get an interpreter (apart from the experimental server)

1. With an open PyCharm project, click the "Python Console" button at the bottom .. that's an interpreter.

2. In the command line for your computer, type "python3" ("py" on Windows), and that runs the interpreter directly. Use ctrl-d (ctrl-z on windows) to exit.

Keep in mind that there are two different places where you type commands - your computer command line where you type commands like "date" or "pwd". Then there's the Python interpreter with the >>> prompt where you type python expressions.

The Random Module

A "module" is a library of code we want to use
Python has many built-in modules containing useful functions
More module information later in the quarter
Here the "random" module
import random - this line once at the top of your file
random.randrange(n) - returns 0..n-1 at random, uniformly distributed
random.choice('string') - returns 1 char at random, uniformly distributed

Try random module in the "Python Console" tab at the lower-left of your PyCharm window to get an interpreter. This won't work right in the experimental server interpreter, so try PyCharm.

>>> import random   # hw3 starter code has this already
>>>
>>> random.randrange(10)
1
>>> random.randrange(10)
3
>>> random.randrange(10)
9
>>> random.randrange(10)
1
>>> random.randrange(10)
8
>>> random.choice('doofus')
'o'
>>> random.choice('doofus')
'u'
>>> random.choice('doofus')
'o'
>>> random.choice('doofus')
'o'
>>> random.choice('doofus')
's'
>>> random.choice('doofus')
's'
>>> random.choice('doofus')
'o'
>>> random.choice('doofus')
's'

Example: random_right() Function

The code for this one is provided to fill in letters at the right edge, so we'll just look at it. Demonstrates some grid code for the movie problem. We're not testing this one - testing random behavior is a pain, although it is possible.

def random_right(grid):
    """
    Set the right edge of the grid to some
    random letters from 'doofus'.
    (provided)
    """
    for y in range(grid.height):
        if random.randrange(10) == 0:  # 10% of the time
            ch = random.choice('doofus')
            grid.set(grid.width - 1, y, ch)
    return grid

scroll_left(grid)

The algorithmic core of this project
Kind of tricky
The Doctests are going to save us here
Typical sequence
1. Think about what we want
2. Sketch out the algorithm steps
3. Work out the Python code to do it

scroll_left() - What We Want

For every x,y
Want to "move" the 'b' or 'c' or whatever one to the left, e.g. its x - 1
Don't move 'a' or 'd' since its x - 1 is out of bounds
Don't move None which is 90% of squares
Using drawing vs. "in your head" - a lot of detail here

Think about scroll_left()
alt: 'a b c' top row, move the b and c each one to the left, don't move the a

scroll_left() v1 Plan

Write some code in scroll_left() - version 1
Move square like 'b' to its x - 1
Don't move the 'a', its left is out of bounds
Version 1 shown below
Has some bugs

scroll_left() v1 with bugs

def scroll_left(grid):
    """
    Implement scroll_left as in lecture notes.
    """
    # v1 - has bugs
    for y in range(grid.height):
        for x in range(grid.width):
            # Move char at x,y leftwards
            ch = grid.get(x, y)
            if ch != None and grid.in_bounds(x - 1, y):
                grid.set(x - 1, y, ch)
    return grid

Run v1 GUI

Run this in the full GUI. It's buggy, but at least it's funny.
Observe: small bugs can create big output effects
Your output may be totally haywire
But the bug may just be a -1 somewhere
Running the whole program .. not a good way to debug
Want: small case to debug - a Doctest

Make Test Case - Input and Expected

Need concrete cases to write Doctest. They can be small! An input grid, and the expected output grid. That's what makes one test case - an input and expected. We could also call these "before" and "after" pictures.

Doctest input grid (before)
alt: top row is a b c

[['a', 'b', 'c'], ['d', None, None]]

Doctest expected grid (after)
alt: top row is b c None

[['b', 'c', None], [None, None, None]]

Note: Doctests Picky

If the got differs in any little way from the expected, the test fails, e.g. having an extra space, or using " instead of the expected '.

# Don't write the expected output like this - will fail
[["b",     "c", None], [None, None, None]]

# Write it exactly syntactically as the function returns it
[['b', 'c', None], [None, None, None]]

Debug scroll_left() With Doctests

Run the Doctest to debug the code.

def scroll_left(grid):
    """
    Implement scroll_left as in lecture notes.
    >>> grid = Grid.build([['a', 'b', 'c'], ['d', None, None]])
    >>> scroll_left(grid)
    [['b', 'c', None], [None, None, None]]
    """

Here is the failed Doctest, compare output to expected:

Expected:
    [['b', 'c', None], [None, None, None]]
Got:
    [['b', 'c', 'c'], ['d', None, None]]

See that v1 fails to erase where the 'c' moved from.

Debug scroll_left() - The Key Moment

How do you debug a function? Run its small, frozen, visible Doctests, look at the output, expected and the code - all of which the Doctest makes visible.

The bug has to do with blanking out a square copying from
Also dealing with x=0
Look at the got output from the failed Doctest
Then look at the corresponding code .. often that's enough
Fix the code in lecture. Need to blank out moved-from squares.
Fix-1. Add within if: grid.set(x, y, None)
Run program for fun, then run Doctest again for more debugging
Fix-2. Un-indent above, so outside the if
Instead of in_bounds(), checking x > 0 would work too
Since we are only worried about going off the left edge
Doctest passes .. run the GUI .. now it's perfect

scroll_left() Solution

Here is the code with bugs fixed and the Doctest now passes.

def scroll_left(grid):
    """
    Implement scroll_left as in lecture notes.
    >>> grid = Grid.build([['a', 'b', 'c'], ['d', None, None]])
    >>> scroll_left(grid)
    [['b', 'c', None], [None, None, None]]
    """
    for y in range(grid.height):
        for x in range(grid.width):
            # Move letter at x,y leftwards
            ch = grid.get(x, y)
            if ch != None and grid.in_bounds(x - 1, y):
                grid.set(x - 1, y, ch)
            grid.set(x, y, None)
    return grid

Run Movie

Then run the movie program again
Can specify width/height numbers, 30 30 is the default

$ python3 movie.py
$
$ python3 movie.py 80 40  # bigger window

Key Lesson - Doctest got/expected/code

To debug, we want output which is: small, frozen, and visible

The Doctest gives us exactly this.

Looking at the failing Doctest, we have the expected output, got output, and the code - looking at these three is a good step for debugging.

The failing Doctest is like a to-do item — what is the first bit of the got that is wrong? What line produced that?

Note also that the data for the Doctest case is small and made visible by the system. It's not moving around. We can take our time. Contrast this to watching the animation.

Other Doctest Observations

A small bug can produce big, crazy output
Nice that the Doctest data is small and visible
Note we debugged scroll_left() with a ordinary looking and small 3x2 test case
Once the Doctest passed this small case, the whole big program worked perfectly

Demo: HW3 Sand Program

Watching a demo of the program
looks like a lot of work
The hardest program yet - we'll have some easier in the future
This program is very decomposable
It's 4 functions
Each function with its own Doctests
We provide many Doctests, you write some
It's possible it will work the first time you run all your code
Doctests FTW
This thing is a little fun to play with once done
Even your parents can understand what it does!
Then try to explain Doctests and grid literals to them!

Here is an additional Movie example for more practice.

(not in lecture) Basic Grid Example: set_edges()

This example function sets all the squares on the left edge to 'a', and also all the squares on the right edge.

Implement set_edges(), then write Doctests for it. We're not doing this in lecture, but it's an example.

def set_edges(grid):
    """
    Set all the squares along the left edge (x=0) to 'a'.
    Do the same for the right edge.
    Return the changed grid.
    """
    pass

Solution code:

...
    for y in range(grid.height):
        grid.set(0, y, 'a')               # left edge
        grid.set(grid.width - 1, y, 'a')  # right edge
    return grid

Q: How can we tell if that code works? With our image examples, at least you could look at the output, although that was not a perfect solution either. Really we want to be able to write test for a small case with visible data.

Doctest for set_edges()

Write a test for set_edges()
Literal format used to create and check grid values
Can set a variable within the >>> Doctest, use on later line
>>> grid = Grid.build([['b', 'b', 'b'], ['x', 'x', 'x']])

Here's a visualization - before and after - of grid and how set_edges() modifies it.
alt: set_edges() grid before and after

Here are the key 3 lines added to set_edges() that make the Doctest: (1) build a "before" grid, (2) call fn with it, (3) write out the expected result of the function call

    ...
    >>> grid = Grid.build([['b', 'b', 'b'], ['x', 'x', 'x']])
    >>> set_edges(grid)
    [['a', 'b', 'a'], ['a', 'x', 'a']]
    ...

Run Doctest in PyCharm

Lower right of window - interpreter should be set
Look at set_edges() in PyCharm
Select the ">>>" Doctest
Right click it .. Run Doctest
If the Run Doctest is not present in the menu
You may need to close PyCharm and open the "movie" folder using the PyCharm open... menu
Sometimes there's a PyCharm delay, and waiting 4 seconds fixes it
With the set_edges() code correct, the test should pass
(optional) Can try putting in a bug