Today: string char class, more string loops, if/else, Doctests, grid, peeps example
You don't need to know these, but they are handy. Show these when we are on the experimental server.
See Python Guide: Keyboard Shortcuts
These work on the experimental server and other places, including the PyCharm tool we'll show you on Fri. Ctrl-k works in GMail .. so satisfying!
double_char(s): Given string s. Return a new string that has 2 chars for every char in s. So 'Hello'
returns 'HHeelllloo'
. Use a for/i/range loop.
A string is made of characters. The chars are numbered 0..length-1
How to loop to see each char in a string? Standard for/i/range loop to loop over the index numbers of the chars.
Then use s[i]
within the loop to refer to each char.
double_char(s)
PatternLook at every char, building up a result, like this:
1. At the beginning set result = ''
2. Loop over chars: for i in range(len(s))
3. Use result += xxx
to add something to end of result
4. At the end return result
def double_char(s): result = '' for i in range(len(s)): result += s[i] + s[i] return result
Pattern strategy: for look-at-each-char problems below, we'll think back to the double_char() code as a first step.
s.isalpha()
- True
for alphabetic word char, i.e. 'a-z'
and 'A-Z'
. Python uses "unicode" to support many alphabets, e.g. 'Ω'
is another alphabetic char.
s.isdigit()
- True
if all chars in s are digits '0' '1' .. '9'
s.isspace()
- True
for whitespace char, e.g. space, tab, newline
>>> 'a'.isalpha() True >>> '$'.isalpha() False >>> 'abc'.isalpha() # works for multiple chars too True >>> 'Z'.isalpha() True >>> 'Ω'.isalpha() # Greek Omega char (unicode) True >>> '水'.isalpha() # Mandarin "water" char (unicode) True >>> '9'.isdigit() True >>> 'Ω'.isdigit() False >>> ' '.isspace() True >>> '\n'.isspace() True >>>
'H4ip$' -> 'Hip' '12ab34ba' -> 'abba'
Hint: use s[i].isalpha()
to test each char
Aside: handy lecture music during coding exercises - Creative Commons (CC) licensed music.
def alpha_only(s): result = '' # Loop over all index numbers for i in range(len(s)): # Access each s[i] if s[i].isalpha(): result += s[i] return result
Just as a reminder, here is the regular if-statement we've used many times. If the test is True
, the lines are run. Otherwise if the test is False
, the lines are skipped.
if test-expression: lines
One of: (1) run lines, (2) do nothing
See guide for more if/else details: Python-if
Adding the else:
clause to the if-statement:
if test-expression: lines-1 # run if test True else: lines-2 # run if test False
One of: (1) run lines-1, (2) run lines-2
> str_dx
'1a24$' -> 'dxddx'
def str_dx(s): result = '' for i in range(len(s)): if s[i].isdigit(): result += 'd' else: result += 'x' return result
Suppose you want to do something if a test is False
. Sometimes people sort of back into using else to do that, like the following, but this is not the best way:
if some_test: pass # do nothing here else: do_something
The correct way to do that is with not
:
if not some_test: do_something
The double_char code needs to go all the way to its last line to have its answer.
BUT sometimes an algorithm can figure out an answer earlier, using a return
in that case to provide an answer without running the lines below. This is form of the pick-off strategy where sometimes we figure out the answer early.
You know at some level that the interior of the computer is very logical. This problem embodies that theme of very neat, sharp logic.
Consider this function:
has_alpha(s): Given string s. Return True if there is an alphabetic char within s. If there is no alphabetic char within s, return False.
'3$abc' -> True '42$@@' -> False
We might say this algorithm is like the book Are You My Mother?
Think about solving this problem. Look through the chars of the string. When do you know the answer?
Logic strategy: look at each char. (1) If we see an alpha char, return True
immediately. We do not need to look at any more chars. (2) If we look at every char, and never see an alpha char, must conclude that there are no alpha chars and the result should be False
. The (1) code goes in the loop. The (2) code goes after the loop - a sort of "by exhaustion" strategy.
def has_alpha(s): for i in range(len(s)): if s[i].isalpha(): # 1. Return True immediately if find # alpha char - we're done. return True # 2. If we get to here, there was no # alpha char, so return False. return False
Big Picture - program made of many functions. Want to build out the functions efficiently, concentrating on one function at a time.
Bad news: You can't tell if a function is correct by looking at it.
Good news: It's pretty easy to have a few input/output tests for a function. Tests don't prove 100% that the function is correct, but in practice they work very well.
We're going to show you have to run and write your own function tests in Python. Python is very advanced for this - tests are easy to write, easy to run.
In the old days, you could write separate functions, but it was hard to test them one at a time. You would end up testing the functions all together, and so you would have bugs in function a() and at the same time bugs in b(). The output would be a mixture of both bugs interfering with each other. It could be a real disaster. Now we can test each function on its own. Much better!
If it says "no interpreter" at the lower right of PyCharm window. Click it. Select a recent Python version, such as 3.11 or 3.12. If they are not in the list, click Add Local Interpreter. Then System Interpreter, and select a python3 version. Any recent python3 version is fine. On my laptop, the most recent has the cryptic name "/usr/bin/python3"
We'll use this function to see how tests work.
Given string s, return a string of its digit chars.
'Hi4x3' -> '43'
Doctests are the Python technology for easily testing a function.
For more details, see the Doctest chapter in the guide
def digits_only(s): """ Given a string s. Return a string made of all the chars in s which are digits, so 'Hi4!x3' returns '43'. Use a for/i/range loop. (this code is complete) >>> digits_only('Hi4!x3') '43' >>> digits_only('123') '123' >>> digits_only('xyz') '' >>> digits_only('') '' """ result = '' for i in range(len(s)): if s[i].isdigit(): result += s[i] return result
Here are the key lines that make one Doctest:
>>> digits_only('Hi4!x3') '43'
Usually we run Doctests from within PyCharm. It's also possible to run from the command line like this:
$ python3 -m doctest str1.py
The output is nothing if the tests pass, otherwise it lists all the failures. This runs all the tests in the file.
We'll use Doctests to drive the examples in section and on homework-3.
Today's grid example peeps.zip
grid = Grid(3, 2) grid.width # returns 3 grid.set(2, 0, 'a') grid.set(2, 1, 'b') grid.get(2, 0) -> 'a' grid.in_bounds(2, 0) -> True grid.in_bounds(3, 0) -> False
Suppose we have a 2-d grid of peeps candy bunnies. A square in the grid is either 'p'
if it contains a peep, or is None
if empty.
We'll say an x, y has a happy peep, if that square contains a peep and there is another peep immediately to its left or right.
Look at the grid squares again. For each x, y .. is that a happy peep x, y?
x, y happy? (top row) 0, 0 -> False (no peep there) 1, 0 -> True 2, 0 -> True (2nd row, nobody happy) 0, 1 -> False 1, 1 -> False 2, 1 -> False
Here is the syntax for the above grid. The first [ .. ] is the first row, the second [ .. ] is the second row. This is fine for writing the data of a small grid, which is good enough for writing a test.
grid = Grid.build([[None, 'p', 'p'], ['p', None, 'p']])
def is_happy(grid, x, y): """ >>> grid = Grid.build([[None, 'p', 'p'], ['p', None, 'p']]) >>> is_happy(grid, 0, 0) False >>> is_happy(grid, 1, 0) True >>> is_happy(grid, 2, 0) True >>> is_happy(grid, 0, 1) False >>> is_happy(grid, 2, 1) False """ pass
Checking 5 representative squares. Removed """doc words""" so tests and drawing fit on screen at once.
Workflow. Run, look at the top failed case. Look at the wrong output and the code that produced it to think of your next step. Easier to do this than staring at a blank screen. Also we are more confident it's right when we're done, since we do have a range of tests.
This code is fine. Using the "pick-off strategy, looking for cases to return True
. Then return False
as the bottom if none of the cases found another peep.
def is_happy(grid, x, y): """ Given a grid of peeps and in bounds x,y. Return True if there is a peep at that x,y and it is happy. A peep is happy if there is another peep immediately to its left or right. >>> grid = Grid.build([[None, 'p', 'p'], ['p', None, 'p']]) >>> is_happy(grid, 0, 0) False >>> is_happy(grid, 1, 0) True >>> is_happy(grid, 2, 0) True >>> is_happy(grid, 0, 1) False >>> is_happy(grid, 2, 1) False """ # 1. Check if there's a peep at x,y # If not we can return False immediately. if grid.get(x, y) != 'p': return False # 2. Happy because of peep to right? # Must check that x+1 is in bounds before calling get() if grid.in_bounds(x + 1, y): if grid.get(x + 1, y) == 'p': return True # 3. Similarly, is there a peep to the left? if grid.in_bounds(x - 1, y): if grid.get(x - 1, y) == 'p': return True # 4. If we get to here, not a happy peep, # so return False return False
and
The in_bounds() checks can be done with and
instead nesting 2 ifs. This works because the "and" works through its tests left-to-right, and stops as soon as it gets a False
. This code is a little shorter, but both approaches are fine.
# 2. Happy because of peep to right? # here using "and" instead of 2 ifs if grid.in_bounds(x + 1, y) and grid.get(x + 1, y) == 'p': return True
(Do this if we have time.)
Say we want to know for one column in the grid, is there a happy peep in there? A column in the grid is identified by its x value - e.g. x == 2 is one column in the grid.
Below is the def for this. The parameters are grid and x - the x identifies the column to check. The return True/False strategy is similar to the one seen in the has_alpha() example above. Doctests are provided, write the code to make it work.
def has_happy(grid, x): """ Given grid of peeps and an in-bounds x. Return True if there is a happy peep in that column somewhere, or False if there is no happy peep. >>> grid = Grid.build([[None, 'p', 'p'], ['p', None, 'p']]) >>> has_happy(grid, 0) False >>> has_happy(grid, 1) True """ # your code here pass
def has_happy(grid, x): """ Given grid of peeps and an in-bounds x. Return True if there is a happy peep in that column somewhere, or False if there is no happy peep. >>> grid = Grid.build([[None, 'p', 'p'], ['p', None, 'p']]) >>> has_happy(grid, 0) False >>> has_happy(grid, 1) True """ # x is specified in parameter, loop over all y for y in range(grid.height): if is_happy(grid, x, y): return True # if we get here .. no happy peep found return False
We're just starting down this path with Doctests. Doctests enable writing little tests for each black-box function as you go, which turns out to be big productivity booster. We will play with this in section and on homework-3.