L26

Today: AI

Modern Software - Data Layers

A common pattern in modern code is that there's some layered data, with an outer dict, and within it layers of lists and dicts. You should be able to write code that accesses into this layered structure. In places, using in to check that a key exists before accessing it.

Here is an example problem:

> msg_score()

Given a dict msg which summarizes a message in the system. It is guaranteed to contain a key 'words' with a nested list of words. It is also guaranteed to contain a key 'scores' which points to a nested dict where each key is a word, and its value is an int score. Write code that looks at all the words in the nested words list, looks up the score for each, and returns the sum of all these scores. Some words may not be present in the scores dict, and these should be ignored.

msg = {
   'words': ['this', 'and', 'that', 'kitten'],
   'scores': {'and': 10, 'kitten': 12}
}

I often add variables, here words and scores, that point to the nested structures.

words = msg['words']
scores = msg['scores']


words----------
              |
msg = {       v
     'words': ['this', 'and', 'that', 'kitten'],

     'scores': {'and': 10, 'kitten': 12}
}              ^
               |
scores---------

Then write code with the variables.

def msg_score(msg):
    total = 0
    # Var points to inner
    words = msg['words']
    scores = msg['scores']
    for word in words:
        # Check "in" before [word]
        if word in scores:
            total += scores[word]
    return total

Here is a solution without added variables. This works perfectly and it's fine to write it this way. It's just a matter of how many [ ] can you layer up on a line while keeping your ideas straight. I think with the variables, I'm less likely to make mistakes.

def msg_score(msg):
    total = 0
    for word in msg['words']:
        if word in msg['scores']:
            total += msg['scores'][word]
    return total

AI

Let's talk about AI, which will feature in HW 8. Talk about AI and creative work today, and AI and programming in week 10.

Gettysburg Address

Say we start with the Gettysburg address:

Four score and seven years ago our fathers brought forth on this continent, a new nation, conceived in Liberty, and dedicated to the proposition that all men are created equal.

Bigrams Dictionary

We'll build a "bigrams" dictionary with a key for every word in the text. (It's "bigrams" since we consider pairs of words, and the technique scales to looking at more than 2 words.) The value is a list of all the words that occur immediately after that word in the text. This is not a difficult dict to build with some Python code.

{
 'Four': ['score'],
 'score': ['and'],
 'and': ['seven', 'dedicated', 'so', 'proper', 'dead,', 'that'],
 'seven': ['years'],
 ...
}

Bigrams Random Output

The bigrams dict forms a sort of word/arrow model (aka a "Markov model), where each word has arrows leading to the words which might follow it. alt: markov model of words

We can write code to randomly chase through this bigrams model - output a word, then randomly select a "next" word, output that word, and keep going.

Here are 3 randomly generated texts using this simple bigrams Gettysburg model:

1. Four score and dedicated to the living, rather, to the great civil war, testing whether that all men are met on a great task remaining before us to dedicate a great battle-field of that we can not have come to add or any nation so nobly advanced.

2. Four score and seven years ago our poor power to dedicate -- we can never forget what we can not hallow -- and dead, who fought here to be dedicated to add or any nation so nobly advanced. It is for those who struggled here, have thus far so nobly advanced.

3. Four score and dedicated to add or detract. The world will little note, nor long endure. We are met on a portion of freedom -- that this nation, or any nation might live. It is for those who fought here to add or any nation might live. It is for us to be dedicated to dedicate a portion of devotion -- that nation might live.

Here are some examples from Alice in Wonderland, and the Apple software license

4. Alice's side of the question?' said Alice, very humble tone, and he stole those serpents! There's no label this he spoke. `As wet as he was all this be getting tired of that squeaked. This sounded promising, certainly: but he called a tiny hands up and went on, without speaking, so Alice said Alice, feeling a very neatly and the right thing the Queen to the Dodo.

5. SOFTWARE PACKAGE WITHIN THE TERMS OF THIRD PARTY RIGHTS. D. H.264/AVC Notice. To the fonts included with the Photos App Features may download was encoded by visiting http://www.apple.com/privacy/. At all warranties, expressed or services using the Apple Software, full force and conditions of the Apple Software. You acknowledge and how the GPL or control.

Bigrams Output Observations

1. It's kind of vaguely similar to the input. It's impressive considering that we just look at pairs of words and that's all.

2. However the output doesn't really make sense or have sense in it; it's just replaying fragments vaguely imitating the source text.

This Is Roughly How AI Works

This simple bigrams model reflects a bit of how ChatGPT / LLM AI works — training phase and output phase.

1. Train The Model

Read in a huge source text, building a "model" that in some way summarizes or captures patterns in the original. It's not a copy of the original, but a sort of distillation.

The contents of the model reflects and depends on the source text it's built from. A sort of summary or distillation.

2. Output = Traverse The Model

The output is made by traversing and working through model (possibly with a prompt). The output just reflects the mode contents, so when we produce output from different models, the output looks very different too (e.g. Alice vs. Apple).

3. Bigrams is Microscopic

The bigrams example shows the structure, but the real AI is, say, a million times deeper. If the bigrams example is the scale of a shoe, then the LLMs are the scale of a whole automobile.

So with this in mind, here are 2 points to keep in mind about AI..

1. The Model is not a Big Brain

After training, we don't have a big brain, or something that thinks. It does have a lot of intelligent patterns in it. We should not imagine that this big brain will provide analytical answers for us. It is able to reflect the sources it was trained on.

e.g. "Should we fund new nuclear reactors" or "how does the US help achieve peace in the middle east" are deep and complicated questions, in an environment with many unknowns. Nobody should think that addressing these questions to an AI is going to be a big improvement. I mention this, to avoid some wish-projection onto the AI, that we now have a source of great wisdom to solve problems for us.

2. Intelligence vs. Replay

The AI output sounds intelligent, but this is mostly a mirage. It is replaying human fragments with intelligence in them. This why the AI is not good at avoiding falsehoods.

We could say there is range of interpretations of the AI ranging from "intelligent" to "replay" explanations of what it's doing. There is certainly a lot of replay in the AI output.

BUT now I'll try to portray AI's strengths..

3. Replay is a form of intelligence?

Maybe replay is not 100% separate from intelligence. Isn't replay a big part of how you compose sentences, pulling up fragments from your experience?

Corollary: many problems maybe can be solved with a lot of replay. Problems that are kind of repetitive, like say, answering customer service questions that come in to some corporate help line. Many of the questions will resemble past questions.

Niches where good answers can be mined from the corpus of previous answers are where AI will do best at first.

4. It's Day 1 of the AI Revolution

However lame in some ways the AI is now, this is day 1. Vast amounts of money and talent are going into a race to further develop AI. I expect to see big improvements from today's state.

Infinite Poetry Example

Say we have the beginnings of poems, each stored as a dict json text, like this. Each poem has a title and some other things, and a list of text lines under the key 'lines', as shown below. (I built this patterned off Chris Piech's material for calling AI from Python, and homework 8 will work similarly.)

Here each poem is represented by JSON text of a dict about the poem. JSON mostly looks like Python, but requires double quotes.

# poem1.txt
{
    "title": "lecture poem",
    "tone": "silly",
    "comment": "this one seems to induce some rhyming, fewer lines may work",
    "lines": [
        "There once was a bird named flappy",
        "Who liked to get kind all kind of nappy",
        "We stood in a tree",
        "And they said unto me"
    ]
}


# poem3.txt
{
    "title": "lecture poem",
    "tone": "jaunty",
    "lines": [
        "Bippity bip bip",
        "Bap bap bap",
        "Fiddly dit dit"
    ]
}

Prompt String

The Python code builds a prompt string to feed to the AI. In the prompt, the first poem gives the format we want, and the second is spelling out what the poem looks like thus far. We want the AI to add a line.

In the code we have EXAMPLE_POEM

EXAMPLE_POEM = {
    "title": "lecture poem",
    "tone": "silly",
    "lines": [
        "It was not so hot",
        "A thing I say a lot",
    ]
}

Here is an example prompt (i.e. these are instructions we feed the AI):

Return this poem with one line added. The result should be formatted in json like this: {"title": "lecture poem", "tone": "silly", "lines": ["It was not so hot", "A thing I say a lot"]}. The poem thus far is: {"title": "lecture poem", "tone": "silly", "comment": "this one seems to induce some rhyming, fewer lines may work", "lines": ["There once was a bird named flappy", "Who liked to get kind all kind of nappy", "We stood in a tree", "And they said unto me"]}.

Recall Format String

We have the neat f'hi {name}' format string feature here, where the python expression inside curly braces is pasted into the surrounding text. We use this below to construct the prompt string.

>>> name = 'alice'
>>> 
>>> f'Hi there {name.upper()}'
'Hi there ALICE'

Recall JSON

The json.dumps(x) function takes in any data structure, and returns it encoded as JSON text, which we will use in the prompt.

>>> d = {'a': 'b'}
>>> import json
>>> json.dumps(d)
'{"a": "b"}'

Code To Create Prompt

Here is the key function, making the prompt. The prompt is just a string the python code is putting together. The code uses format strings f' ..{expr} ..'. The format string has an 'f' to the left of the string. Then curly braces in the string enclose expressions. Each expression is evaluated and its result is pasted into the string at that spot. In this code, calls to json.dumps(d) are used to compute the json string for each dict, and paste that into the prompt. Recall that "dump" in JSON refers to creating a big string that represents the input data structure.

EXAMPLE_POEM = {
    "title": "lecture poem",
    "tone": "silly",
    "lines": [
        "It was not so hot",
        "A thing I say a lot",
    ]
}

def extend_poem(poem):
    """
    Given poem dict, AI adds something to make a longer poem,
    which is returned.
    """
    print('[Suspenseful music plays as the AI thinks...]')
    # Form the prompt, giving example json, and mentioning
    # the poem thus far.
    prompt = ('Return this poem with one line added. ' +
        f'The result should be formatted in json ' +
        f'like this: {json.dumps(EXAMPLE_POEM)}. ' +
        f'The poem thus far is: {json.dumps(poem)}.')
    
    # This is the boilerplate to send the prompt to the AI
    chat_completion = CLIENT.chat.completions.create(
        messages=[
            {
                "role": "user",
                "content": prompt,
            }
        ],
        model="gpt-3.5-turbo",
        response_format={"type": "json_object"},
    )
    # Get the response back from the AI, convert back to a dict
    json_response = chat_completion.choices[0].message.content
    poem_new = json.loads(json_response)
    return poem_new

Interaction Demo

What this example does is call the AI to make a new line, then prompt the user to choose: "k" keep means to take the AI suggested poem and loop around to add a new line to that. Or "t" try again means to discard the AI suggestion, and loop around to have it try again with the poem as is.

In this example, I "t" try-again the first AI suggestion, and then "k" keep all the later ones.

[Suspenseful music plays as the AI thinks...]
{
    "title": "lecture poem",
    "tone": "jaunty",
    "lines": [
        "Bippity bip bip",
        "Bap bap bap",
        "Fiddly dit dit",
        "As we sipped on some tea"
    ]
}
Keep (k), Try again (t), Quit (q) ? t
[Suspenseful music plays as the AI thinks...]
{
    "title": "lecture poem",
    "tone": "jaunty",
    "lines": [
        "Bippity bip bip",
        "Bap bap bap",
        "Fiddly dit dit",
        "Lalalalalala"
    ]
}
Keep (k), Try again (t), Quit (q) ? k
[Suspenseful music plays as the AI thinks...]
{
    "title": "lecture poem",
    "tone": "jaunty",
    "lines": [
        "Bippity bip bip",
        "Bap bap bap",
        "Fiddly dit dit",
        "Lalalalalala",
        "Sippy sip sip"
    ]
}
Keep (k), Try again (t), Quit (q) ? k
[Suspenseful music plays as the AI thinks...]
{
    "title": "lecture poem",
    "tone": "jaunty",
    "lines": [
        "Bippity bip bip",
        "Bap bap bap",
        "Fiddly dit dit",
        "Lalalalalala",
        "Sippy sip sip",
        "On a bright sunny day"
    ]
}

Ethics and AI

Now we'll talk a little about ethics and AI.

Embedded Bias

The AI reflects whatever bias is in its source material.

"Bias" here just means that the data is not representative of the whole population, which may be a problem depending on what the AI is used for.

e.g. Suppose you trained an AI on nothing but TikTok videos. What's the average age there, like 20? You would not be surprised to get advice and views geared for 20 year olds from that AI.

e.g. A bias in US elections is that some groups are more likely to vote. In particular, old people vote at higher percentages. We are not surprised that Social Security, which benefits the elderly, is a big priority with politicians, reflecting the bias in who votes. (This is not necessarily a flaw in the voting system. The non-elderly are choosing to vote less.)

Currently AIs are trained on books and internet content, biased towards the developed world, and towards culture that is written (books, newspapers, magazines) or say, reddit type internet content. There is no doubt some bias in that source material.

We may have time to get to this today. Let's think about how AI might develop, and in part this is my opinion.

AI - Mostly Not Reasoning

IMHO: AI systems are more recall than reasoning at this time. Therefore, you should not count on AI having an analytical insight about something. Sam Altman has said their going to create an AI, and then ask it how to make money. I don't think this is likely to work - the AI is just a distillation of what it's seen. There's an effort to make the AI reason on its own, but I'm skeptical of this so far.

The Creative Economy

Think about things you consume in the creative economy:music, videos, books, the news, video games .. it's a big part of your day once you start listing it all out. You may pay for this in an creator-direct way, like you pay Netflix, and then Netflix pays the actors, directors etc. Spotify being another example. Of you "pay" with by watching ads attached to the content, like Youtube, or (traditional media) magazines, or broadcast TV. The point is, there is a stream of money going to the creators, motivating them to make the content.

Aside: In Medieval times, there was not copyright, so anybody could re-print a book. With copyright law, the book author gets paid, so there's an incentive to write and publish books. You may be annoyed at paying for some works of art, but you should simultaneously appreciate that paying for the art is helping to bring it into existence.

Where Does AI Get The Training Content?

Currently, the story is that AI is just taking it
All the hours of youtube
All the writing on the internet
Hard to police, as the training is done privately within the AI org
Someday, AI could arrange a payment to the training data holders
I don't think this will change the dynamics much
There's so much data to train on, leaving some out does not seem infeasible

Future Scenarios Creative Economy

Think about some ways AI might work out for the creative economy
Not predictions of what is most likely
Instead, think about the space of what is possible
An exercise in thinking about possible futures

Think About Actors

Think about an actor working in LA. What are some scenarios of how things might work out.

1. Suppose "AI Actors" Are Really Good

Suppose Generative AI for making "AI Actors" is really good and inexpensive, what would be some effects?

Winner: consumers of video would benefit, as the cost of producing this would go down
The volume of video might go up
Currently there's just one "Barbie Movie"
But maybe videos could be made in smaller niches, like "Vampire Themed Jane Austen" .. and the AI just makes it. A variety we don't have today.
Limitation: we already seem to live in an age of lots of video, so is there much room for improvement?

The loser in this would be the regular actors. They are being out-competed by the AI. Akin to candles being competed away by electric light.

On theory is that the AI will create a mass of content cheaply, but having "taste" about what's good is still human. In that case the human job will be shaping all this AU content.

Remember The Candlemaker

History of Illumination (20 minute podcast). For our ancestors, light at night was expensive and rare. Now light is so abundant, we scarcely think of it. Such technological advances are key driver in life being better. BUT some jobs will go away, e.g. candlemaker.

Suppose AI Actor is Mediocre, "slop"

Maybe AI content is cheap, but mediocre.

This case is what we have now, so not hard to imagine
I expect in this case, the slop and the human-content will exist side by side
High volume, low cost AI content .. increasing
Relatively high cost, high quality human content

We see lots of examples where the low cost thing thrives, but the co-exists

The Story of Restaurants

People eat restaurant food more, but home cooking is still common, and some people choose to cook. There is a balance, and people chose their own balance.

The Story of Music

In history there was only live music.
hen recording came along, and the minutes of music you could consume went up.
This did not destroy live music - now the co-exist
Many of you consume recorded music
And also at times you consume, live music
Live music thriving part of the economy, and exists in balance with recorded music

You can tell a story where the slop "crowds out" the high quality content, but that's not what we see happen in other domains. That said, being an actor or author may pay less, as AI is competing away part of the economy.

Slop vs. Creativity - e.g. Harry Potter

Certainly there's going to be more AI content. A question to ponder: think of a work of art that moved you. Could the AI in the future come up with something that inventive. I really enjoyed the Harry Potter book series, especially as a an audiobook on family road trips. Could an AI come up with something that inventive?

I don't think anybody knows. You could argue that there's an artistic lightening bolt that JK Rowling had, and that's human, and the AI will struggle to do it. OR you could argue that she read thousands of books, and then did a kind of re-mix in her head.

I chose Harry Potter as an example, because I don't think it resembles much other literature, so I question if generative AI can get there. Maybe? We'll find out!