2b: Variables

2b: Variables#

Learning goals:#

Explain the function of variables in programs
Articulate basic principles of variable naming
Recognize good and bad examples of variable naming
Recognize NameErrors and common fixes

What are variables?#

Variables are a named place in the computer’s memory where a programmer can store data and later retrieve it using the variable name.

Think of a variable as a box with a label on it. You can put stuff in the box, take stuff out of the box.

For example:

x = 12.2
y = 14
print("x has the value ", x)
print("y has the value ", y)

Python will remember what’s in the x and y boxes, so you can do more stuff with it.

Like this:

x + y

And this:

x > y

You can also switch out what is in the variable boxes.

For example, let’s change what’s in x box.

# first print what the value of x is
print("x is ", x)
# then change it
x = 100
print("x is ", x)
# and change it again!
x = y + 35
print("x is ", x)

Variables are a kind of abstraction: a crucial element of computational thinking#

In computational thinking, we want to model data and develop/select algorithms to solve classes of problems, not just a specific individual problem. So one question programmers ask a lot is: what’s the underlying repeating structure here that I can or want to generalize and compose with other things?

Here’s a basic example of generalizing from “do multiplication with only these two specific numbers”, to “do multiplication with any two numbers” (i.e., the class of multiplication problems)

# a machine that multiplies 2 and 3
print(2 * 3)

# a machine that multiplies 4 and 5
print(4 * 5)

# a machine that multiplies 3 and 10
print(3 * 10)

# a machine that multiplies two numbers
x = 2
y = 20.5
print(x * y)

And another that goes from “add this specific person’s name to the end of each hello” to “add an input name to the end of each hello”

# a machine that adds "Joel" to the greeting
print("hello " + "Joel")

# a machine that adds "Rony" to the greeting
print("hello " + "Rony")

# a machine that adds a name to the greeting
username = "Joel"
print("hello " + username)

We could even generalize the greeting from hello if we want to!

# a machine that prints out a personalized greeting
username = "Joel"
greeting = "hello"
print(greeting + " " + username)

# a machine that prints out a personalized greeting
username = "Joel"
greeting = "ni hao"
print(greeting + " " + username)

# a machine that prints out a personalized greeting
username = "Joel"
greeting = "what's up"
print(greeting + " " + username)

HowTo: Create and update variables#

We assign a value to a variable using an assignment statement, which consists of:

An expression on the right-hand side that tells you what value should go in the variable,
An assignment operator (=), and
The name you want for the variable

NOTE THE DIFFERENCE BETWEEN = and ==!!!

# multiply 3 by 5 and put the resulting value in the variable box labeled "x"
x = 3 * 5
x

y = "joel" + " chan"
y

Updating a variable also happens with an assignment statement

x = 3 * 5 # create the variable x and assign its initial value
print("x has the value", x)
x = 22 # update the value of the variable x with the value 22
print("x now has the value", x)

Choosing names for your variables#

Syntax#

In terms of syntax (remember our division between computational thinking and coding? this is coding), there aren’t a ton of restrictions for naming variables:

Must contain at least one letter
Must start with a letter or an underscore (_)
Must not be a “reserved word”
- Non-exhaustive list: False, None, class, if, and, as, else
- Full list here (can also Google “python reserved words”). Don’t need to memorize (you’ll naturally remember this over time), but definitely keep handy

So this is ok:

ten2 = 5

This is bad:

2 = 5

Running it will yield a somewhat helpful error message:

  File "/var/folders/xz/_hjc5hsx743dclmg8n5678nc0000gn/T/ipykernel_21680/2360489726.py", line 1
    2 = 5
    ^
SyntaxError: cannot assign to literal

Remember that bottom left bit? It says “syntax error” which is helpful: it basically always means there’s something about the way you wrote the code that’s not valid Python code. Think of it like a grammatical or spelling error in English. The bottom right bit, in this case? Not so helpful if you’re a beginner, but here it’s basically saying “hey you’re trying to assign a thing to a variable, but it’s… not a valid variable, it’s a value (literal)!”

This is also bad (None is reserved)

None = 6

Will yield this error message:

  File "/var/folders/xz/_hjc5hsx743dclmg8n5678nc0000gn/T/ipykernel_21680/774819309.py", line 1
    None = 6
    ^
SyntaxError: cannot assign to None

Semantics#

The more important piece is the computational thinking piece. How do you choose variable names that assist with your ability to formulate problems, model data, and debug your programs?

Our fundamental principle here is: choose names that make the logic of the program legible. In other words, it should be easy for someone to read the code and guess what the program is doing at least in part based on the names of the variables.

Tip

Choose variable names that make the logic of the program legible.

For example, consider this chunk of code:

# compute pay for an employee
a = 35.0
b = 12.50
c = a + b
print(c)

What do you think this code does? What do you think the value types of the variables should be? What about the operators/expressions? Do you spot anything that might be wrong here? (hint: there is no syntax error here, only a semantic one!)

How about now?

# compute pay for an employee
hoursWorked = 35.0
hourlyRate = 12
pay = hoursWorked + hourlyRate
print(pay)

Answer:

For me at least, the 2nd version makes it clearer that the program shouldn’t have + in there: it should be *, since pay is a function of hours worked times hourly rate.

Also, say an employee told you they needed to update their number of hours worked. Which variable would you need to update?

You’ll be surprised how often you can get unstuck simply by clarifying the names of the variables (which makes the structure of the program clearer, and the source of the problem obvious).

Example: debug a program that is supposed to compute a total check with 20% tip after accounting for 7% tax

# compute a total check with 20% tip after accounting for 7% tax
a = 15.00
b = 0.2
c = 0.07

d = c * (a + a*b)
e = a + d
e

Compare:

# compute a total check with 20% tip after accounting for 7% tax
baseAmount = 15.00
tipRate = 0.2
taxRate = 0.07

tipAmount = taxRate * (baseAmount + baseAmount*tipRate)
totalCheck = baseAmount + tipAmount
totalCheck

Answer:

For me at least, the 2nd version makes it clearer that the program is mixing up the tip and the tax rate!

tipAmount = taxRate * (baseAmount + baseAmount*tipRate)

Should instead be:

tipAmount = tipRate * (baseAmount + baseAmount*taxRate)

Again, these are the same exact programs, from Python’s perspective! The variable names make all the difference.

If possible, I also like to name my variables in a way that makes clear what kind of data is in it. This helps me keep track of what data types are in my variables, since, as we’ve discussed, operators in expressions expect certain data types, and can (as in +) have different meanings depending on the values involved.

For example:

userName instead of a, which makes it clear that there’s probably some kind of str in there.
isFunny instead of x, which makes it clear that there’s probably a bool in there
numCredits instead of y, which makes it clear that there’s probably some kind of number in there

By convention, you might see people use certain names for certain kinds of things. For example, i is often used to refer to a counter value. s (or some variant of it) is often used to refer to a string.

To sum up, you should feel free to name variables whatever makes sense to you, as long as you feel they accurately signal the logic of the program they’re in. Your future self (and current/future collaborators) will thank you for following this fundamental principle.

To reinforce the point, I recommend:

a collection of programming horror stories about variable naming here
this StackOverflow thread for discussion of the importance of variable naming (in the context of discussing code readability, a central thing we care about it in this class, enough to make it a rubric item on your Projects!). The thread includes some links to style guides from Microsoft, Python, and other sources.
and this discussion of variable naming in a data science context

Let’s practice naming variables!#

Drawing on the rules and principles we’ve discussed here, practice defining the key variables for Python programs that will solve the following problems:

You’re writing a Python program to help instructors triage requests to join a class off the waitlist. To start with, have your program consider factors like how large the room is relative to the number of students, and your instructional team capacity.
You’re writing a Python program to help a small coffee shop track its inventory. The program needs to consider how much of each ingredient is available, how much is used per drink, and when to reorder.
A public library wants a simple script to calculate late fees for book returns. The program should consider factors like how many days a book is overdue, the daily fine rate, and any maximum penalty cap.
You’re designing a basic traffic light control system for an intersection. The program should consider traffic density, pedestrian crossing time, and standard light cycle durations.

The `NameError`#

Remember: computers (and Python) are very literal. For variables, this means everything needs to be exactly the same when you’re referring to a variable.

For example, what do you think will happen if you run the following code?

myNumber = 125
anotherNumber = 65
mynumber + anotherNumber

Answer:

You should get an error with this message on the bottom:

NameError: name 'mynumber' is not defined

Remember our map for reading errors? Bottom left says it’s a “NameError”, and bottom right says “you’re asking me to do something with the variable mynumber, but I don’t know what it is: you haven’t defined it for me! It’s like asking someone who knows nothing about football, “what play did they run on third down?” (error: third down is not defined)

The NameError is probably going to show up a lot this semester. It’s basically this:

“not defined” = “I can’t find the box you’re asking me to find”

Reasons this can happen:

You misspelled the variable
You did not have an assignment statement that defined the variable before you asked Python to do something with it

For the first one, a fun tip is to use the tab autocomplete feature in your editor. Basically, if you have a variable defined already, you can start typing, hit tab, and editors like VSCode will autocomplete for you. This helps reduce/eliminate misspellings. Nifty!

_images/variable-tab-autocomplete-ide.gif

If there are multiple similar ones, you can choose between them with arrow keys, like this:

_images/variable-tab-autocomplete-multiple-ide.gif

Another tip is to use a “linter” (like the Ruff extension we recommend for VSCode), which will alert you to a potential NameError as you’re typing, like spell check!

Let’s practice detecting and fixing NameErrors!#

The following code will yield a NameError when you try to run it. Fix the bug!

cost = 80
Discount = 0.25
SalePrice = Cost * (1 - Discount)
SalePrice

The following code will yield a NameError when you try to run it. Fix the bug!

numChars = 5
hasNumbers = True
(numChars >= 8) and (hasLetters == True) and (hasNumbers == True)

The following code will yield a NameError when you try to run it. Fix the bug!

isRaining = True
temp = 35

isRaining == true and temp < 40

Managing “types” with variables#

Remember how we said that data types matter? Because some operators only work with certain data types?

This means you need to make sure you keep track of / control what data types are going in your expressions. If you never use variables, it’s a bit easier, bc you can clearly see what type the values are.

But with variables, keeping track of data types can be tricky in Python. This is because Python is a dynamically typed language. This means that when the computer runs a Python program, it dynamically guesses the “type” of a variable box. It also means that the type of data that can go in a variable box is “dynamic” (i.e., can be changed). This removes some of the overhead to writing code, but you do need to be careful, since Python’s guesses may not always match your intentions! And we know that mixing data types in statements leads to bugs.

Side note: if you’ve learned another programming language before, you might find this unfamiliar. For example, in Java, which is a statically typed language, you have to declare what type a variable is when you create it, and the type won’t change.

Find out what type a variable is with `isinstance()` or `type()`#

You can use the built-in functions isinstance() or type() to figure out what is inside a variable.

a = 1
b = 2
c = a/b
type(c)

a = "1"
b = 1
c = 1.0
print(a, "is a", type(a))
print(b, "is a", type(b))
print(c, "is a", type(c))

Aside: here I’m using a , to join multiple things into a string, instead of the +. Ignore it for now, but if you’re curious, the reason is this , operator tells Python to automatically convert all the things into strings before trying to concatenate them together.

You can also write an expression that can test this

# is a a string?
type(a) == str

It’s easier to do this with isinstance(), but a bit tricky to understand it completely for now until we understand functions.

For example, to check if a is a string, we can do:

# is a a string?
isinstance(a, str)

The first thing in the parenthesis is the variable you want to check, and the second thing is the data type you want to match it to.

Another example:

a = 1
# check if a is an int
print(isinstance(a, int))
b = 2
c = a/b
type(c)

“Casting” variables to change their type#

If we really want to make sure that data types are what we expect them to be, we can use “cast” functions. These are the same name as data types, and they basically “force” a value to become a certain data type. You can pass in raw values (or “literals”) or variables.

For example:

# an int
x = 2
print("x is ", x)
print("x is a ", type(x))

# change to a str
x = str(x)
print("x is ", x)
print("x is a ", type(x))

# change to a float
x = float(x)
print("x is ", x)
print("x is a ", type(x))
x = int(x)
print("x is ", x)
print("x is a ", type(x))

Let’s go back to a common use case for this. Making sure that the data that will go in a math expression are all number types (otherwise we run into issues!)

# this will produce our TypeError
x = 3
y = "2"
x + y

Here’s a fix:

# if we want to do math, need to convert y to a number
x = 3
y = "2"
# cast the value of y to be an int before doing addition
x + int(y)

And if we want to make sure we’re doing concatenation:

# if we want to do concatenation, need to convert x to a string
x = 3
y = "2"
# cast the value of x to be an str before doing concatenation
str(x) + y

One thing to keep in mind: you can only cast something into a data type if it “looks like” the “literal” for that data type. Almost anything “looks like” the literal for a string, since you can just slap quotes around it and it becomes a string. But some data types are more fussy about their literals: for example, the literal for an int must be a valid set of digits.

So, for example, this will yield an error:

int("three")

Because "three" doesn’t “look like” the literal for an int, you can’t turn it into an int.

What do you think will happen with this? Feel free to paste this code into the Python REPL to find out!

int("$5,000")

Answer:

An error! It sorta looks like a number to humans, but notice what’s in there that’s not a number? The $ and ,! This is a common situation we’ll return to in the next module when we talk about strings (e.g., how to parse a string to get values we want out of it, such as numbers).

2b: Variables

Contents

2b: Variables#

Learning goals:#

What are variables?#

Variables are a kind of abstraction: a crucial element of computational thinking#

HowTo: Create and update variables#

Choosing names for your variables#

Syntax#

Semantics#

Let’s practice naming variables!#

The NameError#

Let’s practice detecting and fixing NameErrors!#

Managing “types” with variables#

Find out what type a variable is with isinstance() or type()#

“Casting” variables to change their type#

The `NameError`#

Find out what type a variable is with `isinstance()` or `type()`#