Python Workshop: Basics I
[If you are looking for the Python, VSCode & packages installation guide, it's here.]
Based on:
this git of Zhiya Zuo
&
tutorials from tutorialspoint
Introduction
Python is an interpreted, high-level, general-purpose programming language. Created by Guido van Rossum and first released in 1991, Python's design philosophy emphasizes code readability. The language construction and object-oriented approach aims to help programmers write clear, logical code for small and large-scale projects. [Wikipedia]
Python is Interpreted - Python is processed at runtime by the interpreter. You do not need to compile your program before executing it. This is similar to PERL, PHP and MATLAB.
Python is Interactive - You can actually sit at a Python prompt and interact with the interpreter directly to write your programs.
Python is Object-Oriented - Python supports Object-Oriented style or technique of programming that encapsulates code within objects.
Popular Language for Data Analysis - Most of the time, you will need external packages to assist data analyses.
VSCode
VSCode is currently (2019) the most commonly used IDE (Integrated development environment) for programing in python.
for complete tutorial on installation of Python + VSCode + additional packages please refer to this page.
Jupyter notebook
Jupyter is an easy way to merge code and explanations in a beautiful way.
The easiest way to interact with such notebook (.ipynb) is with google colab. There you can run each cell independently or all cells combined through 'Runtime' section or the play button.
the main disadvantage of google colab is that debugging there is problematic.
Naming convention
There are two commonly used style in programming:
- camelCase
- snake_case or lower_case_with_underscore
Always make sure you use one convention consistently across one project.
All variable (function and class) names must start with a letter or underscore (_). You can include numbers, but it can't be the first char.
myStringHere = "my string" # valid
x = 3 # valid
x_3 = "xyz" # valid
# 3_x = "456" # invalid. Numbers cannot be in the first position.
if True:
print("True")
else:
print("False")
x = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
y = x[0] # get element 0
print("y = " + str(y))
y = x[2] # get element 2
print("y = " + str(y))
y = x[0:2] # get elements 0,1
print("y = " + str(y))
y = x[:2] # same as [0:2]
print("y = " + str(y))
y = x[3:10]
print("y = " + str(y))
y = x[3:] # same as above
print("y = " + str(y))
You can also go to last element easily:
y = x[-1] # last element
print("y = " + str(y))
y = x[:-1] # all until the last element - noninclusive
print("y = " + str(y))
y = x[-3:] # last three elements
print("y = " + str(y))
Step size is also an option:
y = x[0:3:2] # only evens until 3 noninclusive
print("y = " + str(y))
y = x[1:5:2] # only odds until 5 noninclusive
print("y = " + str(y))
y = x[::3] # +3 step size - entire list
print("y = " + str(y))
y = x[::-1] # reverse!
print("y = " + str(y))
Primitives
In this section, we go over some common primitive data types in Python. While the word primitive looks obscure, we can think of it as the most basic data type that cannot be further decomposed into simpler ones.
Numbers
numbers without fractional partsare called integer. In Python, they are abbreviated as int
x = 3
type(x)
numbers with fractional part are floating point numbers. They are named float
in Python.
y = 3.0
type(y)
We can apply arithmetic to these numbers. However, one thing we need to be careful about is type conversion. See the example below.
z = 2 * x # int times int
type(z)
z = y**x # int to the power float
print(z)
type(z)
z = x / 2 # what will happen when dividing two ints?
z
my_error = 1.6
compare_result = my_error < 0.1
print(compare_result)
print(type(compare_result))
There are two valid Boolean values: True
and False
. We can also think of them as 1
and 0
, respectively.
my_error > 0
When we use Boolean values for arithmetic operations, they will become 1/0
automatically
(my_error > 0) + 2
my_str1 = "see you"
print(my_str1)
print(type(my_str1))
my_str2 = "see you later"
print(my_str2)
print(type(my_str2))
We can also use +
to concatenate different strings
my_str1 + " tomorrow"
One way of formatting strings is equivalent to c language:
print("1/3 is approximately %.2f" % (1 / 3)) # %f for floating point number
print(" '%s' != '%s'" % (my_str1, my_str2)) # %s for string
you can also simply do string concatenation:
print("Printing a string: " + my_str1 + ", and printing a number: " + str(3))
str
is an iterable object, meaning that we can iterate through each individual character:
print(my_str1[0])
print(my_str1[2:6])
a_list = [1, 2, 3] # commas to separate elements
print("Length of a_list is: %i" % (len(a_list)))
print("The 3rd element of a_list is: %s" % (a_list[2])) # Remember Python starts with 0
print("The last element of a_list is: %s" % (a_list[-1])) # -1 means the end
print("The sum of a_list is %.2f" % (sum(a_list)))
We can put different types in a list
b_list = [20, True, "good", "good"]
print(b_list)
Update a list: pop, remove, append, extend
print(a_list)
print("Pop %i out of a_list" % a_list.pop(1)) # pop the value of an index
print(a_list)
print("Remove the string good from b_list:")
b_list.remove("good") # remove a specific value (the first one in the list)
print(b_list)
a_list.append(10)
print("After appending a new value, a_list is now: %s" % (str(a_list)))
merge a_list
and b_list
:
a_list.extend(b_list)
print("Merging a_list and b_list: %s" % (str(a_list)))
We can also use +
to concatenate two lists
a_list + b_list
a_tuple = (1, 2, 3, 10)
print(a_tuple)
print("First element of a_tuple: %i" % a_tuple[0])
You can't change the values of a tuple:
# a_tuple[0] = 5
In order to create a single value tuple, you need to add a ','
a_tuple = 1 # this would create a int type
print(type(a_tuple))
b_tuple = (1,) # this would create a tuple type, take note of the comma.
print(type(b_tuple))
d = {} # empty dictionary
# add a key-value by using bracket (key). You can put anything in key/value.
d[1] = "1 value"
print(d)
# Use for loop to add values
for index in range(2, 10):
d[index] = "%i value" % index
print(d)
print("All the keys: " + str(d.keys()))
print("All the values: " + str(d.values()))
for key in d:
print("Key is: %i, Value is : %s" % (key, d[key]))
Side note: mutable Vs. immutable objects
Everything in Python is an object, and object can be either mutable (changeable after creation) or immutable. Almost all Python objects are mutable, except from the primitives (numbers, booleans, strings) and tuples
Why it's interesting? because when you reference a new variable, it's always soft link (like a shortcut in windows), and if you change a mutable object, it reference changes too! Something that can cause big bugs!
# immutable object: no problem
a = "Hello"
b = a
b = b + " World!"
print(a)
print(b)
# mutable object: big problem
a = ["Hello"]
b = a
b[0] = "World!"
print(a)
print(b)
# use .copy() to overcome this:
c = a.copy()
c[0] = "other world"
print(a)
print(c)