Welcome to the Using Tabular data in Python Course

This course is aimed at people who have completed the Enviromental Scientist’s Introduction to Python Course. If You have not completed this course, we recommend at least reviewing the course to make sure you have comparable background in basic python. The goal of this course is to introduce tabular data manipulation in python and to get you to the point that you no longer need to use excel for your data analysis.

Python build for this course

If you are not familar with using virtual python enviroments, we reccomend you review our lesson on installing python. For This course we recommend installing python as follows:

This installation includes basic python, the packages numpy and pandas, and the IDE spyder. It also installs matplotlib, which is a pandas dependency if you are using the plotting functionality.

  1. Install miniconda (if you haven’t already)
    1. go to https://conda.io/miniconda.html and download the appropriate python 3.6 installer and accept all of the defaults
  2. Create a virtual environment for this course (tdip) for Tabular Data In Python
    1. open anaconda prompt and enter:
    conda create -n tdip python=3.6 spyder numpy pandas matplotlib
    
  1. When conda asks you to proceed, type y:

    proceed ([y]/n)?
    
  2. That’s it python and spyder for this course should now be installed. To use python with spyder, in the start menu (under anaconda) you should see spyder (tdip). Open that up and get cracking!

Practice Exercises

We have developed a set of practice exercises to give you a taste of doing your own scripting. These exercises are facillitated through Github Classroom. If you want to get on to the exercises, this link will create a new repository with a copy of the exercises for you to begin working.

exercise prerequisites / associated lessons
exercise 1
exercise 2
exercise 3
exercise 4
exercise 5
exercise 6