Oct 26-27, 2017
9:00 am - 5:00 pm
Instructors: Daniel Chen
Helpers:
Cleaning Data in Python
Supervised Learning with scikit-learn
Unsupervised Learning in Python
Pandas Foundations
Manipulating Time Series Data in Python
Intro to Python for Data Science
Importing Data in Python (Part 1)
Importing Data in Python (Part 2)
Importing & Managing Financial Data in Python
Python Data Science Toolbox (Part 1)
Python Data Science Toolbox (Part 2)
Manipulating DataFrames with pandas
Intermediate Python for Data Science
Introduction to Data Visualization with Python
Interactive Data Visualization with Bokeh
Statistical Thinking in Python (all the parts)
Natural Language Processing Fundamentals in Python (as needed)
Introduction to Databases in Python
09:00 | Different ways you can interface with Python |
09:30 | Pandas DataFrame basics |
10:30 | Break |
10:40 | Pandas data structures |
11:40 | Quick overview of plotting methods |
12:00 | Lunch |
13:30 | Assembling data |
14:00 | Missing Values |
14:30 | Break |
14:45 | Data Reshaping |
15:45 | Data Types |
16:15 | Functions |
17:00 | Finish Day 1 |
09:00 | Strings and Text Data in Python |
10:00 | Functions and Apply |
10:30 | Break |
10:45 | Applying over rows and columns |
11:15 | Grouped operations |
12:00 | Lunch |
13:00 | Dates and Times |
14:00 | Linear Models |
14:30 | Break |
14:45 | Fitting Models |
15:45 | Thinking about performance |
16:00 | Good Practices and Wrap-up |
17:00 | Finish Day 2 |
To participate in a workshop, you will need access to the software described below. In addition, you will need an up-to-date web browser.
We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.
Python is a popular language for research computing, and great for general-purpose programming as well. Installing all of its research packages individually can be a bit difficult, so we recommend Anaconda, an all-in-one installer.
Regardless of how you choose to install it, please make sure you install Python version 3.x (e.g., 3.4 is fine).
We will teach Python using the IPython notebook, a programming environment that runs in a web browser. For this to work you will need a reasonably up-to-date browser. The current versions of the Chrome, Safari and Firefox browsers are all supported (some older browsers, including Internet Explorer version 9 and below, are not).
bash Anaconda3-and then press tab. The name of the file you just downloaded should appear. If it does not, navigate to the folder where you downloaded the file, for example with:
cd DownloadsThen, try again.
yes
and
press enter to approve the license. Press enter to approve the
default location for the files. Type yes
and
press enter to prepend Anaconda to your PATH
(this makes the Anaconda distribution the default Python).
conda install xlwt openpyxl feather-format seaborn statsmodels scikit-learn regex wget odo numba
pip install lifelines pandas-datareader