STAT 29000

Only the best 10 of 14 projects will count towards your grade.

Topics are subject to change. While this is a rough sketch of the project topics, we may adjust the topics as the semester progresses.

Project Release date Due date

Project 1: Introduction to XML

January 6

January 21

Project 2: Web scraping in Python: part I

January 20

January 28

Project 3: Web scraping in Python: part II

January 27

February 4

Project 4: Web scraping in Python: part III

February 3

February 11

Project 5: Web scraping in Python: part IV

February 10

February 18

Project 6: Plotting in Python: part I

February 17

February 25

Project 7: Plotting in Python: part II

February 24

March 4

Project 8: Writing Python scripts: part I

March 3

March 11

Project 9: Writing Python scripts: part II

March 17

March 25

Project 10: Plotting with ggplot: part I

March 24

April 1

Project 11: Plotting with ggplot: part II

March 31

April 8

Project 12: Tidyverse and data.table: part I

April 7

April 15

Project 13: Tidyverse and data.table: part II

April 14

April 22

Project 14: Tidyverse and data.table: part III

April 21

April 29

Projects are released on Thursdays, and are due 1 week and 1 day later on the following Friday, by 11:59pm. Late work is not accepted. We give partial credit for work you have completed — always submit the work you have completed before the due date. If you do not submit the work you were able to get done, we will not be able to give you credit for the work you were able to complete.

Always double check that the work that you submitted was uploaded properly. See here for more information.

Each week, we will announce in Piazza that a project is officially released. Some projects, or parts of projects may be released in advance of the official release date. Work on projects ahead of time at your own risk. These projects are subject to change until the official release announcement in Piazza.

Syllabus

See here.