TDM 20200

Only the best 10 of 14 projects will count towards your grade.

Topics are subject to change. While this is a rough sketch of the project topics, we may adjust the topics as the semester progresses.

Project Release date Due date

Project 1: Review: Jupyter Lab

8-Jan

19-Jan

Project 2: Introduction to web scraping with BeautifulSoup

11-Jan

26-Jan

Project 3: Introduction to web scraping with XPath

25-Jan

2-Feb

Project 4: Analyzing more than one hundred thousand XML files at once

1-Feb

9-Feb

Project 5: Extracting information about No Starch Press books from the OReilly website using Selenium

8-Feb

16-Feb

Project 6: Data Visualization

Feb 15

Feb 23

Project 7: Learning Dash

22-Feb

1-Mar

Project 8: Introduction to Spark SQL

Feb 29

Mar 8

Project 9: More Spark SQL and also streaming Spark SQL

7-Mar

22-Mar

Project 10: Introduction to Machine Learning

21-Mar

29-Mar

Project 11: More information about Machine Learning

28-Mar

5-Apr

Project 12: Introduction to containerization

4-Apr

12-Apr

Project 13: More information about containerization

11-Apr

19-Apr

Project 14: Feedback about Spring 2024

18-Apr

26-Apr

Projects are released on Thursdays, and are due 1 week and 1 day later on the following Friday, by 11:55pm. Late work is not accepted. We give partial credit for work you have completed — always submit the work you have completed before the due date. If you do not submit the work you were able to get done, we will not be able to give you credit for the work you were able to complete.

Always double check that the work that you submitted was uploaded properly. See here for more information.

Each week, we will announce in Piazza that a project is officially released. Some projects, or parts of projects may be released in advance of the official release date. Work on projects ahead of time at your own risk. These projects are subject to change until the official release announcement in Piazza.

Piazza

Piazza links remain the same from Fall 2023 to Spring 2024.

Syllabus

Navigate to the syllabus.