Spring 2024 Syllabus - The Data Mine Corporate Partners

Course Information

Course Number and Title CRN

TDM 11200 – Corporate Partners II

CRNs vary

TDM 21200 – Corporate Partners IV

CRNs vary

TDM 31200 – Corporate Partners VI

CRNs vary

TDM 41200 – Corporate Partners VIII

CRNs vary

TDM 51200 – Corporate Partners

CRNs vary

Meeting times: Two times per week (50 minutes LEC + 1 hour 50 minutes LAB), dependent on your assigned Corporate Partner Team. The team meeting times are listed on your registered course schedule.

Instructor Modality: Hybrid. LEC is online and LAB is in-person.

Course credit hours: 3 credits

Prerequisites:

  • Undergraduate students: To be enrolled concurrently in TDM 10200, 20200, 30200, or 40200 The Data Mine II or IV or VI or VIII (1 credit seminar).

  • Graduate students: Application required. No other prerequisites.

Course Site

All course material will be posted here in the Corporate Partners section of The Examples Book.

Grades can be viewed in Brightspace and assignments will be submitted through Gradescope.

Course Schedule & Due Dates

Due dates are listed in the Spring 2024 Student Course Schedule

Information about the Instructors

The Data Mine Staff

Check out our staff welcome page to see the team that supports The Data Mine

Email Policy

The Data Mine Team uses a shared email which functions as a ticketing system. Using a shared email helps the team manage the influx of questions, better distribute questions across the team, and send out faster responses.

Send your questions to [email protected]

  • Please use your official @purdue.edu email address to communicate with us.

  • If you have not received a response within 1 business day (Mon-Fri), please resend the email.

  • When emailing us, please place your Corporate Partner team in the subject line of the email (e.g., Merck Biometrics Team – Symposium Poster Question). This will help us respond to your emails quickly.

Use this table to send your questions to the most appropriate email. If you’re not sure who to ask, navigate to the Questions page for more details.

Who should you email? Types of Questions/Topics

Your CRP TA

Weekly tasks, catching up on a missed meeting, general team questions

[email protected]

General Data Mine/Corporate Partners, grades, technical project or topic questions

Your CRP Mentor

(Check with your TA first) Project specific questions

Corporate Partner Mentors

Your Corporate Partner Mentor(s) are employees of the company you are working with. You will meet once a week with them online (unless the company is in West Lafayette, IN or visiting Purdue for a meeting). Please treat all communication with them in a professional manner – they are like your supervisor during an internship. They also act as a product owner and can help to answer questions or find resources for any product specific questions that you have during the project.

Corporate Partner TAs

Our Corporate Partner TAs will serve as peer mentors and team leaders for each of our projects. Nearly all of our CRP TAs have been in The Data Mine Corporate Partners program in past years and many are returning to the same project. The CRP TAs will lead the weekly student meeting (lab – 2 hours) when you are not meeting with the Corporate Partner Mentor(s). Your CRP TA should also be the first person to go to when you have a technical question or questions about your project.

Data Scientists

The data scientists employed by The Data Mine are here to help students with the technical topics and concepts that they will encounter during their projects. The data scientist team has a varied background in topics such as natural language processing (NLP), geospatial information systems (GIS), high performance computing, and machine learning.

If you have a question for a member of the Data Science team, please email [email protected] and your question will be answered by a member of the Data Science Team. If our doors are open, you can drop by and ask for help. They will bring in other members of the team as needed. They are here to help so don’t be worried when asking questions!

When scheduling a meeting with a data scientist keep in mind that they are designed to be collaborative. The team wants to see any solutions that you’ve attempted and where you may be getting stuck. Also, for more complicated questions it helps to give some advance notice of the topics over email. We aren’t experts in all of data science and some research may be required.

Course Description

Students in The Data Mine Corporate Partners Learning Community will work in interdisciplinary teams with Corporate Partner Mentors on a variety of data-driven projects. They will analyze real data related to questions that the Corporate Partner proposes. Most projects will last for a full academic year (late August through late April), with multiple reports and presentations given more frequently. The mentor is expected to meet with the students weekly by Microsoft Teams, or (more rarely) in person. Students are expected to actively participate in these meetings and in all individual and group work. The goal of the course is to help students build impactful industry related skills in data science, visualization, and data engineering. The Data Mine staff also has data scientists who can assist students with technical questions focused on the skills being built and the research conducted. Students can work on real-world industry facing issues that have a high value add for the corporate partner.

Learning Outcomes

By the end of this course, you will be able to:

  1. Discover and apply data science tools to manage data sets from Corporate Partners through researching, cleaning, processing, analyzing, and visualizing data.

  2. Apply Agile project management methodology to plan task ownership and decision making, collaborate with scrum teams to accomplish the increment during 2-week sprints, review the product backlog, and reflect on areas of success and improvement.

  3. Engage with peers to identify and overcome complex challenges in the data sciences.

  4. Effectively communicate findings of technical research through detailed documentation and team presentations.

  5. Discover professional development opportunities in order to prepare for your career.

Logistics

Office Hours

The Data Mine staff offer office hours by request. Please email [email protected] if you need to request a meeting. Students are always welcome to stop by staff offices Monday - Friday in Convergence located at 101 Foundry Dr., West Lafayette, IN 47906.

Class/Team Meeting Times

50-minute team meeting

This meeting will occur synchronously online via MS Teams unless your Corporate Partner Mentors are located in West Lafayette or visiting campus. Online links are shared via a calendar invite at the beginning of the year. You can join this meeting from anywhere, but please follow the “net-etiquette” guidelines to find a quiet space.

1 hour 50-minute student labs

This meeting will occur in person for all teams except for National Data Mine Students which will occur virtually for all teams. This is dedicated work time with your team members to collaborate on your project and to work as a larger group or as sub-teams. The meetings will be held in Hillenbrand Residence Hall (HILL) at 1301 Third Street, West Lafayette, IN 47906 or Shreve Residence Hall (SHRV) at 1275 3rd Street, West Lafayette, IN 47906 unless otherwise noted.

Our image
Figure 1. Map of campus featuring Hillenbrand (HILL), Streve (SHRV), and Convergence (CONV)

Required Materials

Assignments and Grades

Late Policy

We do NOT accept late work, unless there are extenuating circumstances.

Extenuating circumstances do NOT include:

  • Having exams near or on the due date

  • Working on other course projects on or near the due date

  • Being sick for a few days on or near the due date

  • Traveling for any reason

  • Forgetting the due date

  • Having technical difficulties (wifi, computer, etc)

It is better to submit a partially done report than nothing at all. Partial credit can be earned for work turned in on time. The electronic submission systems also do not allow for late work.

Grade Expectations

This is a research-type, project-based course, so the majority of your grade for the semester will be determined holistically based on work with Corporate Partners in addition to reports and other assignments per the schedule. Students will receive their own individual grade, but the success of the group will be a component of that individual grade.

It is very important to check your @purdue.edu email, Brightspace, Gradescope, and The Examples Book pages frequently! Please review the schedule. More details for each assignment will be available in The Examples Book.

Due dates are listed in the semester schedule.

You will need to complete the tasks detailed on each sprint page. The first sprint is covered here: Sprint 1. Additional tasks specific to your project will be discussed with your CRP Mentor, TA, and team.

On Wednesday, April 24, 2024, 4:00 pm - 6:30 pm there will be a Corporate Partners Symposium at France A. Córdova Recreational Sports Center to showcase the work you have done throughout the year to corporate partner mentors and guests. All Corporate Partner students will be required to attend the symposium presentation to present their work. More details will be forthcoming and posted in The Examples Book.

The Data Mine does not conduct an exam during the final exam period. Therefore, Corporate Partner Courses are not required to follow the Quiet Period in the Academic Calendar.

Grade Breakdown

Agile 2-week Sprints

60%

Seven 2-week sprints. Click on the pages for each sprint for specific assignments.

Sprint 1

12%

Sprint 2

8%

Sprint 3

8%

Sprint 4

8%

Sprint 5

8%

Sprint 6

8%

Sprint 7

8%

Corporate Partners Mentor and TA Evaluation

15%

Mid-Semester Evaluation

5%

Final Evaluation (cumulative of entire spring 2024 semester)

10%

Symposium

25%

Drafts (poster, video script)

5%

Final Poster, Final Video & Presentation at Symposium on April 24, 2024

20%

TOTAL

100%

This course will follow the 90-80-70-60 grading scale for A, B, C, D cut-offs. If you earn a 90.000 in the class, for example, that is a solid A. +/- grades will be given at the instructor’s discretion below these cut-offs. If you earn an 89.11 in the class, for example, this may be an A- or a B depending on the course grade distribution at the end of the semester.

  • A: 100.000% – 90.000%

  • B: 89.999% – 80.000%

  • C: 79.999% – 70.000%

  • D: 69.999% – 60.000%

  • F: 59.999% – 0.000%

Agile

The Data Mine will be applying Agile project management to all of our Corporate Partner projects. Most of our Corporate Partners use Agile methods at their workplace. Agile allows complex projects to be broken down into small manageable tasks that can be assigned to individuals or teams. Agile also has built-in processes that help to enable team communication and collaboration.

Many corporations utilize Agile in environments from software development to data science. While the specifics of each Agile practice may vary by corporation it is beneficial to understand the high-level architecture of the Agile practices and how they can be beneficial in a team development environment. Agile implementation specifics may differ by team. However, each team will be working toward the same goals focused on the breakdown and accomplishment of work tasks and the constant open collaboration between team members.

To become more familiar with Agile methodologies you will complete online training and interactive team training focused on Agile. You will also take a quiz on applying Agile to The Data Mine. Since The Data Mine Corporate Partners is a learning environment (and not your typical 8 AM - 5 PM workplace), we have modified some of the practice to best suit the student schedule.

The MS Teams Planner (or other Agile software) application will also be available to teams for task tracking. The Data Mine staff will provide resources on the use of MS Teams Planner and how it related to the Agile concepts in the materials above. The tool that the team utilizes for Agile task tracking can be determined on a project-by-project basis between the students and the Corporate Partner Mentor or TA.

Course Policies

This course permits you, the student to participate in a class project that has been sponsored by a third party other than the University. The University encourages and supports your participation in this practical learning experience. Although your course requirements may include a practical learning project, you are not required to participate in a project that is sponsored by an outside third party. Prior to your participation in a project sponsored by an outside third party, we would like you to carefully consider that your participation (i) may require you to assign your intellectual property (IP) rights to any intellectual property for which a student would retain ownership under the University’s Policy I.A.1 on Intellectual Property and/or (ii) may require you sign a non-disclosure (confidentiality) agreement with the sponsor. If you sign an agreement regarding intellectual property rights or a non-disclosure agreement, you may incur personal liability (with respect to a breach of a non- disclosure agreement) or you may lose economic benefits associated with your ownership of intellectual property (with respect to a license or assignment of intellectual property). You are encouraged to retain independent legal counsel for advice on these types of agreements. In addition, if you choose not to sign a non-disclosure or intellectual property rights agreement, you may be reassigned to a different project or you may not be able to participate in The Data Mine Corporate Partners.

Confidentiality of The Data Mine Corporate Partner Projects

It is important to note that you are working on real-world problems that your Corporate Partner is trying to solve. These projects weren’t created as busywork to keep you occupied for 9 months; you have the opportunity to make a real impact with your Corporate Partner. Past work from Data Mine students has been put into production code!

With that being said, the work you do and the data you have access to must be kept fully confidential! Nearly all Corporate Partner students will be required to sign an NDA and/or IP agreement with the company. Even if you do not have to sign an NDA for your project, please keep the project details private. While each NDA will have unique terms, some basics include:

  • Do not move or copy the data from the original storage. Never email data, text it to your teammates, copy it to MS Teams, or put it in Google drive (or any other cloud storage system). For example, if the data lives on Anvil, do not move it off Anvil and do not move it to a different folder. including your home directory.

  • Do not share any screenshots of the data or any findings (graphs, pictures, etc.) from the project with those who are not on your team.

  • You cannot share things you learn from the data with anyone who is not working on the project. This includes your roommate, your parents, and your best friend.

  • Do not disclose project specifics to anyone, including:

    • In an interview for an internship or job

    • On your LinkedIn profile

    • Your family/friends/roommate/boyfriend/girlfriend/professor

  • Do not discuss the details of projects when you are in a public space. You should find a private place to join the weekly online team meetings. Also, be careful working on the project in a public space when others could walk by and see your screen.

  • If you ever have questions about what you can talk about, always ask your Corporate Partner Mentor first. If you’re ever in doubt about what to share it’s often best to not share initially and check with your corporate partner. They can help clarify any confusion.

Guidance on Generative AI

Use of generative AI tools needs to be approved by your company mentor prior to being used in the project.

Work with your TA to check for approval and document it with The Data Mine.

As the world of machine learning, deep learning, and AI continues to evolve we wanted to offer some guidance on The Data Mine’s perspective for generative AI tools, such as ChatGPT.

New emergent technologies can be incredibly valuable tools. However, at the same time it’s important to keep perspective on how and when we utilize these new systems.

When using ChatGPT (or other generative AI) on a Data Mine project:

  • Never share a company’s code, data, information, or any other proprietary property with the tool.

    • While not all tools incorporate user input into their training, it’s a very common practice and can lead to breaches in the NDA agreements.

  • Always question the response that the tool provides.

    • It’s OK to ask different apps for suggestions on things like common algorithms or good starting points for problem solutions. However, it’s VITAL to understand factors like where the solutions fit, how they perform, and how to measure their performance.

    • It’s OK for a tool to recommend an algorithm for research. It’s unacceptable to assume that the algorithm is the only correct answer and to not be able to explain why it was chosen. (ChatGPT told me won’t be accepted.)

    • It’s also occasionally possible that the tool will make up an answer, and you don’t want to get stuck presenting false information.

  • If you’re ever unsure about if a tool can be used, ask your mentor and The Data Mine BEFORE you use it.

    • We want to use new tools and adapt to the new environments, but our number 1 priority is to provide a safe and secure data environment. We can’t do anything that puts that at risk.

  • When using generative AI for code it’s very important to understand the fundamental code’s functionality.

    • While generative AI can easily write if/else functions or for loops, if you don’t understand how they work you will have a much harder time when it comes to writing a novel or highly specific code function.

    • Generative AI is great to help with ideas, but shouldn’t be used with no thought.

As with any new technologies, the world of generative AI is changing quickly. We encourage open discussion and welcome any feedback to The Data Mine concerning these technologies.

Data Mine Approval Process

  1. The TA should reach out to the company project mentor and get written approval for the use of generative AI tools in the project.

  2. The approval email should then be forwarded to [email protected] for documentation.

    • The email subject line should read Generative AI Approval - <team name>. With the "team name" replace with your group’s name.

Attendance Policy

This course follows Purdue University Academic Regulations regarding class attendance, which states that students are expected to be present for every meeting of the classes in which they are enrolled. For the purposes of this course, being “present” means attending all face-to-face meetings and all online meetings, unless you are ill or need to be absent for reasons excused by University regulations: grief/bereavement, military service, jury duty, parenting leave or emergent medical care. Attendance will be taken at the beginning of each class and lateness will be noted.

Regardless if your absence is planned or unplanned, excused or unexcused, please notify your TA as soon as possible and work with them to catch up on missed information and work.

Dropped Absences

All students will get to drop one missed LAB (1 hr 50 min) and one missed LEC (50 min) per semester. The missed class will still show up on your sprint report when graded by your TA, but The Data Mine staff will add in the drops at the end of the semester.

Excused Absences

The Office of the Dean of Students is able to verify and provide notifications for absences that meet the criteria of the excused absence policies established by University Senate.

The University Senate recognizes the following as types of absences that must be excused:

  • Grief Absence Policy for Students

  • Jury Duty Policy for Students

  • Medical Excused Absence Policy for Students

  • Military Absence Policy for Students

  • Parenting Leave Policy for Students- Facilitated by the Office of Institutional Equity

Students needing an absence notification sent for one of the above-listed excused absence policies should complete the corresponding request form.

Unexcused Absences

What if the absence does not meet the criteria of one of the excused absence policies? (link)

Absences outside of those covered by the University’s excused class absence policies are at the discretion of the individual course instructors. Students should work with their instructors directly to discuss their absence and the opportunity to complete missed coursework. The Office of the Dean of Students cannot verify or provide notification for an absence outside of the excused class absence policies.

What should you do if it does not meet the criteria for an excused absence?

  1. Do not come to class if you are feeling ill, but DO email/message your TA immediately. They do not need details about your symptoms; simply let them know you are feeling ill and cannot come to class. If it is an emergency situation, please follow the University regulations on emergent medical care (see above).

  2. Unless it falls under the University excused absence regulations (see above), any work due should be submitted on time.

Most absences not excused by ODOS will not be excused by The Data Mine. However, if you believe you have an extenuating circumstance, please notify us at [email protected].

Class Behavior

You are expected to behave in a way that promotes a welcoming, inclusive, productive learning environment. You need to be prepared for your individual and group work each week, and you need to include everybody in your group in any discussions. Respond promptly to all communications and show up for any appointments that are scheduled. If your group is having trouble working well together, try hard to talk through the difficulties—this is an important skill to have for future professional experiences. If you are still having difficulties, ask The Data Mine staff to meet with your group. Visit the Student Code of Conduct page to understand expectations on “Net-etiquette,” dress-code, in-person meetings, meal etiquette, work expectations, networking expectations, written communication, and time management.

Adding The Data Mine to your Resume

Please see the Professional Development section to learn how to add The Data Mine to your resume.

Disclaimer

This syllabus is subject to change. Changes will be made by an announcement via email and the corresponding course content will be updated.