TDM 10100: Project 13 — 2022

Motivation: This semester we took a deep dive into R and it’s packages. Lets take a second to pat ourselves on the back for surviving a long semester and review what we have learned!

Make sure to read about, and use the template found here, and the important information about projects submissions here.

Dataset(s)

The following questions will use the following dataset(s):

  • /anvil/projects/tdm/data/beer/beers.csv

Questions

ONE

Read in the dataset and into a data.frame called beer

  1. What is the file size, how many rows, columns and type of data?

  2. What is the average score for a stout? (consider a stout any named beer from the column name with the word stout in it)

  3. How many Pale Ale’s are on this list? (consider a stout any named beer from the column name with the word pale and ale in it)

Items to submit
  • Code used to solve this problem.

  • Output from running the code.

TWO

  1. Plot or Graph all the beers that are available in the summer and their ratings.

Items to submit
  • Code used to solve this problem.

  • Output from running the code.

THREE

  1. Create a plot of the average rating of beer by country.

Items to submit
  • Code used to solve this problem.

  • Output from running the code.

FOUR

  1. Do limited runs of beer have a greater median rating than all others? (consider limited to be any beer that has the word Limited in the availablity column)

  2. Use the unique function to investigate the availablity column. Why are there different labels that are technically the same?

Please make sure to double check that your submission is complete, and contains all of your code and output before submitting. If you are on a spotty internet connection, it is recommended to download your submission after submitting it to make sure what you think you submitted, was what you actually submitted.

In addition, please review our submission guidelines before submitting your project.