Homework 01 - Hello R

Load packages

library(tidyverse) 
library(datasauRus)

Exercise 1

Based on the help file, how many rows and how many columns does the datasaurus_dozen file have? What are the variables included in the data frame? Add your responses to your lab report. When you're done, commit your changes with the commit message "Added answer for Ex 1", and push.

Solution

(Type your answer to Exercise 1 here. This exercise does not require any R code.)

Exercise 2

Plot y vs. x for the dino dataset. Then, calculate the correlation coefficient between x and y for this dataset.

Solution

(The answers for this Exercise are given for you below. But you should clean up some of the narrative so that it only includes what you want to turn in.)

First let's plot the data in the dino dataset:

dino_data <- datasaurus_dozen %>%
  filter(dataset == "dino")

ggplot(data = dino_data, mapping = aes(x = x, y = y)) +
  geom_point()

And next calculate the correlation between x and y in this dataset:

dino_data %>%
  summarize(r = cor(x, y))

Exercise 3

Plot y vs. x for the star dataset. You can (and should) reuse code we introduced above, just replace the dataset name with the desired dataset. Then, calculate the correlation coefficient between x and y for this dataset. How does this value compare to the r of dino?

Solution

(Add code and narrative as needed. Note that the R chunks are labeled with plot-star and cor-star to provide spaces to place the code for plotting and calculating the correlation coefficient. To finish, clean up the narrative by removing these instructions.)

Blah blah blah...


I'm some text, you should replace me with more meaningful text...




Try the parsermd package in your browser

Any scripts or data that you put into this service are public.

parsermd documentation built on May 20, 2021, 5:08 p.m.