Data Visualization

Code for quiz 9

  1. Load the R package we will use.
  1. Quiz questions

Question: e_charts-1

Create a bar chart that shows the average hours Americans spend on five activities by year. Use the timeline argument to create an animation that will animate through the years.

 spend_time <- read_csv("https://estanny.com/static/week8/spend_time.csv")

e_charts-1

Start with `spend_time`

-   THEN group_by `year`

-   THEN create an e_chart that assigns `activity` to the x-axis and will show activity by `year` (the variable that you grouped the data on)

-   THEN use `e_timeline_opts` to set autoPlay to TRUE

-   THEN use `e_bar` to represent the variable `avg_hours` with a bar chart

-   THEN use `e_title` to set the main title to 'Average hours Americans spend per day on each activity'

-   THEN remove the legend with `e_legend`
spend_time  %>%
  group_by(year) %>%
 e_charts(x = activity, timeline = TRUE) %>%
  e_timeline_opts(autoPlay = TRUE)  %>% 
  e_bar(serie = avg_hours)  %>% 
  e_title(text ='Average hours Americans spend per day on each activity') %>% 
  e_legend(show = FALSE)

Question: echarts-2

Create a line chart for the activities that American spend time on.

Start with spend_time

spend_time %>%
  mutate(year = paste(year, "12","31", sep = "-"))  %>% 
  mutate(year = lubridate::ymd(year))  %>% 
  group_by(activity) %>%
  e_charts(x  = year)  %>% 
  e_line(serie = avg_hours)  %>% 
  e_tooltip()  %>% 
  e_title(text = 'Average hours Americans spend per day on each activity')  %>% 
  e_legend(top = 40) 

Question - modify slide 82

ggplot(spend_time, aes(x = year, y = avg_hours, color = activity)) + 
geom_point() +
geom_mark_ellipse(aes(filter = activity == "leisure/sports",
description = "Americans spend on average more time each day on leisure/sports than the other activities"))

Question - tidyquant

Modify the tidyquant example in the video

Retrieve stock price for SEE QUIZ, ticker: SEE QUIZ, using tq_get

from 2019-08-01 to 2020-07-28 assign output to df

df  <- tq_get("FB", get = "stock.prices", 
          from = "2019-08-01", to = "2020-07-28")

Create a plot with the df data

ggplot(df,aes(x = date, y = close)) +
geom_line() +
geom_mark_ellipse(aes(
filter = date == "2020-01-08",
description = "The CDC issue its first public alert about the coronavirus"
), fill = "yellow") +
geom_mark_ellipse(aes(
filter = date == "2020-03-23",
description = "WHO describes pandemic as 'accelerating'"
), color = "red", ) +
labs(
title = "Facebook",
x = NULL,
y = "Closing price per share",
caption = "Source: https://en.wikipedia.org/wiki/Timeline_of_the_COVID-19_pandemic_in_the_United_States")

Save the previous plot to preview.png and add to the yaml chunk at the top

ggsave(filename = "preview.png", 
       path = here::here("_posts", "2021-04-19-data-visualization"))