could anyone please explain to me the function of this code? and what it is intenteded to do?
airports %>%
semi_join(flights, c("faa" = "dest")) %>%
ggplot(aes(lon, lat))+
geom_point()
could anyone please explain to me the function of this code? and what it is intenteded to do?
airports %>%
semi_join(flights, c("faa" = "dest")) %>%
ggplot(aes(lon, lat))+
geom_point()
If you get confused by a piece of code, break it down in smaller steps and see what each line does. If you are not aware of certain functions, it's documentation is available at ?function_name (example - ?semi_join).
It seems that the datasets airports and flights are from package nycflights13. Load the libraries that are required for running the code.
library(dplyr)
library(nycflights13)
library(ggplot2)
semi_join keeps the common rows that match between the two datasets. The column to match on which the datasets are joined is faa in airports and dest in flights. If you don't know how join works take a smaller example and try this code on it. You may also take a look at How to join (merge) data frames (inner, outer, left, right) for a general explanation of different join functions.
airports %>% semi_join(flights, c("faa" = "dest"))
# faa name lat lon alt tz dst tzone
# <chr> <chr> <dbl> <dbl> <dbl> <dbl> <chr> <chr>
# 1 ABQ Albuquerque International Sunport 35.0 -107. 5355 -7 A America/Denver
# 2 ACK Nantucket Mem 41.3 -70.1 48 -5 A America/New_York
# 3 ALB Albany Intl 42.7 -73.8 285 -5 A America/New_York
# 4 ANC Ted Stevens Anchorage Intl 61.2 -150. 152 -9 A America/Anchorage
# 5 ATL Hartsfield Jackson Atlanta Intl 33.6 -84.4 1026 -5 A America/New_York
# 6 AUS Austin Bergstrom Intl 30.2 -97.7 542 -6 A America/Chicago
# 7 AVL Asheville Regional Airport 35.4 -82.5 2165 -5 A America/New_York
# 8 BDL Bradley Intl 41.9 -72.7 173 -5 A America/New_York
# 9 BGR Bangor Intl 44.8 -68.8 192 -5 A America/New_York
#10 BHM Birmingham Intl 33.6 -86.8 644 -6 A America/Chicago
# … with 91 more rows
After running the code, the above dataframe is plotted with ggplot2 library where x-axis is lon and y-axis is lat. It uses geom_point to plot as scatterplot.
airports %>%
semi_join(flights, c("faa" = "dest")) %>%
ggplot(aes(lon, lat))+
geom_point()