Getting data from the web: scraping
Overview
- Define HTML and CSS selectors
- Introduce the rvest package
- Demonstrate how to extract information from HTML pages
- Demonstrate how to extract tables and convert to data frames
- Practice scraping data
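The objectives above can be sketched in a few lines of R. This is a minimal illustration, not the class demo itself: it assumes rvest 1.0 or later (for html_elements() and html_element()), and the HTML snippet, its class names, and the example.com link are hypothetical, inlined so the code runs without a network connection.

```r
library(rvest)

# Hypothetical page, inlined as a string (an assumption for illustration;
# in practice read_html() would usually be given a URL)
page <- read_html('
  <html><body>
    <h1 class="title">Movie ratings</h1>
    <p class="review">Great film.</p>
    <p class="review">Too long.</p>
    <a id="source" href="https://example.com/data">Source</a>
    <table>
      <tr><th>title</th><th>rating</th></tr>
      <tr><td>Alpha</td><td>8.1</td></tr>
      <tr><td>Beta</td><td>6.4</td></tr>
    </table>
  </body></html>
')

# Class selector: ".review" matches every element with class="review"
reviews <- page %>% html_elements(".review") %>% html_text()

# ID selector: "#source" matches the element with id="source";
# html_attr() pulls out one attribute of that element
link <- page %>% html_element("#source") %>% html_attr("href")

# Element selector: "table" matches the first <table>;
# html_table() parses it, and as.data.frame() gives a plain data frame
ratings <- page %>% html_element("table") %>% html_table() %>% as.data.frame()

print(reviews)
print(link)
print(ratings)
```

The same selectors work on a live page once the string is replaced with a URL, although real pages usually need more specific selectors than the ones shown here.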
Before class
Class materials
- Web scraping
- rvest: load the library (library(rvest))
- demo("tripadvisor"): scraping a Trip Advisor page
- demo("united"): scraping a web page that requires a login
- Scraping IMDB
What you need to do after class
- Start homework 8
