Web Scraping in R with rvest
Sometimes data we find on the internet isn’t formatted for downloading and easy importing into our statistical program of choice. It’s simply displayed on a static web page as a table (if we’re lucky) or scattered about the page in various locations. Getting this data requires “web scraping”: pulling out the specific parts of a web page that we want to keep and wrangling them into a structure suitable for further analysis. A recently developed R package called rvest makes this process easier. In this workshop we’ll introduce rvest by way of several examples of scraping web pages. We’ll also present a general strategy for web scraping and demonstrate some basic programming approaches to scraping multi-page web sites. Previous experience with R will be helpful.
- Date: Thursday, February 2, 2017
- Time: 10:00am - 11:30am
- Location: Brown Library 133 (Clark Hall)
- Campus: Brown Science & Engineering
- Categories: Data Workshop
- Presenter: Clay Ford
The UVA Library is committed to hosting library events that are inclusive and welcoming to all. If you need certain accommodations to participate fully in this event, please contact libevents@virginia.edu.